Two routing strategies, score-based and classification-based, are presented in the paper. These strategies are designed for easy training and efficient inference, and they achieve accuracy comparable to the most capable LLM while reducing costs. The authors highlight a practical and explainable accuracy-cost trade-off using the BIRD dataset in their experiments.
Key takeaways:
- The paper introduces the first large language model (LLM) routing approach for Text-to-SQL, which selects the most cost-effective LLM for each query.
- Two routing strategies are presented, score- and classification-based, that achieve accuracy comparable to the most capable LLM while reducing costs.
- The routers are designed for ease of training and efficient inference.
- The experiments highlight a practical and explainable accuracy-cost trade-off on the BIRD dataset.