Skip to content

Question on implementation #4

@zachschillaci27

Description

@zachschillaci27

Thanks for sharing the paper and code!

I have a question regarding the technical implementation and the differences between Text2SQL and TAG. From the schematic in your paper below, you outline TAG with this three-step process:

Screenshot 2024-10-28 at 08 37 01

From what I can see here, this looks to me like a standard Text2SQL function calling pattern that one would implement for an OpenAI-like assistant with function (tool) calling. That is, you provide a Text2SQL function (tool) to the LLM that takes in the user's query and uses a separate LLM call (with information on the database schema provided in the system prompt) to generate an SQL query. The query is then run and the response is returned as a tool message and passed back to the orchestrator LLM. Running the orchestrator in an iterative function calling loop allows the assistant to apply additional semantic reasoning and execute multiple Text2SQL calls if needed. Once the complete response has been obtained, the LLM can provide a text-based response to the user.

Whenever I have implemented such a pattern for an LLM assistant I have referred to this process as simply Text2SQL - would this, however, fit your definition of TAG? Perhaps the distinction is more important for academic benchmarking cases, but I would appreciate your input here. Thanks!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions