ToolFactory: Automating Tool Generation by Leveraging LLM to Understand REST API Documentations

Xinyi Ni; Qiuyang Wang; Yukun Zhang; Pengyu Hong

doi:10.48550/arxiv.2501.16945

Back

Preprint

ToolFactory: Automating Tool Generation by Leveraging LLM to Understand REST API Documentations

Xinyi Ni, Qiuyang Wang, Yukun Zhang and Pengyu Hong

01/28/2025

DOI: https://doi.org/10.48550/arxiv.2501.16945

Handle:

https://hdl.handle.net/10192/74173

Abstract

Computer Science - Artificial Intelligence

Computer Science - Computation and Language

Computer Science - Learning

Computer Science - Software Engineering

LLM-based tool agents offer natural language interfaces, enabling users to seamlessly interact with computing services. While REST APIs are valuable resources for building such agents, they must first be transformed into AI-compatible tools. Automatically generating AI-compatible tools from REST API documents can greatly streamline tool agent development and minimize user learning curves. However, API documentation often suffers from a lack of standardization, inconsistent schemas, and incomplete information. To address these issues, we developed \textbf{ToolFactory}, an open-source pipeline for automating tool generation from unstructured API documents. To enhance the reliability of the developed tools, we implemented an evaluation method to diagnose errors. Furthermore, we built a knowledge base of verified tools, which we leveraged to infer missing information from poorly documented APIs. We developed the API Extraction Benchmark, comprising 167 API documents and 744 endpoints in various formats, and designed a JSON schema to annotate them. This annotated dataset was utilized to train and validate ToolFactory. The experimental results highlight the effectiveness of ToolFactory. We also demonstrated ToolFactory by creating a domain-specific AI agent for glycomaterials research. ToolFactory exhibits significant potential for facilitating the seamless integration of scientific REST APIs into AI workflows.

Metrics

49 Record Views

Details

Title: ToolFactory: Automating Tool Generation by Leveraging LLM to Understand REST API Documentations
Creators: Xinyi Ni - Brandeis University
Qiuyang Wang - Brandeis University
Yukun Zhang - Brandeis University
Pengyu Hong - Brandeis University
Identifiers: 9924437865801921
Academic Unit: Michtom School of Computer Science; Benjamin and Mae Volen National Center for Complex Systems
Language: English
Resource Type: Preprint

ToolFactory: Automating Tool Generation by Leveraging LLM to Understand REST API Documentations

Abstract

Metrics

Details

Brandeis University Social media