[PR #188] [CLOSED] automating tool benchmarks #192

Closed
opened 2026-02-16 00:18:29 -05:00 by yindo · 0 comments
Owner

📋 Pull Request Information

Original PR: https://github.com/langchain-ai/langchain-benchmarks/pull/188
Author: @isahers1
Created: 7/3/2024
Status: Closed

Base: mainHead: isaac/toolbenchmarks


📝 Commits (10+)

📊 Changes

29 files changed (+1094 additions, -1356 deletions)

View changed files

.github/workflows/_benchmarks.yml (+30 -0)
📝 Makefile (+2 -0)
📝 archived/csv-qa/custom_agent.py (+1 -1)
📝 archived/csv-qa/pandas_agent_gpt_35.py (+1 -1)
📝 archived/csv-qa/pandas_agent_gpt_4.py (+1 -1)
📝 archived/csv-qa/pandas_ai.py (+1 -1)
📝 archived/csv-qa/streamlit_app.py (+1 -1)
📝 archived/extraction/streamlit_app.py (+1 -1)
📝 archived/langchain-docs-benchmarking/packages/openai-functions-agent/openai_functions_agent/agent.py (+1 -1)
📝 archived/langchain-docs-benchmarking/run_evals.py (+1 -1)
📝 docs/source/notebooks/extraction/chat_extraction.ipynb (+1 -1)
📝 docs/source/notebooks/extraction/email.ipynb (+1 -1)
📝 docs/source/notebooks/extraction/intro.ipynb (+1 -1)
📝 docs/source/notebooks/retrieval/comparing_techniques.ipynb (+2 -1)
📝 docs/source/notebooks/retrieval/multi_modal_benchmarking/multi_modal_eval.ipynb (+1 -1)
📝 docs/source/notebooks/retrieval/multi_modal_benchmarking/multi_modal_eval_baseline.ipynb (+1 -1)
📝 docs/source/notebooks/retrieval/semi_structured_benchmarking/semi_structured.ipynb (+1 -1)
📝 docs/source/notebooks/retrieval/semi_structured_benchmarking/ss_eval_multi_vector.ipynb (+1 -1)
📝 langchain_benchmarks/extraction/evaluators.py (+1 -1)
📝 langchain_benchmarks/extraction/implementations.py (+1 -1)

...and 9 more files

📄 Description

No description provided


🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.

## 📋 Pull Request Information **Original PR:** https://github.com/langchain-ai/langchain-benchmarks/pull/188 **Author:** [@isahers1](https://github.com/isahers1) **Created:** 7/3/2024 **Status:** ❌ Closed **Base:** `main` ← **Head:** `isaac/toolbenchmarks` --- ### 📝 Commits (10+) - [`3aae5da`](https://github.com/langchain-ai/langchain-benchmarks/commit/3aae5da7691d0a53760110f90d2a520fcb0e0251) first draft - [`ea10c07`](https://github.com/langchain-ai/langchain-benchmarks/commit/ea10c07358b369f384ea7e5890b332514ee559a8) second draft - [`a601b6d`](https://github.com/langchain-ai/langchain-benchmarks/commit/a601b6de9c5a7b0b3816fc936e5adb52ab3c812e) fmt - [`84cca0a`](https://github.com/langchain-ai/langchain-benchmarks/commit/84cca0a2ee0e8b4c2137773f17e173287f1715dc) fmt - [`9378a8d`](https://github.com/langchain-ai/langchain-benchmarks/commit/9378a8d9ed6d5808ff9dbe5f334385d472b6e60f) fmt - [`17c7794`](https://github.com/langchain-ai/langchain-benchmarks/commit/17c77941309507478ce6dba6a8efcd412c7e514e) semantic search working (i think) - [`47382f8`](https://github.com/langchain-ai/langchain-benchmarks/commit/47382f82d920ce088cce8f78d38ae3f326133f53) Merge branch 'isaac/toolbenchmarks' of https://github.com/langchain-ai/langchain-benchmarks into isaac/toolbenchmarks - [`ab16d66`](https://github.com/langchain-ai/langchain-benchmarks/commit/ab16d66903bd251159824768320d2ea868ca43f2) fmt - [`94506e4`](https://github.com/langchain-ai/langchain-benchmarks/commit/94506e472698f34864d024e9533f3c9747ef4be7) fmt - [`b10c099`](https://github.com/langchain-ai/langchain-benchmarks/commit/b10c099c7aebdca466b79a93bfbb5e8b3d6e0f34) fmt ### 📊 Changes **29 files changed** (+1094 additions, -1356 deletions) <details> <summary>View changed files</summary> ➕ `.github/workflows/_benchmarks.yml` (+30 -0) 📝 `Makefile` (+2 -0) 📝 `archived/csv-qa/custom_agent.py` (+1 -1) 📝 `archived/csv-qa/pandas_agent_gpt_35.py` (+1 -1) 📝 `archived/csv-qa/pandas_agent_gpt_4.py` (+1 -1) 📝 `archived/csv-qa/pandas_ai.py` (+1 -1) 📝 `archived/csv-qa/streamlit_app.py` (+1 -1) 📝 `archived/extraction/streamlit_app.py` (+1 -1) 📝 `archived/langchain-docs-benchmarking/packages/openai-functions-agent/openai_functions_agent/agent.py` (+1 -1) 📝 `archived/langchain-docs-benchmarking/run_evals.py` (+1 -1) 📝 `docs/source/notebooks/extraction/chat_extraction.ipynb` (+1 -1) 📝 `docs/source/notebooks/extraction/email.ipynb` (+1 -1) 📝 `docs/source/notebooks/extraction/intro.ipynb` (+1 -1) 📝 `docs/source/notebooks/retrieval/comparing_techniques.ipynb` (+2 -1) 📝 `docs/source/notebooks/retrieval/multi_modal_benchmarking/multi_modal_eval.ipynb` (+1 -1) 📝 `docs/source/notebooks/retrieval/multi_modal_benchmarking/multi_modal_eval_baseline.ipynb` (+1 -1) 📝 `docs/source/notebooks/retrieval/semi_structured_benchmarking/semi_structured.ipynb` (+1 -1) 📝 `docs/source/notebooks/retrieval/semi_structured_benchmarking/ss_eval_multi_vector.ipynb` (+1 -1) 📝 `langchain_benchmarks/extraction/evaluators.py` (+1 -1) 📝 `langchain_benchmarks/extraction/implementations.py` (+1 -1) _...and 9 more files_ </details> ### 📄 Description _No description provided_ --- <sub>🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.</sub>
yindo added the pull-request label 2026-02-16 00:18:29 -05:00
yindo closed this issue 2026-02-16 00:18:29 -05:00
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: langchain-ai/langchain-benchmarks#192