[PR #188] [CLOSED] automating tool benchmarks #192

New Issue

2026-02-16T00:18:29-05:00

yindo commented

2026-02-16 00:18:29 -05:00

📋 Pull Request Information

Original PR: https://github.com/langchain-ai/langchain-benchmarks/pull/188
Author: @isahers1
Created: 7/3/2024
Status: ❌ Closed

Base: main ← Head: isaac/toolbenchmarks

📝 Commits (10+)

3aae5da first draft
ea10c07 second draft
a601b6d fmt
84cca0a fmt
9378a8d fmt
17c7794 semantic search working (i think)
47382f8 Merge branch 'isaac/toolbenchmarks' of https://github.com/langchain-ai/langchain-benchmarks into isaac/toolbenchmarks
ab16d66 fmt
94506e4 fmt
b10c099 fmt

📊 Changes

29 files changed (+1094 additions, -1356 deletions)

View changed files

➕ .github/workflows/_benchmarks.yml (+30 -0)
📝 Makefile (+2 -0)
📝 archived/csv-qa/custom_agent.py (+1 -1)
📝 archived/csv-qa/pandas_agent_gpt_35.py (+1 -1)
📝 archived/csv-qa/pandas_agent_gpt_4.py (+1 -1)
📝 archived/csv-qa/pandas_ai.py (+1 -1)
📝 archived/csv-qa/streamlit_app.py (+1 -1)
📝 archived/extraction/streamlit_app.py (+1 -1)
📝 archived/langchain-docs-benchmarking/packages/openai-functions-agent/openai_functions_agent/agent.py (+1 -1)
📝 archived/langchain-docs-benchmarking/run_evals.py (+1 -1)
📝 docs/source/notebooks/extraction/chat_extraction.ipynb (+1 -1)
📝 docs/source/notebooks/extraction/email.ipynb (+1 -1)
📝 docs/source/notebooks/extraction/intro.ipynb (+1 -1)
📝 docs/source/notebooks/retrieval/comparing_techniques.ipynb (+2 -1)
📝 docs/source/notebooks/retrieval/multi_modal_benchmarking/multi_modal_eval.ipynb (+1 -1)
📝 docs/source/notebooks/retrieval/multi_modal_benchmarking/multi_modal_eval_baseline.ipynb (+1 -1)
📝 docs/source/notebooks/retrieval/semi_structured_benchmarking/semi_structured.ipynb (+1 -1)
📝 docs/source/notebooks/retrieval/semi_structured_benchmarking/ss_eval_multi_vector.ipynb (+1 -1)
📝 langchain_benchmarks/extraction/evaluators.py (+1 -1)
📝 langchain_benchmarks/extraction/implementations.py (+1 -1)

...and 9 more files

📄 Description

No description provided

_{🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.}

## 📋 Pull Request Information **Original PR:** https://github.com/langchain-ai/langchain-benchmarks/pull/188 **Author:** [@isahers1](https://github.com/isahers1) **Created:** 7/3/2024 **Status:** ❌ Closed **Base:** `main` ← **Head:** `isaac/toolbenchmarks` --- ### 📝 Commits (10+) - [`3aae5da`](https://github.com/langchain-ai/langchain-benchmarks/commit/3aae5da7691d0a53760110f90d2a520fcb0e0251) first draft - [`ea10c07`](https://github.com/langchain-ai/langchain-benchmarks/commit/ea10c07358b369f384ea7e5890b332514ee559a8) second draft - [`a601b6d`](https://github.com/langchain-ai/langchain-benchmarks/commit/a601b6de9c5a7b0b3816fc936e5adb52ab3c812e) fmt - [`84cca0a`](https://github.com/langchain-ai/langchain-benchmarks/commit/84cca0a2ee0e8b4c2137773f17e173287f1715dc) fmt - [`9378a8d`](https://github.com/langchain-ai/langchain-benchmarks/commit/9378a8d9ed6d5808ff9dbe5f334385d472b6e60f) fmt - [`17c7794`](https://github.com/langchain-ai/langchain-benchmarks/commit/17c77941309507478ce6dba6a8efcd412c7e514e) semantic search working (i think) - [`47382f8`](https://github.com/langchain-ai/langchain-benchmarks/commit/47382f82d920ce088cce8f78d38ae3f326133f53) Merge branch 'isaac/toolbenchmarks' of https://github.com/langchain-ai/langchain-benchmarks into isaac/toolbenchmarks - [`ab16d66`](https://github.com/langchain-ai/langchain-benchmarks/commit/ab16d66903bd251159824768320d2ea868ca43f2) fmt - [`94506e4`](https://github.com/langchain-ai/langchain-benchmarks/commit/94506e472698f34864d024e9533f3c9747ef4be7) fmt - [`b10c099`](https://github.com/langchain-ai/langchain-benchmarks/commit/b10c099c7aebdca466b79a93bfbb5e8b3d6e0f34) fmt ### 📊 Changes **29 files changed** (+1094 additions, -1356 deletions) <details> <summary>View changed files</summary> ➕ `.github/workflows/_benchmarks.yml` (+30 -0) 📝 `Makefile` (+2 -0) 📝 `archived/csv-qa/custom_agent.py` (+1 -1) 📝 `archived/csv-qa/pandas_agent_gpt_35.py` (+1 -1) 📝 `archived/csv-qa/pandas_agent_gpt_4.py` (+1 -1) 📝 `archived/csv-qa/pandas_ai.py` (+1 -1) 📝 `archived/csv-qa/streamlit_app.py` (+1 -1) 📝 `archived/extraction/streamlit_app.py` (+1 -1) 📝 `archived/langchain-docs-benchmarking/packages/openai-functions-agent/openai_functions_agent/agent.py` (+1 -1) 📝 `archived/langchain-docs-benchmarking/run_evals.py` (+1 -1) 📝 `docs/source/notebooks/extraction/chat_extraction.ipynb` (+1 -1) 📝 `docs/source/notebooks/extraction/email.ipynb` (+1 -1) 📝 `docs/source/notebooks/extraction/intro.ipynb` (+1 -1) 📝 `docs/source/notebooks/retrieval/comparing_techniques.ipynb` (+2 -1) 📝 `docs/source/notebooks/retrieval/multi_modal_benchmarking/multi_modal_eval.ipynb` (+1 -1) 📝 `docs/source/notebooks/retrieval/multi_modal_benchmarking/multi_modal_eval_baseline.ipynb` (+1 -1) 📝 `docs/source/notebooks/retrieval/semi_structured_benchmarking/semi_structured.ipynb` (+1 -1) 📝 `docs/source/notebooks/retrieval/semi_structured_benchmarking/ss_eval_multi_vector.ipynb` (+1 -1) 📝 `langchain_benchmarks/extraction/evaluators.py` (+1 -1) 📝 `langchain_benchmarks/extraction/implementations.py` (+1 -1) _...and 9 more files_ </details> ### 📄 Description _No description provided_ --- <sub>🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.</sub>

yindo added the pull-request label 2026-02-16 00:18:29 -05:00

yindo closed this issue

2026-02-16 00:18:29 -05:00

yindo referenced this issue

2026-02-16 00:18:30 -05:00

[PR #192] [MERGED] Release 0.0.13 #197

Sign in to join this conversation.

1 Participants

Notifications

Due Date

No due date set.

Dependencies

No dependencies set.

Reference: langchain-ai/langchain-benchmarks#192