[PR #154] [MERGED] Change multiverse math to multiverse math (tiny) and add another multiverse math set #162

Closed
opened 2026-02-16 00:18:23 -05:00 by yindo · 0 comments

📋 Pull Request Information

Original PR: https://github.com/langchain-ai/langchain-benchmarks/pull/154
Author: @eyurtsev
Created: 12/19/2023
Status: Merged
Merged: 12/19/2023
Merged by: @eyurtsev

Base: main ← Head: eugene/expand_mathset


📝 Commits (1)

  • 84cbc6a (https://github.com/langchain-ai/langchain-benchmarks/commit/84cbc6aad88defef42f2bfb6a394932bb4d7b1ab): x

📊 Changes

2 files changed (+65 additions, -10 deletions)

📝 langchain_benchmarks/registration.py (+1 -1)
📝 langchain_benchmarks/tool_usage/tasks/multiverse_math.py (+64 -9)

📄 Description

  • This PR adds a multiverse math set consisting of 20 questions.
  • The question about rounding has been removed to simplify evaluation.

🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.
yindo added the pull-request label 2026-02-16 00:18:23 -05:00
yindo closed this issue 2026-02-16 00:18:23 -05:00
Reference: langchain-ai/langchain-benchmarks#162