feat(sdk): add baseten-hosted models to evals workflow by eyurtsev · Pull Request #1682 · langchain-ai/deepagents

Eugene Yurtsev (eyurtsev) · 2026-03-06T18:14:00Z

Adds GLM-5 (zai-org/GLM-5) and MiniMax-M2.5 (MiniMaxAI/MiniMax-M2.5) via Baseten's OpenAI-compatible inference endpoint as eval targets, mirroring the existing Ollama Cloud entries for these models. Uses ChatOpenAI with base_url=https://inference.baseten.co/v1 following the same pattern as the NVIDIA special case in conftest.py. Requires a BASETEN_API_KEY repo secret.

Created with Deep Agents CLI.

# Conflicts: # .github/scripts/get_eval_models.py # .github/workflows/evals.yml

Copilot

Pull request overview

Adds Baseten-hosted models as additional eval targets by routing baseten: model specs through Baseten’s OpenAI-compatible endpoint in the eval harness, and wiring the required secret + model entries into the GitHub Actions eval workflow.

Changes:

Add a baseten: model prefix handler in the eval model fixture that instantiates ChatOpenAI with base_url=https://inference.baseten.co/v1.
Extend the evals GitHub Actions workflow matrix/options and environment to include Baseten models and BASETEN_API_KEY.
Update the model-matrix generator script to include the Baseten models in both MODELS and SET1.

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated no comments.

File	Description
`libs/deepagents/tests/evals/conftest.py`	Adds Baseten-specific model initialization via `ChatOpenAI` using Baseten’s OpenAI-compatible `base_url`.
`.github/workflows/evals.yml`	Adds Baseten models to workflow input options/matrix and injects `BASETEN_API_KEY` into the eval job environment.
`.github/scripts/get_eval_models.py`	Registers Baseten models in the “all” and “set1” model selections used to build the workflow matrix.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

You can also share your feedback on Copilot code review. Take the survey.

feat(sdk): add baseten-hosted models to evals workflow

1c0daae

github-actions bot added github_actions PR touching `.github` deepagents Related to the `deepagents` SDK / agent harness internal User is a member of the `langchain-ai` GitHub organization feature New feature/enhancement or request for one labels Mar 6, 2026

Merge remote-tracking branch 'origin/main' into eugene/add-baseten-evals

cb9af34

# Conflicts: # .github/scripts/get_eval_models.py # .github/workflows/evals.yml

Eugene Yurtsev (eyurtsev) marked this pull request as ready for review March 6, 2026 21:33

Copilot AI review requested due to automatic review settings March 6, 2026 21:33

Copilot started reviewing on behalf of Eugene Yurtsev (eyurtsev) March 6, 2026 21:34 View session

Copilot AI reviewed Mar 6, 2026

View reviewed changes

Eugene Yurtsev (eyurtsev) merged commit 04dc92b into main Mar 6, 2026
41 checks passed

Eugene Yurtsev (eyurtsev) deleted the eugene/add-baseten-evals branch March 6, 2026 21:37

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(sdk): add baseten-hosted models to evals workflow#1682

feat(sdk): add baseten-hosted models to evals workflow#1682
Eugene Yurtsev (eyurtsev) merged 2 commits intomainfrom
eugene/add-baseten-evals

Eugene Yurtsev (eyurtsev) commented Mar 6, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Eugene Yurtsev (eyurtsev) commented Mar 6, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants