fix(tools): Tool name should align with what llm knows by Danelegend · Pull Request #7352 · onyx-dot-app/onyx

Danelegend · 2026-01-12T01:42:43Z

Description

When reconstructing the chat history for the llm, the provided tool name is pulled from the database. Meanwhile, the llm is told the tool name from from a prompt. Currently there is a misalignment between the tool name in the database and the tool name in the prompt.

This migration ensures that tool.name is aligned with what we expect the llm to see. Although this doesn't solve the fundamental problem of needing to align database + prompt name, it solves the immediate problem. A larger followup PR can come to refactor this logic.

How Has This Been Tested?

Run migration and ensure that the database presents us with correct tools.
Validate on braintrust that tool history is consistent with the action that it took.

Additional Options

closes https://linear.app/onyx-app/issue/ENG-3360/tool-inconsistency

[Optional] Override Linear Check

Summary by cubic

Aligns tool names in the database with the names the LLM is instructed to use, fixing mismatched tool calls and history reconstruction. Addresses ENG-3360 (tool inconsistency).

Migration
- Adds Alembic migration to set tool.name to expected LLM-visible names (e.g., internal_search, web_search, research_agent); downgrade restores class-name values.
- Clarifies Tool.name is the LLM-facing name in the model.
Code Updates
- Deep Research now looks up the Research Agent by LLM name (research_agent) and compares using in_code_tool_id (ResearchAgent).
- Replaces RESEARCH_AGENT_DB_NAME with RESEARCH_AGENT_IN_CODE_ID where appropriate and simplifies seed migration usage.

^{Written for commit e38f2c2. Summary will update on new commits.}

Danelegend · 2026-01-12T01:43:29Z

backend/alembic/versions/c1d2e3f4a5b6_add_deep_research_tool.py



 DEEP_RESEARCH_TOOL = {
-    "name": RESEARCH_AGENT_DB_NAME,


Not good to leave this as variable since if code changes (as it does here), then the migration changes. Hardcoding this value.

cubic-dev-ai

No issues found across 6 files

greptile-apps · 2026-01-12T01:49:25Z

Greptile Overview

Greptile Summary

What This PR Fixes

This PR addresses a critical tool name inconsistency between what the LLM is instructed to use and what's stored in the database.

The Problem:

Tool implementation classes define NAME constants (e.g., SearchTool.NAME = "internal_search")
The LLM is told to use these names when making tool calls
But the database stored different names (e.g., name="SearchTool" - the class name)
When reconstructing chat history, the code builds tool_id_to_name_map = {tool.id: tool.name} and sends this to the LLM as the function_name in tool call history
Result: LLM sees mismatched tool names in history (e.g., "SearchTool") vs what it knows (e.g., "internal_search"), causing confusion

The Solution:

Migration d25168c2beee: Updates all built-in tool name fields to match their NAME constants:
- SearchTool: "SearchTool" → "internal_search"
- WebSearchTool: "WebSearchTool" → "web_search"
- ImageGenerationTool: "ImageGenerationTool" → "generate_image"
- PythonTool: "PythonTool" → "python"
- OpenURLTool: "OpenURLTool" → "open_url"
- KnowledgeGraphTool: "KnowledgeGraphTool" → "run_kg_search"
- ResearchAgent: "ResearchAgent" → "research_agent"
Code Refactoring: Renames RESEARCH_AGENT_DB_NAME → RESEARCH_AGENT_IN_CODE_ID for clarity, and updates dr_loop.py to use RESEARCH_AGENT_TOOL_NAME (the LLM-facing name) when calling get_tool_by_name().
Migration Cleanup: Removes import dependency in c1d2e3f4a5b6 by hardcoding "ResearchAgent".
Documentation: Adds comment to Tool.name field clarifying it's the LLM-facing name.

Verification

I verified all NAME constants in tool implementations match the migration mappings:

✅ All 7 tools have correct LLM-facing names
✅ No remaining usages of old RESEARCH_AGENT_DB_NAME constant
✅ All comparisons now use in_code_tool_id instead of name where appropriate
✅ Migration is idempotent (UPDATE statements are safe if tools don't exist)

Minor Issue Found

One style comment on the downgrade migration comment wording - purely cosmetic, doesn't affect functionality.

Confidence Score: 4/5

This PR is safe to merge with one minor style suggestion that can be addressed post-merge
Score of 4 reflects a well-executed fix for a critical bug. The migration correctly aligns tool names, all code changes are consistent, and I verified the NAME constants match. The only reason it's not a 5 is the minor comment clarity issue in the downgrade function, which is purely cosmetic and doesn't affect functionality. All tool name mappings are correct, constant renaming is consistent throughout, and the migration handles edge cases properly (UPDATE is safe even if tools don't exist).
backend/alembic/versions/d25168c2beee_tool_name_consistency.py - minor comment clarity issue, but functionally correct

Important Files Changed

File Analysis

Filename	Score	Overview
backend/alembic/versions/d25168c2beee_tool_name_consistency.py	4/5	New migration aligns tool.name with LLM-facing names; maps are correct but downgrade logic has minor issue
backend/alembic/versions/c1d2e3f4a5b6_add_deep_research_tool.py	5/5	Removes import dependency, hardcodes "ResearchAgent" as intended
backend/onyx/deep_research/dr_loop.py	5/5	Updates get_tool_by_name to use RESEARCH_AGENT_TOOL_NAME (LLM name) correctly

Sequence Diagram

sequenceDiagram
    participant LLM as LLM
    participant ToolImpl as Tool Implementation<br/>(e.g., SearchTool.NAME)
    participant DB as Database<br/>(tool.name)
    participant ChatUtils as chat_utils.py<br/>(tool_id_to_name_map)
    participant History as Message History
    
    Note over ToolImpl: NAME = "internal_search"
    Note over DB: BEFORE: name = "SearchTool"<br/>AFTER: name = "internal_search"
    
    rect rgb(240, 240, 255)
        Note over DB,ChatUtils: Tool History Reconstruction Flow
        ChatUtils->>DB: get_tools(db_session)
        DB-->>ChatUtils: List of tools
        ChatUtils->>ChatUtils: Build tool_id_to_name_map<br/>{tool.id: tool.name}
        ChatUtils->>History: Reconstruct tool calls<br/>using tool_name from map
        History->>LLM: Send tool call history<br/>{"function_name": tool.name}
    end
    
    rect rgb(255, 240, 240)
        Note over LLM,ToolImpl: Before Migration (BROKEN)
        LLM->>History: Sees "SearchTool" in history
        Note over LLM: Confused! I only know<br/>"internal_search"
    end
    
    rect rgb(240, 255, 240)
        Note over LLM,ToolImpl: After Migration (FIXED)
        LLM->>History: Sees "internal_search" in history
        Note over LLM: Perfect! Matches what I know
    end

greptile-apps

_{4 files reviewed, 4 comments}

_{Edit Code Review Agent Settings | Greptile}

backend/alembic/versions/19a896a66d3f_tool_name_consistency.py

greptile-apps · 2026-01-12T01:49:32Z

backend/alembic/versions/c1d2e3f4a5b6_add_deep_research_tool.py

 DEEP_RESEARCH_TOOL = {
-    "name": RESEARCH_AGENT_DB_NAME,
+    "name": "ResearchAgent",


There's an inconsistency between this migration and the new 19a896a66d3f migration:

This migration (potentially already run in production) inserts the tool with name: "ResearchAgent"

The new migration 19a896a66d3f then changes it to "research_agent"

But this hardcoded value will persist if someone re-runs or reviews this migration

While the hardcoding is fine (removes the import dependency), the value should match what the final state will be after 19a896a66d3f runs. Consider changing this to:

Suggested change

DEEP_RESEARCH_TOOL = {

"name": RESEARCH_AGENT_DB_NAME,

"name": "ResearchAgent",

"name": "research_agent",

However, be aware that modifying an already-run migration can cause issues. If this migration has already run in production, you may want to leave it as-is and let 19a896a66d3f handle the update. But if databases exist where this hasn't run yet, they'll get the wrong initial value.

Prompt To Fix With AI

This is a comment left during a code review. Path: backend/alembic/versions/c1d2e3f4a5b6_add_deep_research_tool.py Line: 20:21 Comment: There's an inconsistency between this migration and the new `19a896a66d3f` migration: 1. This migration (potentially already run in production) inserts the tool with `name: "ResearchAgent"` 2. The new migration `19a896a66d3f` then changes it to `"research_agent"` 3. But this hardcoded value will persist if someone re-runs or reviews this migration While the hardcoding is fine (removes the import dependency), the value should match what the final state will be after `19a896a66d3f` runs. Consider changing this to: ```suggestion "name": "research_agent", ``` However, be aware that modifying an already-run migration can cause issues. If this migration has already run in production, you may want to leave it as-is and let `19a896a66d3f` handle the update. But if databases exist where this hasn't run yet, they'll get the wrong initial value. How can I resolve this? If you propose a fix, please make it concise.

No, we want to keep it consistent with what the name was at that point in time.

cubic-dev-ai

6 issues found across 80 files (changes from recent commits).

Note: This PR contains a large number of files. cubic only reviews up to 75 files per PR, so some files may not have been reviewed.

Prompt for AI agents (all issues)


Check if these issues are valid — if so, understand the root cause of each and fix them.


<file name="backend/onyx/db/release_notes.py">

<violation number="1">
P1: URL fragment should come after query parameters. The current construction `#{version_anchor}?{urlencode(utm_params)}` places the fragment before the query string, which means the UTM parameters will be treated as part of the fragment and never sent to the server. This breaks all UTM tracking/analytics.</violation>
</file>

<file name="backend/onyx/server/features/notifications/api.py">

<violation number="1">
P1: Docstring says "Get all undismissed notifications" but the code now uses `include_dismissed=True`, which returns dismissed notifications as well. Either the docstring needs to be updated to match the new behavior, or this change is incorrect.</violation>
</file>

<file name="web/src/layouts/actions-layouts.tsx">

<violation number="1">
P2: The `Section` component defaults to `alignItems="center"`, but the original `div` had `align-items: stretch` (CSS default). This may cause children without explicit widths to be centered instead of stretched. Consider adding `alignItems="stretch"` to preserve the original behavior.</violation>
</file>

<file name="backend/alembic/versions/8405ca81cc83_notifications_constraint.py">

<violation number="1">
P2: Comment/code mismatch: Comment mentions cleaning up 'reindex' notifications but the DELETE filters on `title = 'New Notification'`. Please update either the comment to match the actual deletion criteria, or fix the WHERE clause if 'reindex' notifications have a different title.</violation>
</file>

<file name="backend/onyx/server/query_and_chat/session_loading.py">

<violation number="1">
P0: `RESEARCH_AGENT_DB_NAME` is not imported or defined anywhere in the codebase. This will cause a `NameError` at runtime. Based on the existing import and usage on line 460, this should be `RESEARCH_AGENT_IN_CODE_ID`.</violation>
</file>

<file name="web/src/components/dateRangeSelectors/AdminDateRangeSelector.tsx">

<violation number="1">
P2: Restoring the `className="w-auto p-0"` on the new `Popover.Content` is necessary to match the previous calendar layout; otherwise the refresh Popover defaults add padding and change width, causing the calendar/preset list to be misaligned.</violation>
</file>

_{Reply with feedback, questions, or to request a fix. Tag @cubic-dev-ai to re-run a review.}

backend/onyx/server/query_and_chat/session_loading.py

evan-onyx

how does get_built_in_tool_by_id work now? I'm surprised that tool calling with internal tools still works after the migration

evan-onyx · 2026-01-12T19:16:06Z

backend/alembic/versions/d25168c2beee_tool_name_consistency.py

+
+# Mapping of in_code_tool_id to the NAME constant from each tool class
+# These are the currently seeded tool names
+CURRENT_TOOL_NAME_MAPPING = {


the keys here look wrong, should be i.e. internal_search

This is a mapping in_code_id -> name for how it currently looks (for the downgrade).

Basically this is just how it currently looks now, so that we can downgrade appropriately

Danelegend · 2026-01-12T19:59:13Z

how does get_built_in_tool_by_id work now? I'm surprised that tool calling with internal tools still works after the migration

get_built_in_tool_by_id takes in in_code_tool_id. This hasn't been changed. We're only updating the tool.name value

Danelegend · 2026-01-12T19:59:56Z

@greptile

greptile-apps

_{1 file reviewed, 1 comment}

_{Edit Code Review Agent Settings | Greptile}

greptile-apps · 2026-01-12T20:05:49Z

backend/alembic/versions/d25168c2beee_tool_name_consistency.py

+    # Reverse the migration by setting name back to in_code_tool_id
+    # This matches the original pattern where name was the class name


The downgrade comment says "setting name back to in_code_tool_id" but the code actually sets it to current_name from CURRENT_TOOL_NAME_MAPPING, which happens to equal the in_code_tool_id. This is correct behavior, but the comment is slightly misleading.

Consider clarifying the comment to be more explicit:

Suggested change

# Reverse the migration by setting name back to in_code_tool_id

# This matches the original pattern where name was the class name

# Reverse the migration by setting name back to the class name (which matches in_code_tool_id)

# This restores the original pattern where tool.name was the same as the tool class name

_{Note: If this suggestion doesn't match your team's coding style, reply to this and let me know. I'll remember it for next time!}

Prompt To Fix With AI

This is a comment left during a code review. Path: backend/alembic/versions/d25168c2beee_tool_name_consistency.py Line: 72:73 Comment: The downgrade comment says "setting name back to in_code_tool_id" but the code actually sets it to `current_name` from `CURRENT_TOOL_NAME_MAPPING`, which happens to equal the in_code_tool_id. This is correct behavior, but the comment is slightly misleading. Consider clarifying the comment to be more explicit: ```suggestion # Reverse the migration by setting name back to the class name (which matches in_code_tool_id) # This restores the original pattern where tool.name was the same as the tool class name ``` <sub>Note: If this suggestion doesn't match your team's coding style, reply to this and let me know. I'll remember it for next time!</sub> How can I resolve this? If you propose a fix, please make it concise.

cubic-dev-ai

3 issues found across 3 files (changes from recent commits).

Prompt for AI agents (all issues)


Check if these issues are valid — if so, understand the root cause of each and fix them.


<file name="backend/tests/integration/tests/migrations/test_tool_seeding.py">

<violation number="1" location="backend/tests/integration/tests/migrations/test_tool_seeding.py:100">
P2: The `in_code_tool_id` field is defined in `ToolSeedingExpectedResult` and populated in `EXPECTED_TOOLS` but is never validated in `validate_tool`. Consider adding an assertion to verify `tool[4] == expected.in_code_tool_id`.</violation>
</file>

<file name="web/tests/e2e/utils/tools.ts">

<violation number="1" location="web/tests/e2e/utils/tools.ts:9">
P2: `searchOption` selector uses a misspelled tool name (`intenral_search`), so tests can no longer find the search tool option.</violation>
</file>

<file name="backend/tests/integration/tests/personas/test_unified_assistant.py">

<violation number="1" location="backend/tests/integration/tests/personas/test_unified_assistant.py:41">
P3: Align the assertion failure message with the tool name actually being validated so test failures mention the correct tool.</violation>
</file>

_{Reply with feedback, questions, or to request a fix. Tag @cubic-dev-ai to re-run a review.}

backend/tests/integration/tests/migrations/test_tool_seeding.py

web/tests/e2e/utils/tools.ts

backend/tests/integration/tests/personas/test_unified_assistant.py

cubic-dev-ai

1 issue found across 16 files (changes from recent commits).

Prompt for AI agents (all issues)


Check if these issues are valid — if so, understand the root cause of each and fix them.


<file name="backend/onyx/tools/tool_implementations/web_search/web_search_tool.py">

<violation number="1">
P2: Dead code: This `if not all_search_results:` branch is unreachable because the same condition already raises a `ToolCallException` earlier in the method (around line 248). If `all_search_results` is empty, the function will have already thrown an exception before reaching this code.

Either remove this dead branch, or if the intent is to handle empty results gracefully without throwing, remove the earlier exception.</violation>
</file>

_{Reply with feedback, questions, or to request a fix. Tag @cubic-dev-ai to re-run a review.}

Danelegend added 2 commits January 11, 2026 17:36

nit

edf9e68

Add comment

318a66a

Danelegend requested a review from a team as a code owner January 12, 2026 01:42

Danelegend commented Jan 12, 2026

View reviewed changes

cubic-dev-ai bot reviewed Jan 12, 2026

View reviewed changes

greptile-apps bot reviewed Jan 12, 2026

View reviewed changes

Danelegend and others added 7 commits January 11, 2026 17:55

Change this

c57ee34

Delete backend/alembic/versions/19a896a66d3f_tool_name_consistency.py

9b47267

remove try-except

08ac2f2

fix revision id

3571a8d

update revision

ca54a85

Merge branch 'main' into tool_name_migration

a688f4e

make up to date

75ac128

cubic-dev-ai bot reviewed Jan 12, 2026

View reviewed changes

backend/onyx/server/query_and_chat/session_loading.py Show resolved Hide resolved

.

2af27ba

evan-onyx reviewed Jan 12, 2026

View reviewed changes

greptile-apps bot reviewed Jan 12, 2026

View reviewed changes

evan-onyx approved these changes Jan 12, 2026

View reviewed changes

Danelegend added 2 commits January 12, 2026 13:09

change to list

6b7d560

nit

d72fd84

cubic-dev-ai bot reviewed Jan 12, 2026

View reviewed changes

backend/tests/integration/tests/migrations/test_tool_seeding.py Show resolved Hide resolved

web/tests/e2e/utils/tools.ts Outdated Show resolved Hide resolved

backend/tests/integration/tests/personas/test_unified_assistant.py Show resolved Hide resolved

Danelegend added 2 commits January 12, 2026 13:49

Merge branch 'main' into tool_name_migration

3725e34

fix test issues

2c881c8

cubic-dev-ai bot reviewed Jan 12, 2026

View reviewed changes

Danelegend and others added 2 commits January 12, 2026 15:51

Merge branch 'main' into tool_name_migration

e02c4c1

nit

e38f2c2

Danelegend added this pull request to the merge queue Jan 13, 2026

Merged via the queue into main with commit 58a943f Jan 13, 2026
74 checks passed

Danelegend deleted the tool_name_migration branch January 13, 2026 01:09

jessicasingh7 pushed a commit that referenced this pull request Jan 21, 2026

fix(tools): Tool name should align with what llm knows (#7352)

dbcce01

		# Reverse the migration by setting name back to in_code_tool_id
		# This matches the original pattern where name was the class name

Conversation

Danelegend commented Jan 12, 2026 • edited by cubic-dev-ai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

How Has This Been Tested?

Additional Options

Summary by cubic

Uh oh!

Danelegend Jan 12, 2026

Choose a reason for hiding this comment

Uh oh!

cubic-dev-ai bot left a comment

Choose a reason for hiding this comment

Uh oh!

greptile-apps bot commented Jan 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Greptile Overview

Greptile Summary

What This PR Fixes

Verification

Minor Issue Found

Confidence Score: 4/5

Important Files Changed

Sequence Diagram

Uh oh!

greptile-apps bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

greptile-apps bot Jan 12, 2026

Choose a reason for hiding this comment

Uh oh!

Danelegend Jan 12, 2026

Choose a reason for hiding this comment

Uh oh!

cubic-dev-ai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

evan-onyx left a comment

Choose a reason for hiding this comment

Uh oh!

evan-onyx Jan 12, 2026

Choose a reason for hiding this comment

Uh oh!

Danelegend Jan 12, 2026

Choose a reason for hiding this comment

Uh oh!

Danelegend commented Jan 12, 2026

Uh oh!

Danelegend commented Jan 12, 2026

Uh oh!

greptile-apps bot left a comment

Choose a reason for hiding this comment

Uh oh!

greptile-apps bot Jan 12, 2026

Choose a reason for hiding this comment

Uh oh!

cubic-dev-ai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

cubic-dev-ai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Danelegend commented Jan 12, 2026 •

edited by cubic-dev-ai bot

Loading

greptile-apps bot commented Jan 12, 2026 •

edited

Loading