Truncate long chat sessions #190986
Replies: 2 comments
💬 Your Product Feedback Has Been Submitted 🎉

Thank you for taking the time to share your insights with us! Your feedback is invaluable as we build a better GitHub experience for all our users. Here's what you can expect moving forward ⏩

**Where to look to see what's shipping 👀**

**What you can do in the meantime 💻**

As a member of the GitHub community, your participation is essential. While we can't promise that every suggestion will be implemented, we want to emphasize that your feedback is instrumental in guiding our decisions and priorities. Thank you once again for your contribution to making GitHub even better! We're grateful for your ongoing support and collaboration in shaping the future of our platform. ⭐
---
Hi @OaenHed, this is a known limitation of LLM-based chat systems, and I understand how frustrating it is to lose context mid-session.

**Why This Happens**

Copilot Chat (and similar AI assistants) uses a *context window*: a limited number of tokens (roughly 8k to 128k, depending on the model) that the model can "see" at once. Currently, when a session exceeds this limit, it throws an error rather than automatically dropping older messages to make room for new ones. You're correct that chat hosts don't actively use the earliest messages after a certain point, but the system still keeps them in the context buffer until it overflows.

**Current Workarounds**

While the product doesn't yet support automatic truncation (a sketch of what such truncation could look like follows the list), here are some ways to mitigate the problem:

1. **Manual Truncation.** When a session gets long, start a fresh one and paste in a short summary of the decisions and code so far, so the new session inherits the essential context without the full history.
2. **Use `@workspace` References.** Instead of letting context build up naturally, explicitly reference the relevant files with `@workspace`, so that only the material you actually need enters the window.
3. **Break Complex Tasks into Steps.** Treat each step as a smaller, self-contained conversation; shorter sessions are far less likely to hit the token limit.
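To make the requested behaviour concrete, here's a minimal sketch of what client-side truncation could look like. This is illustrative only, not Copilot's actual implementation: it assumes an OpenAI-style list of message dicts and uses the `tiktoken` library to estimate per-message token counts.

```python
# Minimal sketch only -- not Copilot's implementation.
# Assumes OpenAI-style messages: [{"role": ..., "content": ...}, ...]
# and tiktoken (pip install tiktoken) for token counting.
import tiktoken

def truncate_history(messages, max_tokens=8000):
    """Drop the oldest non-system messages until the history fits.

    The system prompt is preserved: dropping it would change the
    assistant's behaviour, not just its memory of the conversation.
    """
    enc = tiktoken.get_encoding("cl100k_base")
    cost = lambda m: len(enc.encode(m["content"]))

    system = [m for m in messages if m["role"] == "system"]
    rest = [m for m in messages if m["role"] != "system"]

    budget = max_tokens - sum(cost(m) for m in system)
    kept = []
    # Walk backwards from the newest message, keeping as much
    # recent context as fits in the remaining budget.
    for m in reversed(rest):
        c = cost(m)
        if c > budget:
            break
        kept.append(m)
        budget -= c
    kept.reverse()
    return system + kept
```

Truncating at message boundaries like this avoids handing the model a half-sentence; the same idea can also be expressed as dropping a fixed percentage of the oldest tokens, which is what the original request asks for.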
**How to Get This Feature Added**

Since this is a product feature request, the best way to get it implemented is to file a feature request through the Visual Studio feedback channel and encourage others who hit the same limit to upvote it. Product teams prioritize features based on user demand, so the more visibility this gets, the more likely it is to be addressed.

Hope this helps explain the situation and gives you some useful workarounds in the meantime. Let me know if you'd like help crafting a feature request issue!
---
🏷️ Discussion Type
Bug
💬 Feature/Topic Area
Visual Studio
Body
After any meaningfully long session with a chat host, there inevitably comes a point when an exception is thrown due to too many tokens, forcing users to start a new session and lose the built-up context. This behaviour makes little sense, since chat hosts don't (can't?) actively look at the beginning of the session anyway. This isn't a guess; I've tested it multiple times: anything written at the start of a longish session cannot be directly retrieved, even when explicit instructions are given up front that this retrieval will be requested.
Would it be possible to simply truncate the top x% of the tokens when the limit is reached, to allow the session to continue?
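For illustration, I mean something like the following (a hypothetical sketch; the function and its parameters are made up, not any real chat-host API):

```python
# Hypothetical sketch of "truncate the top x% of tokens" -- the
# function and its parameters are illustrative, not a real API.
def truncate_top_percent(token_ids, limit, drop_fraction=0.25):
    """If the prompt exceeds `limit` tokens, silently drop the
    oldest tokens instead of raising an exception."""
    if len(token_ids) <= limit:
        return token_ids
    # Drop at least the overflow, rounded up to drop_fraction of
    # the history so the session isn't re-truncated every message.
    cut = max(len(token_ids) - limit,
              int(len(token_ids) * drop_fraction))
    return token_ids[cut:]
```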