Conversation

@ozanarmagan
Contributor

Change Summary

Update outdated information about the context window for RAG

PR Checklist

Updated context window limits explanation for clarity and accuracy.
:::tip Context Window Limits
Although we retain the entire conversation history in Typesense, only the most recent 3000 tokens (approximately 1200 characters) of the conversation history will be sent for generating the standalone question due to the context limit.
:::tip Context Window Limits
Although the entire conversation history is stored in Typesense, only the most recent messages that fit within the configured ```max_bytes``` window are sent when generating the standalone question. Before selecting messages, we subtracts the number of bytes required by the system instructions; the remaining byte budget determines how many recent messages can be included.
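
To make the byte-budget description above concrete, here is a minimal sketch of how such a selection might work. It is not Typesense's actual implementation: the function name `select_recent_messages` and its parameters are hypothetical, sizes are assumed to be counted as UTF-8 bytes, and messages are assumed to be included whole or not at all.

```python
def select_recent_messages(messages, max_bytes, system_instructions):
    """Return the most recent messages that fit in the remaining byte budget.

    Hypothetical helper for illustration only; `messages` is ordered
    oldest to newest.
    """
    # Assumption: the system instructions are charged against the budget first.
    budget = max_bytes - len(system_instructions.encode("utf-8"))

    selected = []
    # Walk backwards from the newest message and stop at the first one
    # that no longer fits, so the result is a contiguous recent window.
    for message in reversed(messages):
        size = len(message.encode("utf-8"))
        if size > budget:
            break
        selected.append(message)
        budget -= size

    # Restore chronological order (oldest to newest).
    selected.reverse()
    return selected


history = [
    "What is Typesense?",
    "It is a search engine.",
    "Does it support RAG?",
]
recent = select_recent_messages(
    history,
    max_bytes=60,
    system_instructions="Rewrite as a standalone question.",
)
print(recent)
```

In this sketch, a larger `max_bytes` lets more of the history through, while longer system instructions shrink the space left for messages.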
Member

typo: "Before selecting messages, we subtracts"

2 participants