
Conversation


@richet richet commented Aug 29, 2025

What this does

  • Removes the use of the Invoke endpoint in favor of the Converse endpoint
  • Opens up the use of most Bedrock models instead of just the Anthropic ones
  • Allows document uploads with models that don't support them via the Invoke API, e.g. claude-3-haiku (see the sketch below for the API difference)
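
As a rough illustration of the difference, here is a minimal sketch using the aws-sdk-bedrockruntime gem. RubyLLM itself signs raw HTTP requests rather than using the SDK, so the client calls below are for comparison only, not this PR's implementation:

require 'aws-sdk-bedrockruntime'
require 'json'

client = Aws::BedrockRuntime::Client.new(region: 'us-east-1')

# Invoke: the request body is model-specific (Anthropic's schema here),
# which is why the old implementation was limited to Anthropic models.
resp = client.invoke_model(
  model_id: 'anthropic.claude-3-haiku-20240307-v1:0',
  content_type: 'application/json',
  body: {
    anthropic_version: 'bedrock-2023-05-31',
    max_tokens: 256,
    messages: [{ role: 'user', content: 'Hello' }]
  }.to_json
)
puts JSON.parse(resp.body.read).dig('content', 0, 'text')

# Converse: one uniform request/response shape across Bedrock models,
# including document attachments for models whose Invoke schema lacks them.
resp = client.converse(
  model_id: 'anthropic.claude-3-haiku-20240307-v1:0',
  messages: [{ role: 'user', content: [{ text: 'Hello' }] }]
)
puts resp.output.message.content.first.text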

Type of change

  • Bug fix
  • New feature
  • Breaking change
  • Documentation
  • Performance improvement

Scope check

  • I read the Contributing Guide
  • This aligns with RubyLLM's focus on LLM communication
  • This isn't application-specific logic that belongs in user code
  • This benefits most users, not just my specific use case

Quality check

  • I ran overcommit --install and all hooks pass
  • I tested my changes thoroughly
  • I updated documentation if needed
  • I didn't modify auto-generated files manually (models.json, aliases.json)

API changes

  • Breaking change
  • New public methods/classes
  • Changed method signatures
  • No API changes

Related issues

@crmne crmne (Owner) left a comment


Adding the Converse API is a great idea, but this PR has significant issues:

  1. No tests!
  2. Code complexity
  3. It doesn't follow the lead of the other providers in terms of method names, modules, etc.
  4. Overcommit was not installed

@richet richet marked this pull request as draft September 3, 2025 18:57
@richet richet marked this pull request as ready for review September 12, 2025 06:35
@richet richet changed the title WIP - Use Converse API for Bedrock provider Use Converse API for Bedrock provider Sep 12, 2025
@richet richet (Author) commented Sep 12, 2025

@crmne Several changes made per your comments. Well tested and ready for review.

@codecov codecov bot commented Sep 14, 2025

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 84.57%. Comparing base (4ff2231) to head (2d460e2).
⚠️ Report is 1 commit behind head on main.

Additional details and impacted files
@@           Coverage Diff           @@
##             main     #377   +/-   ##
=======================================
  Coverage   84.57%   84.57%           
=======================================
  Files          37       37           
  Lines        1932     1932           
  Branches      499      499           
=======================================
  Hits         1634     1634           
  Misses        298      298           

☔ View full report in Codecov by Sentry.

@crmne crmne (Owner) left a comment


This is a big patch @richet so thank you for the effort but there are still significant changes needed:

I still see that the organization of the code, especially in chat.rb, doesn't match that of the other providers. There are a ton of methods there that belong in other modules of a provider implementation. Check the OpenAI provider for an example of what belongs where. I'll resume reviewing once that's done.

Thank you again for the monumental effort.

@richet richet (Author) commented Sep 16, 2025

@crmne Thanks for the review and for pushing me to clean this up a bit more. I renamed and cleaned up a lot of the methods, to the point where I think it's close to what you have in OpenAI.

@richet richet (Author) commented Oct 8, 2025

@crmne The VCR cassettes conflict whenever your main branch is updated, so I have fixed them again. Hopefully this one can be reviewed again soon 🤞

@michaeldiscala

Hi @richet and @crmne 👋 We're very interested in using Bedrock with RubyLLM and would love to help move this forward if we can. Are there any ways we could pitch in? Thank you!

RSpec.describe RubyLLM::Chat do
  include_context 'with configured RubyLLM'

  # Helper to mitigate Bedrock rate limits in CI by retrying with backoff
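  # (Editor's sketch: the helper body is truncated in this excerpt; the method
  # name and the rescued error class are assumptions, not this PR's actual code.)
  def with_bedrock_backoff(max_attempts: 5)
    attempts = 0
    begin
      yield
    rescue RubyLLM::RateLimitError
      attempts += 1
      raise if attempts >= max_attempts
      sleep(2**attempts) # exponential backoff: 2s, 4s, 8s, ...
      retry
    end
  end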
Contributor


We only need this and the test delays when recording VCRs, right? Would be nice to isolate it somehow. Bedrock is a bit of a pain, isn't it?

Author


Yes, that's correct. This PR also keeps getting merge conflicts because of the cassettes.

@tpaulshippy (Contributor)

Hi @richet and @crmne 👋 We're very interested in using Bedrock with RubyLLM and would love to help move this forward if we can. Are there any ways we could pitch in? Thank you!

Which models in Bedrock are you looking to use? Just curious.

@richet richet (Author) commented Nov 18, 2025

Which models in Bedrock are you looking to use? Just curious.

I've been running this fork in prod for about a month now using Haiku 3.5 and Sonnet 4 in the AWS APAC region.

@tpaulshippy (Contributor)

Which models in Bedrock are you looking to use? Just curious.

I've been running this fork in prod for about a month now using Haiku 3.5 and Sonnet 4 in the AWS APAC region.

I use those models on Bedrock with RubyLLM without these changes. Isn't this for other non-Anthropic models?

@richet richet (Author) commented Nov 18, 2025

I haven't looked at the source for the last month, but unless something has changed it is currently pinned to the US region models. In my case we have to use APAC. We also found that our rate limits on the Converse endpoint are higher than on Invoke, which is the main change this PR makes. The ability to use all of the other models via the Converse endpoint is a bonus.
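
For context on the region pinning: Bedrock's cross-region inference profiles prefix the base model ID with a geography, so region support largely comes down to which ID prefix the provider builds. The IDs below are examples only, not a guaranteed or exhaustive list:

# Illustrative only: availability varies by account and region.
# The same base model is addressed through geography-prefixed profile IDs:
US_PROFILE   = 'us.anthropic.claude-3-5-haiku-20241022-v1:0'
APAC_PROFILE = 'apac.anthropic.claude-3-5-haiku-20241022-v1:0'
# A provider hardcoded to the 'us.' prefix cannot reach the APAC profiles,
# which is the limitation described above.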

@tpaulshippy (Contributor)

FYI and for others, #338 did get merged.

@bensheldon

I'm a longtime lurker in this thread 😁

Which models in Bedrock are you looking to use? Just curious.

I'd like to use the Amazon Nova models. I'm already using them directly via the AWS SDK / Converse, and they're good enough for my uses (summarization, translation, vision) given how inexpensive they are.

@tpaulshippy (Contributor)

Awesome. I'm considering bringing this into my fork soon and just trying to gauge interest.

@michaeldiscala

Which models in Bedrock are you looking to use? Just curious.

We are also primarily looking to use the Anthropic models, so we can already access them via the current invoke implementation.

As context for my original question: we'd like to build on top of RubyLLM, and there are a couple of features we were hoping to contribute back. Before we start that work, we want to make sure we're targeting the right baseline. Since this PR is a significant change to the Bedrock provider, it'd be ideal if it were merged first.

After researching a bit more, though, I'm realizing the features most pressing for us may actually be accessible via the current invoke_model implementation (guardrails and cross-region inference profiles), so our needs may not actually push toward the Converse API.

We would still love to help move the bedrock provider forward though -- regardless of which API endpoint makes the most sense for RubyLLM's broader goals -- and will keep following along. Thanks!

