
✨ Added continuation support for Amazon Bedrock #675

Open · wants to merge 4 commits into base: main

Conversation

@JonahSussman (Contributor) commented Feb 20, 2025

Closes #136
Closes #350

I refactored model_provider.py, putting each ModelProvider into its own class. This lets us override model-specific functionality, for example a custom LLM invoke for Bedrock (the main reason for the refactor).
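To give a rough idea of the shape of the refactor, here is a minimal sketch of the per-provider layout (class names, method names, and constructor signatures here are illustrative, not necessarily the exact ones in model_provider.py):

from langchain_core.language_models.chat_models import BaseChatModel
from langchain_core.messages import BaseMessage


class ModelProvider:
    """Shared behavior; subclasses override the model-specific parts."""

    def __init__(self, llm: BaseChatModel):
        self.llm = llm

    def invoke_llm(self, messages: list[BaseMessage]) -> BaseMessage:
        # Default behavior: a single invoke is enough.
        return self.llm.invoke(messages)


class ModelProviderChatBedrock(ModelProvider):
    def invoke_llm(self, messages: list[BaseMessage]) -> BaseMessage:
        # Bedrock-specific override: keep requesting continuations until the
        # response no longer stops because of max_tokens (sketched further below).
        ...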

Bedrock will now continue to generate tokens even if max_tokens is reached. Look at the following Python script and attached logs:

from langchain.globals import set_debug, set_verbose

from kai.kai_config import KaiConfigModels, SupportedModelProviders
from kai.llm_interfacing.model_provider import ModelProvider

set_verbose(True)
set_debug(True)

m = ModelProvider.from_config(
    config=KaiConfigModels(
        provider=SupportedModelProviders.CHAT_BEDROCK,
        args={"model_id": "us.anthropic.claude-3-5-sonnet-20241022-v2:0"},
    )
)

print(m.invoke("Generate a long poem. 10,000 words. It must be very long.").content)

tester.log

Ugly LangChain color codes aside, you can see that we now make two requests, resulting in one continuous poem at the end.
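For reference, the continuation logic amounts to something like the following simplified sketch (the stop_reason key and the message handling are assumptions based on the behavior described above, not a copy of the actual implementation):

from langchain_core.messages import AIMessage, BaseMessage


def invoke_with_continuation(llm, messages: list[BaseMessage]) -> AIMessage:
    # Keep asking the model to continue until it stops for a reason other
    # than hitting max_tokens, then stitch the pieces back together.
    chunks: list[str] = []
    history = list(messages)

    while True:
        response = llm.invoke(history)
        chunks.append(response.content)

        # Bedrock reports why generation stopped; "max_tokens" means the
        # output was truncated and can be continued.
        if response.response_metadata.get("stop_reason") != "max_tokens":
            break

        # Feed the partial answer back as the assistant turn so the model
        # picks up where it left off.
        history.append(AIMessage(content=response.content))

    return AIMessage(content="".join(chunks))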

I tested locally with Amazon Bedrock and OpenAI, but I would like some assistance testing other providers to make sure I didn't mess anything else up.

Separately, we should probably think about how to integrate streaming into the project. It should be fairly straightforward now with the Chatter class: we can create a message with a separate messageToken and update it as we get more chunks. WDYT?
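Something roughly like this could work (create_message and update_message are hypothetical placeholder names, not the actual Chatter interface):

def stream_to_chat(llm, prompt: str, chatter) -> str:
    # Hypothetical sketch: open a chat message, then keep appending chunks
    # to it as they stream in from the LLM.
    message_token = chatter.create_message("")  # placeholder call
    buffer = ""
    for chunk in llm.stream(prompt):
        buffer += chunk.content
        chatter.update_message(message_token, buffer)  # placeholder call
    return buffer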

@JonahSussman changed the title from "✨ Preliminary continuation support for Amazon Bedrock" to "✨ Added continuation support for Amazon Bedrock" on Feb 20, 2025
@fabianvf (Contributor) left a comment:

This seems reasonable. Does using a model that requires continuation break the caching?

        case SupportedModelProviders.CHAT_DEEP_SEEK:
            return ModelProviderChatDeepSeek(config, demo_mode, cache)
        case _:
            assert_never(config.provider)
@fabianvf (Contributor):

This should be a typed exception rather than an assertion, unless it actually can't be reached. I'm assuming we can trigger this with bad configuration though, right?

@JonahSussman (Author):

This is a new thing I recently found out about type checking in Python. If a type checker thinks that something can't happen, it will be assigned the type Never. This allows you to do exhaustiveness checking and get a static error if you don't handle every case.

We shouldn't ever have an issue with config because the Pydantic model makes sure it's one of the enum values.
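As a small standalone illustration of the pattern (using a throwaway enum, not the project's SupportedModelProviders):

from enum import Enum
from typing import assert_never  # typing_extensions on Python < 3.11


class Color(Enum):
    RED = "red"
    BLUE = "blue"


def describe(color: Color) -> str:
    match color:
        case Color.RED:
            return "warm"
        case Color.BLUE:
            return "cool"
        case _:
            # If a new Color member is added but not handled above, the type
            # checker narrows `color` to that member instead of Never and
            # reports an error here, since assert_never only accepts Never.
            assert_never(color)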

@JonahSussman (Author):

@fabianvf It shouldn't, since we cache BaseMessages and invoke_llm returns one of those.

@jwmatthews (Member) commented Feb 20, 2025

Looking great with Claude via Bedrock.

Tested with

  AmazonBedrock: &active
    provider: "ChatBedrock"
    args:
      model_id: "us.anthropic.claude-3-5-sonnet-20241022-v2:0"

I confirmed that with an older binary against Claude, I was seeing the partial output stop with:
"Stateful EJBs can be converted to a CDI bean by replacing the @Stateful annotation with a bean-defining annotation that encompasses the appropriate scope (e.g., @ApplicationScoped)." for ShoppingCartService.java
[Screenshot: 2025-02-20 at 1:14:42 PM]

With this PR I see the expected full contents of the fix:

[Screenshot: 2025-02-20 at 1:39:41 PM]

The PR looks to be performing as we expected.

$ grep "Continuing..." *
kai-rpc-server.log:INFO - 2025-02-20 13:37:06,527 - kai.kai.llm_interfacing.model_provider - Thread-1 - [model_provider.py:363 - invoke_llm()] - Message did not fit in max tokens. Continuing...
kai-rpc-server.log:INFO - 2025-02-20 13:37:52,221 - kai.kai.llm_interfacing.model_provider - Thread-1 - [model_provider.py:363 - invoke_llm()] - Message did not fit in max tokens. Continuing...
kai-rpc-server.log:INFO - 2025-02-20 13:44:17,735 - kai.kai.llm_interfacing.model_provider - Thread-1 - [model_provider.py:363 - invoke_llm()] - Message did not fit in max tokens. Continuing...


@jwmatthews (Member):

I did hit an error: "An error occurred (ValidationException) when calling the InvokeModel operation: messages: final assistant content cannot end with trailing whitespace"

Logs: https://gist.githubusercontent.com/jwmatthews/43400716b53f0df30c558b2081f4d939/raw/560ea0805957260ef1db6e530487aa5a82bc3443/kai-rpc-server.log

Saw this error when processing: "Stateless EJBs can be converted to a CDI bean by replacing the @stateless annotation with a scope eg @ApplicationScoped" at Medium Effort
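If the continuation path re-sends the partial output as the final assistant message, one possible mitigation (just a sketch, not necessarily how the fix should land in model_provider.py) would be to strip trailing whitespace before building that message, since Anthropic models on Bedrock reject a final assistant turn that ends in whitespace:

from langchain_core.messages import AIMessage


def continuation_prefill(partial_text: str) -> AIMessage:
    # Trim trailing whitespace so Bedrock's validation accepts the request;
    # note that any stripped whitespace would need to be accounted for when
    # joining the continued output back together.
    return AIMessage(content=partial_text.rstrip())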
