Strands + LangSmith

A Strands-native agent example with LangSmith integration: OTEL tracing, PromptHub, datasets, evaluators, and experiments.

Quickstart

cp .env.example .env   # fill in your keys
uv sync
uv run jupyter notebook demo.ipynb

What the notebook does

The notebook (demo.ipynb) walks through the full lifecycle step by step:

Setup — Load env vars and wire up OTEL tracing so Strands spans export to LangSmith.
Push prompt to Hub — Push the agent's system prompt to LangSmith PromptHub (versioned).
Create agent — Pull the system prompt back from Hub and create a Strands agent with three tools: lookup_knowledge_base, calculator, and web_search.
Create dataset — Upload a QA evaluation dataset to LangSmith (6 examples covering knowledge lookups, math, and multi-tool questions).
Create evaluator — Attach an LLM-judge evaluator (correctness) to the dataset with an inline prompt.
Run experiment — Run the agent against every dataset example via client.evaluate(), scored by a PII-detection code evaluator.
Demo — Run a single multi-tool agent call and inspect the trace in LangSmith.
Cleanup — Optionally delete all created resources.
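
The "Run experiment" step mentions a PII-detection code evaluator without showing it. A minimal sketch of what such an evaluator could look like follows. The function name pii_free, the regex patterns, and the "answer" output key are assumptions for illustration, not the notebook's actual code.

```python
import re

# Illustrative patterns only; real PII detection needs much broader coverage.
PII_PATTERNS = {
    "email": re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}"),
    "us_ssn": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
    "phone": re.compile(r"\b\d{3}[-.\s]\d{3}[-.\s]\d{4}\b"),
}


def pii_free(outputs: dict) -> dict:
    """Score 1 when no pattern matches the agent's answer, else 0."""
    # "answer" is an assumed key; use whatever key your target function returns.
    text = str(outputs.get("answer", ""))
    hits = sorted(name for name, pattern in PII_PATTERNS.items() if pattern.search(text))
    return {
        "key": "pii_free",
        "score": 0 if hits else 1,
        "comment": "matched: " + ", ".join(hits) if hits else "no PII patterns matched",
    }
```

Recent versions of the langsmith SDK accept plain functions like this in the evaluators list passed to client.evaluate(); check which evaluator signature your installed version expects.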

The agent is defined in agent.py and reused across experiments and the demo. The system prompt is pulled from PromptHub at runtime, so you can update it in Hub without changing code.

Project structure

agent.py                 # Agent definition: tools + create_agent() + ask()
demo.ipynb               # Walkthrough notebook (run cells top-to-bottom)
langsmith_exporter.py    # OTEL exporter: transforms Strands spans for LangSmith
setup/
  config.py              # LangSmith client, constants, create_model()
  prompts.py             # Push/pull agent prompt to/from Hub
  datasets.py            # Create evaluation dataset
  evaluators.py          # Create LLM-judge evaluator on dataset
  cleanup.py             # Delete all created LangSmith resources
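
langsmith_exporter.py applies its own span transformation, which is not reproduced here. For orientation, the sketch below shows the plain OTLP settings a standard exporter would need to reach LangSmith. The endpoint URL and header names follow LangSmith's OpenTelemetry documentation as best understood and should be verified against it; otel_env and the project name are hypothetical.

```python
from typing import Dict, Optional

# Assumed default LangSmith OTEL endpoint; self-hosted or regional setups differ.
LANGSMITH_OTEL_ENDPOINT = "https://api.smith.langchain.com/otel"


def otel_env(api_key: str, project: Optional[str] = None) -> Dict[str, str]:
    """Build the standard OTLP environment variables for exporting to LangSmith."""
    headers = f"x-api-key={api_key}"
    if project:
        # Optional header that routes traces to a named LangSmith project.
        headers += f",Langsmith-Project={project}"
    return {
        "OTEL_EXPORTER_OTLP_ENDPOINT": LANGSMITH_OTEL_ENDPOINT,
        "OTEL_EXPORTER_OTLP_HEADERS": headers,
    }
```

Calling os.environ.update(otel_env(key, "strands-demo")) before creating the agent would apply these settings to any exporter that reads the standard OTLP variables.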

Switching models

All model configuration lives in setup/config.py in the create_model() function. Comment/uncomment the provider you want:

OpenAI (default)

import os

def create_model():
    from strands.models.openai import OpenAIModel
    return OpenAIModel(
        model_id="gpt-4o",
        client_args={"api_key": os.getenv("OPENAI_API_KEY")},
    )

Requires OPENAI_API_KEY in your .env.

Amazon Bedrock

def create_model():
    from strands.models import BedrockModel
    return BedrockModel(
        model_id="us.anthropic.claude-sonnet-4-5-20250929-v1:0",
    )

Uses the default boto3 credential chain (environment variables, ~/.aws/credentials, IAM role, etc.).

Bedrock credentials in .env — two options:

# Option 1: Direct credentials (IAM user or temporary)
AWS_ACCESS_KEY_ID=AKIA...
AWS_SECRET_ACCESS_KEY=...
AWS_SESSION_TOKEN=...          # required if using temporary/SSO credentials

# Option 2: Bearer token
AWS_BEARER_TOKEN_BEDROCK=...

To get these:

IAM user keys: AWS Console → IAM → Users → Security credentials → Create access key
Temporary credentials (SSO/STS): Run aws sts get-session-token or aws sso login — this gives all three values including AWS_SESSION_TOKEN
Bearer token: Less common; see Bedrock API key docs

You must also enable model access in the Bedrock console for the models you want to use. The us. prefix on a model ID means it refers to a cross-region inference profile, so make sure access is enabled in at least one US region.

Playground & evaluators: The notebook includes a cell (step 0.5) to upload these credentials as LangSmith workspace secrets, which the Playground and online evaluators need to call Bedrock server-side. If you're using temporary credentials, all three values (AWS_ACCESS_KEY_ID, AWS_SECRET_ACCESS_KEY, AWS_SESSION_TOKEN) must be uploaded — missing the session token will cause "invalid security token" errors.
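
The "invalid security token" failure described above can be caught before any call to Bedrock. The sketch below checks only the .env-style options listed earlier; it does not cover ~/.aws/credentials or IAM roles. The function name is hypothetical. It relies on the AWS convention that temporary access key IDs start with ASIA while long-term IAM user keys start with AKIA.

```python
from typing import List, Mapping


def bedrock_credential_problems(env: Mapping[str, str]) -> List[str]:
    """Return human-readable problems with Bedrock credentials found in env."""
    # Option 2: a bearer token needs no other AWS variables.
    if env.get("AWS_BEARER_TOKEN_BEDROCK"):
        return []

    problems = []
    key_id = env.get("AWS_ACCESS_KEY_ID", "")
    if not key_id:
        problems.append("AWS_ACCESS_KEY_ID is not set")
    if not env.get("AWS_SECRET_ACCESS_KEY"):
        problems.append("AWS_SECRET_ACCESS_KEY is not set")
    # Temporary (STS/SSO) keys are only valid together with a session token.
    if key_id.startswith("ASIA") and not env.get("AWS_SESSION_TOKEN"):
        problems.append("temporary key (ASIA...) requires AWS_SESSION_TOKEN")
    return problems
```

Running bedrock_credential_problems(os.environ) before uploading workspace secrets gives a quick list of what is missing.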

Anthropic (direct API)

import os

def create_model():
    from strands.models.anthropic import AnthropicModel
    return AnthropicModel(
        model_id="claude-sonnet-4-20250514",
        max_tokens=4096,  # AnthropicModel needs max_tokens; adjust as needed
        client_args={"api_key": os.getenv("ANTHROPIC_API_KEY")},
    )

Requires pip install 'strands-agents[anthropic]' and ANTHROPIC_API_KEY in your .env.

Ollama (local)

def create_model():
    from strands.models.ollama import OllamaModel
    return OllamaModel(
        host="http://localhost:11434",  # default address of a local Ollama server
        model_id="llama3.1:8b",
    )

Requires a running Ollama server (ollama serve).
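
As an alternative to commenting and uncommenting code, the provider could be selected from an environment variable. This is a sketch, not how setup/config.py is written: MODEL_PROVIDER and resolve_provider are hypothetical names, and the max_tokens and host values are assumptions. Imports stay inside each branch so that only the selected provider's extra needs to be installed.

```python
import os
from typing import Mapping, Optional

SUPPORTED_PROVIDERS = ("openai", "bedrock", "anthropic", "ollama")


def resolve_provider(env: Optional[Mapping[str, str]] = None) -> str:
    """Read the provider name from MODEL_PROVIDER (hypothetical), defaulting to openai."""
    env = os.environ if env is None else env
    name = env.get("MODEL_PROVIDER", "openai").strip().lower()
    if name not in SUPPORTED_PROVIDERS:
        raise ValueError(f"Unknown MODEL_PROVIDER {name!r}; expected one of {SUPPORTED_PROVIDERS}")
    return name


def create_model():
    """Build the model for the selected provider, importing it lazily."""
    provider = resolve_provider()
    if provider == "openai":
        from strands.models.openai import OpenAIModel
        return OpenAIModel(
            model_id="gpt-4o",
            client_args={"api_key": os.getenv("OPENAI_API_KEY")},
        )
    if provider == "bedrock":
        from strands.models import BedrockModel
        return BedrockModel(model_id="us.anthropic.claude-sonnet-4-5-20250929-v1:0")
    if provider == "anthropic":
        from strands.models.anthropic import AnthropicModel
        return AnthropicModel(
            model_id="claude-sonnet-4-20250514",
            max_tokens=4096,  # assumed value
            client_args={"api_key": os.getenv("ANTHROPIC_API_KEY")},
        )
    from strands.models.ollama import OllamaModel
    return OllamaModel(host="http://localhost:11434", model_id="llama3.1:8b")
```

With this in place, setting MODEL_PROVIDER=bedrock in .env would switch providers without editing code.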

Environment variables

See .env.example for the full list. At minimum you need:

| Variable | Required for |
| --- | --- |
| `LANGSMITH_API_KEY` | All LangSmith features (tracing, Hub, datasets, evaluators) |
| `OPENAI_API_KEY` | OpenAI model provider (if using OpenAI) |
| `AWS_ACCESS_KEY_ID` | Bedrock model provider + evaluator |
| `AWS_SECRET_ACCESS_KEY` | Bedrock model provider + evaluator |
| `AWS_SESSION_TOKEN` | Bedrock (required if using temporary/SSO credentials) |
| `AWS_DEFAULT_REGION` | Bedrock (defaults to `us-east-1`) |
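
A short preflight check can report which of these variables are missing for a given provider before the notebook runs. This sketch mirrors the list above plus ANTHROPIC_API_KEY from the Anthropic section. The names missing_env and PROVIDER_REQUIRED are hypothetical, and the Bedrock entry assumes direct credentials rather than a bearer token.

```python
from typing import Dict, List, Mapping

ALWAYS_REQUIRED = ["LANGSMITH_API_KEY"]

# Per-provider requirements, taken from the variable list and provider sections.
PROVIDER_REQUIRED: Dict[str, List[str]] = {
    "openai": ["OPENAI_API_KEY"],
    "bedrock": ["AWS_ACCESS_KEY_ID", "AWS_SECRET_ACCESS_KEY"],
    "anthropic": ["ANTHROPIC_API_KEY"],
    "ollama": [],  # local server, no API key
}


def missing_env(provider: str, env: Mapping[str, str]) -> List[str]:
    """Return the names of required variables that are unset or empty."""
    needed = ALWAYS_REQUIRED + PROVIDER_REQUIRED[provider]
    return [name for name in needed if not env.get(name)]
```

For example, missing_env("openai", os.environ) returns an empty list once both LANGSMITH_API_KEY and OPENAI_API_KEY are set.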

Cleanup

To delete all LangSmith resources created by the demo:

uv run python -m setup.cleanup

Or uncomment the cleanup cell at the bottom of the notebook.
