Skip to content

Latest commit

 

History

History
38 lines (29 loc) · 1.08 KB

File metadata and controls

38 lines (29 loc) · 1.08 KB

Changelog

All notable changes to this project will be documented in this file.

The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.

[0.1.1] - 2026-02-13

Changed

  • Version bump for TestPyPI upload

[0.1.0] - 2026-01-10

Added

  • Initial release
  • AWS vLLM on Neuron model provider for Strands Agents SDK
  • Support for OpenAI-compatible API endpoints
  • Full streaming support with async generators
  • Tool/function calling support
  • Structured output support via tool calls
  • Comprehensive unit and integration tests
  • Example implementations:
    • Weather agent example
    • MCP (Model Context Protocol) integration example
    • Streaming examples
  • Configuration options for:
    • Model selection
    • Temperature, top_p, max_tokens
    • Stop sequences
    • Tensor parallel configuration
    • Neuron-specific settings
  • Pyproject.toml with modern Python packaging
  • Formatting and linting configuration (Ruff, Black, mypy)
  • Docker infrastructure for vLLM Neuron server deployment