Skip to content

longtermrisk/research-assistant

Repository files navigation

Research-Assistant

This monorepo contains an implementation of LLM agents that help with research. It consists of the following parts:

  • automator implements the agent as a python sdk built on top of MCP clients
  • a number of MCP servers implement tools that can be used by our agent or in any other MCP client:
    • terminal-mcp implements tools for an interactive terminal capable of running background tasks, a jupyter notebook tool, and tools to interact with a local codebase
    • talk-to-model contains a send_message tool for agents that help evaluate other LLMs
    • web-mcp contains a google search and a markdown broswer tool, but it's currently not working so well: often, websites are turned into markdown documents that exceed the token limits
  • squiggpy contains a squiggle-like python library and a prompt for agents to use the library

Setup

Run python install.py to setup everything up

Testing

The automator component includes pytest-based tests to verify core functionality:

Running Tests

# Navigate to automator directory
cd automator

# Install with dev dependencies
pip install -e .[dev]

# Run all tests
pytest

# Run specific test files
pytest tests/test_quickstart.py
pytest tests/test_rag_hook.py

# Run tests with verbose output
pytest -v

# Run tests excluding RAG functionality (if dependencies not available)
pytest -m "not rag"

Test Coverage

  • test_quickstart.py: Tests basic agent creation and terminal tool functionality (based on examples/quickstart.py)
  • test_rag_hook.py: Tests RAG (Retrieval-Augmented Generation) hook functionality (based on examples/rag/rag_hook_example.py)
  • test_system.py: Tests core system functionality including imports, RAG operations, and workspace setup

Test Requirements

  • Basic tests require only core dependencies
  • RAG tests require the optional [rag] dependencies: pip install -e .[rag]
  • All tests use temporary directories to avoid affecting your actual workspace

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published