SnapAnsAI Bot is a tool for instant answers based on screenshots. The user presses a hotkey, and the screenshot is automatically sent to a Telegram bot, which analyzes the image using OpenAI API and returns a text response.
✅ Hotkeys — automatic screenshot capture (multiple combinations can be set in config.json
).
✅ Automatic image sending to the Telegram bot.
✅ AI Selection — support for OpenAI for image processing.
✅ Flexible settings in config.json
(change API, hotkeys, OS, etc.).
✅ Cross-platform: Windows, Linux, macOS (auto-detection of OS).
✅ Logging in bot.log
for easy debugging (all errors and events are recorded in the log).
✅ Multilingual support — the bot supports language switching via config.json
and adding new languages in lang.json
.
📚 Education: Screenshots of tasks and instant explanations.
🌝 Work: Capturing important on-screen data and quick analysis.
🧫 Research: Extracting text from charts and documents.
💬 Communication: Processing text and images in real time.
🏃 Daily use: Quickly obtaining text from any screen.
🔍 Proctoring: ?
🐍 Python — main programming language.
🤖 Telegram Bot API — interaction with the bot.
🧠 OpenAI API — image and text processing.
🖥️ PyAutoGUI — screenshot capture (Windows).
🐧 gnome-screenshot — screenshot capture (Ubuntu).
🍏 screencapture — screenshot capture (macOS).
⌨️ pynput — hotkey handler.
🔗 Requests — interaction with APIs.
📃 Logging — logging system.
flameshot
— a screenshot tool
SnapAnsAI Bot consists of several components:
📌 System Components:
1️⃣ Telegram Bot — receives AI-processed messages from the bot.
2️⃣ AI Processing Module — sends images to OpenAI or DeepSeek and retrieves a response.
3️⃣ Screenshot Capture Module — operates via PyAutoGUI, gnome-screenshot
, flameshot
, or screencapture
.
4️⃣ Logging Module — records all events and errors in bot.log
.
5️⃣ Configuration Module — loads and manages config.json
.
snapansai-bot/
│── config/ # Configuration files
│ ├── config.json # Configuration file
│ ├── lang.json # Translation file for multilingual support
│── logs/ # Logs
│ │── session_temp/ # Temporary files (screenshots and cache)
│ ├── bot.log # Bot log file
│── methods/ # Screenshot processing modules
│ ├── flameshot.py # Screenshot handler via flameshot
│── src/ # Project source code
│ ├── ai_api.py # AI API handler
│ ├── bot.py # Main Telegram bot
│ ├── config_handler.py # Configuration management (loading, saving)
│ ├── screenshot_sender.py # Screenshot sending script
│── test/ # AI testing scripts
│ ├── test_ai.py # Project launch script
│── venv/ # Virtual environment
│── README.md # Project documentation
│── requirements.txt # Dependencies file
│── start.sh # Project launch script
🔹 System Startup:
- The bot starts with
start.sh
. - Initializes the Telegram Bot API token.
- Initializes the Chat ID.
- Initializes the AI API Key.
- Detects the user's OS (Windows/Linux/macOS).
- Loads configuration from
config.json
. - Launches the hotkey handler.
- Waits for a hotkey press (
alt+s
,ctrl+m
, etc.).
🔹 Screenshot Processing:
- When a hotkey is pressed, a screenshot is taken and saved in the temporary folder
logs/session_temp/
. - The image is sent to the AI server for analysis.
- The received text response is sent to the Telegram bot.
- The screenshot is deleted from the temporary folder.
🔹 Using flameshot
:
- If
flameshot
is used, the save path can be set in the configuration. - To disable pop-up notifications about saving, run:
flameshot config
and uncheck Show desktop notifications
in the General tab.
bash start.sh
During installation, you will need to provide the API key for the Telegram bot and AI. After installation, you can press the hotkeys, and screenshots will be processed by AI and sent to the Telegram bot.
In config.json
, you can edit:
"bot_token": ""
- Telegram Bot API token"chat_id": 0
- Telegram Chat ID (auto-detection)"language": ""
- Language (available: en, ru, de)"api_key": ""
- AI API key (available: OpenAI, DeepSeek)"model": ""
- AI model (OpenAI: gpt-4o-mini, gpt-4o, o3-mini, gpt-4.5-preview, etc.)"hotkey": ""
- Hotkeys (you can specify one or more, e.g.,"hotkey": "alt+s, ctrl+m, p, /"
)"os": ""
- OS (auto-detection)"method":
- Default is "default" — built-in OS screenshot managers are used. Available:"flameshot"
.
In lang.json
, you can edit:
"prompt": ""
- Prompt- Add other languages
✅ Improved hotkey processing
✅ Integration with OpenAI
✅ Tested on Ubuntu 24.04 (Wayland)
✅ Flexible configuration via config.json
✅ Added flameshot
as an alternative method
🔜 Testing on Windows, MacOS
🔜 Adding Docker
🔜 Integration with other AI models (DeepSeek, Claude, Gemini, etc.)
- USDT (TRC20)/USDC (TRC20):
TUQj3sguQjmKFJEMotyb3kERVgnfvhzG7o
- SOL (Solana):
6VA9oJbkszteTZJbH6mmLioKTSq4r4E3N1bsoPaxQgr4
- XRP (XRP):
rDkEZehHFqSjiGdBHsseR64fCcRXuJbgfr
✨ Designed for convenience and quick access to answers.