sourcecatcher.com

A reverse image search tool for InSomnia

See the Reddit release thread for more information about Sourcecatcher.

Setup

Sourcecatcher is published as an OCI container.

Directory structure

$ tree
.
├── config
│   ├── nitter
│   │   └── sessions.jsonl
│   └── sourcecatcher
│       ├── config-discord.toml
│       └── config.yaml
└── live
    ├── discord.db
    ├── phash_index.ann
    └── twitter_scraper.db

See the Config files section for how to set up each configuration file. The live directory contains Sourcecatcher's databases and should be persisted to a host directory (see the Quadlet setup section).
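Before starting the container, it can help to confirm that the layout above is in place. The following is a minimal sketch, not part of Sourcecatcher: the function name `missing_paths` and the two lists are hypothetical, and the databases under live/ are deliberately not checked, on the assumption that they may not exist before the first run.

```python
from pathlib import Path

# Files taken from the directory tree above, relative to the base directory.
# The databases under live/ are left out (assumption: they may be absent
# before the first run).
EXPECTED_FILES = [
    "config/nitter/sessions.jsonl",
    "config/sourcecatcher/config-discord.toml",
    "config/sourcecatcher/config.yaml",
]
EXPECTED_DIRS = ["live"]


def missing_paths(base):
    """Return the expected files and directories that are absent under base."""
    base = Path(base)
    missing = [p for p in EXPECTED_FILES if not (base / p).is_file()]
    missing += [d for d in EXPECTED_DIRS if not (base / d).is_dir()]
    return missing


if __name__ == "__main__":
    import sys

    # Check the directory given on the command line, or the current one.
    base = sys.argv[1] if len(sys.argv) > 1 else "."
    for path in missing_paths(base):
        print(f"missing: {path}")
```

Running the script against the base directory prints one line per missing path and nothing when the layout is complete.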

Quadlet setup

Create a Quadlet unit file for the container. Adjust the network settings, published port, and volume mounts to match your host.

$ cat ~/.config/containers/systemd/sourcecatcher.container 
[Unit]
Description=Sourcecatcher reverse image search service
After=network.target

[Container]
ContainerName=sourcecatcher
Image=ghcr.io/evanc577/sourcecatcher:latest
AutoUpdate=registry
Network=bridge
PublishPort=9000:80
Volume=/home/sourcecatcher/config/sourcecatcher/:/sourcecatcher/config/:Z,ro
Volume=/home/sourcecatcher/config/nitter/sessions.jsonl:/nitter/sessions.jsonl:Z,ro
Volume=/home/sourcecatcher/live/:/sourcecatcher/live/:Z

[Install]
WantedBy=multi-user.target default.target
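Since the volume mounts are the part most likely to need adjusting, a quick check that each host path exists can catch typos before the service is started. This is a rough sketch under stated assumptions: `volume_host_paths` is a hypothetical helper, it only handles the `Volume=HOST:CONTAINER[:OPTIONS]` form used above, and it does not distinguish named volumes from host paths.

```python
from pathlib import Path


def volume_host_paths(unit_text):
    """Return the host side of every Volume= line in a Quadlet unit file."""
    paths = []
    for line in unit_text.splitlines():
        line = line.strip()
        if line.startswith("Volume="):
            # Volume=HOST:CONTAINER[:OPTIONS]; keep only the HOST part.
            paths.append(line[len("Volume="):].split(":", 1)[0])
    return paths


if __name__ == "__main__":
    # Location taken from the example above; skipped if the file is absent.
    unit = Path.home() / ".config/containers/systemd/sourcecatcher.container"
    if unit.is_file():
        for path in volume_host_paths(unit.read_text()):
            status = "ok" if Path(path).exists() else "missing"
            print(f"{status}: {path}")
```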

Start the container

$ systemctl --user daemon-reload
$ systemctl --user start sourcecatcher.service

Config files

`config.yaml`

config.yaml contains runtime information needed by Sourcecatcher.

# No need to change these when running the OCI container
media_dir: "/sourcecatcher/images/"
nitter_instance: "http://0.0.0.0:8080"

# Image hashing options
cpus: 4
recalculate_kmeans: False

# Set to true to enable scraping discord server channels for Twitter links
scrape_discord: true

# These users will show up first in search results
priority_users:
  - "hf_dreamcatcher"
  - "jp_dreamcatcher"
  - "7_DREAMERS"
  - "2Moori"

# Set of users to scrape via Nitter
users:
  - "hf_dreamcatcher"
  - "7_DREAMERS"
  - "2Moori"

`config-discord.toml`

database_file = "working/discord.db"
discord_token = "your-discord-api-token"

# List of Discord channel IDs to scrape
watched_channels = [
    "253293425460248580",
    "253293450030481418",
]

`sessions.jsonl`

sessions.jsonl contains the Twitter user accounts used to run a local Nitter instance. See the upstream Nitter documentation for how to generate this file.
