# Gate.io P2P Scraper

Scrapes Gate.io P2P buy listings with Puppeteer and saves merchant data to JSON/CSV. Ships with auto-scroll, robust logging, environment-based config, and an optional filtered view around a target merchant.

## Features
- Headless scraping with Puppeteer (uses bundled Chromium by default)
- Configurable target URL (USDT-KES by default)
- Auto-scroll to load dynamic content (see the sketch after this list)
- One-off or continuous runs (configurable interval)
- Outputs in JSON and CSV
- Logs for activity, warnings, and errors
- Optional filtered output for the merchant adjacent to a target (default: "coinftw")
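
The auto-scroll step is what forces lazily loaded listings onto the page before extraction: keep scrolling to the bottom until the page height stops growing. A minimal sketch of that pattern, not the actual implementation; the function name and tuning parameters are illustrative:

```js
// Illustrative auto-scroll: scroll to the bottom repeatedly until the
// document stops growing or a maximum number of passes is reached.
async function autoScroll(page, { maxPasses = 20, pauseMs = 500 } = {}) {
  let lastHeight = await page.evaluate(() => document.body.scrollHeight);
  for (let pass = 0; pass < maxPasses; pass++) {
    await page.evaluate(() => window.scrollTo(0, document.body.scrollHeight));
    await new Promise((resolve) => setTimeout(resolve, pauseMs)); // let lazy rows render
    const height = await page.evaluate(() => document.body.scrollHeight);
    if (height === lastHeight) break; // page stopped growing: all rows loaded
    lastHeight = height;
  }
}
```

## Requirements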
- Node.js 18+ and npm
- Internet access
- Optional: Google Chrome/Chromium (if you prefer a system browser instead of bundled Chromium)
## Setup

Install dependencies:

```bash
npm install
```

## Configuration

Copy `.env.example` to `.env` and adjust as needed:
```env
TARGET_URL=https://www.gate.io/p2p/buy/USDT-KES
SCRAPE_INTERVAL_MS=60000
TARGET_MERCHANT=coinftw
# CHROME_EXECUTABLE=/usr/bin/google-chrome-stable
```

- URL: change `TARGET_URL` to the P2P page you want.
- Browser binary: set `CHROME_EXECUTABLE` to use a system Chrome/Chromium; otherwise the bundled Chromium is used (see the launch sketch after this list).
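
To illustrate how the browser-binary choice plays out at launch time, here is a minimal sketch assuming CommonJS, `dotenv` for `.env` loading, and standard Puppeteer launch options; the real launch code lives in `scraper/scraper.js` and may differ:

```js
require('dotenv').config(); // loads TARGET_URL, SCRAPE_INTERVAL_MS, etc. from .env

const puppeteer = require('puppeteer');

async function launchBrowser() {
  return puppeteer.launch({
    headless: true,
    // If CHROME_EXECUTABLE is set, use that system binary;
    // otherwise fall back to Puppeteer's bundled Chromium.
    executablePath: process.env.CHROME_EXECUTABLE || undefined,
    args: ['--no-sandbox', '--disable-dev-shm-usage'], // common flags for container use
  });
}
```

Leaving `executablePath` as `undefined` is what lets the same code work both locally against a system Chrome and inside Docker against the bundled Chromium.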
For example:

```bash
export CHROME_EXECUTABLE=/usr/bin/google-chrome-stable
```

Note: the filtered output looks for the merchant name `coinftw` by default (see `scraper/filterMerchant.js`).
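
The exact adjacency logic lives in `scraper/filterMerchant.js`. As an illustration only, picking out the rows immediately next to the target merchant in page order could look like this; the function name and merchant object shape are assumptions:

```js
// Illustrative filter: given merchants in page order, return the entries
// adjacent to the target (the rows just before and after it).
function filterAdjacent(merchants, target = process.env.TARGET_MERCHANT || 'coinftw') {
  const i = merchants.findIndex(
    (m) => m.name && m.name.toLowerCase() === target.toLowerCase()
  );
  if (i === -1) return []; // target not on this page
  return merchants.filter((_, j) => j === i - 1 || j === i + 1);
}
```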
## Usage

Run a single scrape:

```bash
npm run scrape:once
```

Run continuously (default interval from `.env`):
```bash
npm start
```
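
Continuous mode pairs an interval timer with the SIGINT/SIGTERM handling noted under the design notes below. A minimal sketch of that shape, assuming a `scrapeOnce()` function that runs one full scrape-and-save cycle (the name and module path are illustrative):

```js
const { scrapeOnce } = require('./scraper/scraper'); // assumed export

const INTERVAL_MS = Number(process.env.SCRAPE_INTERVAL_MS) || 60000;

// Run one cycle per interval; log failures without killing the loop.
const timer = setInterval(() => {
  scrapeOnce().catch((err) => console.error('scrape failed:', err));
}, INTERVAL_MS);

// Graceful shutdown: clear the interval so Node can exit cleanly.
for (const signal of ['SIGINT', 'SIGTERM']) {
  process.on(signal, () => {
    clearInterval(timer);
    console.log(`${signal} received, stopping scraper`);
  });
}
```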
## Docker

Build the image:

```bash
docker build -t gateio-p2p-scraper:latest .
```

Run with your `.env` and persist outputs to the host:
```bash
docker run --rm \
  --env-file .env \
  -v "$(pwd)/data:/app/data" \
  -v "$(pwd)/logs:/app/logs" \
  gateio-p2p-scraper:latest
```

Or use docker-compose:

```bash
docker compose up --build
```

Notes:

- Inside Docker, leave `CHROME_EXECUTABLE` unset so Puppeteer uses its bundled Chromium.
- Data will appear in `./data` and logs in `./logs` on your host.
## Outputs

- `data/gateio_p2p_merchants.json`
- `data/gateio_p2p_merchants.csv`
- `data/filtered_merchants.json` (adjacent to the target merchant)
- `data/filtered_merchants.csv` (adjacent to the target merchant)
## Logs

- `logs/activity.log` — high-level steps
- `logs/errors.log` — errors and stack traces
- `logs/warnings.log` — non-fatal warnings
## Troubleshooting

- Browser not found at executablePath
  - Clear `CHROME_EXECUTABLE` or point it to a valid path; by default the bundled Chromium is used.
- Timeouts waiting for selectors or no data extracted
  - The site may have changed. Update the selectors in `scraper/extract.js` and the `waitForSelector` calls in `scraper/scraper.js` (see the fallback-selector sketch after this list).
- Permission errors writing files
  - Ensure the process can write to the `data/` and `logs/` folders.
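
When updating selectors, the fallback pattern the design notes describe looks roughly like this; the selector strings below are placeholders, not the real ones in `scraper/extract.js`:

```js
// Try selectors in priority order and return the first that matches, so a
// markup change on the site degrades gracefully instead of failing outright.
const ROW_SELECTORS = ['.p2p-list .row', '[data-testid="p2p-row"]', 'table tbody tr']; // placeholders

async function findRows(page) {
  for (const selector of ROW_SELECTORS) {
    const rows = await page.$$(selector);
    if (rows.length > 0) return { selector, rows };
  }
  throw new Error('No known row selector matched; the site layout may have changed');
}
```

Trying selectors in priority order means a site redesign surfaces as a clear error instead of silently returning zero rows.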
## Development

- Lint: `npm run lint`
- Format: `npm run format`
- Tests (none added yet): `npm test`

## Project Structure

- `src/index.js` — CLI/entrypoint for the enhanced app lifecycle
- `app.js` — main application class orchestrating services
- `config/` — environment configs and validation schemas
- `scraper/` — scraping pipeline, selectors, database, metrics, logging, shutdown, etc.
- `scripts/healthcheck.js` — fast readiness/self-check used locally and in Docker HEALTHCHECK (sketched below)
- `data/`, `logs/` — runtime outputs (mounted in Docker)
- `Dockerfile`, `.dockerignore`, `docker-compose.yml` — containerization
- `.editorconfig`, `.prettierrc`, `.nvmrc` — editor/formatting/runtime defaults
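
A healthcheck of this kind typically just verifies that fresh output exists and exits 0 or 1 for Docker's HEALTHCHECK. A minimal sketch; the checks in `scripts/healthcheck.js` may be more thorough, and the staleness window here is illustrative:

```js
// Exit 0 if the JSON output exists and was updated recently, else exit 1.
const fs = require('fs');

const OUTPUT = 'data/gateio_p2p_merchants.json';
const MAX_AGE_MS = 5 * 60 * 1000; // treat output older than 5 minutes as stale (illustrative)

try {
  const age = Date.now() - fs.statSync(OUTPUT).mtimeMs;
  process.exit(age <= MAX_AGE_MS ? 0 : 1);
} catch {
  process.exit(1); // file missing: not healthy
}
```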
## Design Notes

- Extraction selectors are centralized in `scraper/extract.js` and include fallbacks; if the site changes, update them there.
- Config and intervals are environment-driven via `.env` to avoid code changes.
- CSV writing escapes values and ensures directories exist (see the sketch after this list).
- Graceful shutdown on SIGINT/SIGTERM stops intervals cleanly (the run-loop sketch under Usage shows the shape).
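
The CSV behavior noted above, escaping plus directory creation, amounts to something like the following sketch; the function names and row shape are illustrative:

```js
const fs = require('fs');
const path = require('path');

// Quote a field and double any embedded quotes, per RFC 4180.
function csvEscape(value) {
  const s = String(value ?? '');
  return /[",\n]/.test(s) ? `"${s.replace(/"/g, '""')}"` : s;
}

function writeCsv(filePath, rows, headers) {
  fs.mkdirSync(path.dirname(filePath), { recursive: true }); // ensure data/ exists
  const lines = [
    headers.map(csvEscape).join(','),
    ...rows.map((row) => headers.map((h) => csvEscape(row[h])).join(',')),
  ];
  fs.writeFileSync(filePath, lines.join('\n') + '\n');
}
```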
## License

ISC