Skip to content

Link Dumper is a powerful Python-based web crawler designed for pentesting and reconnaissance.

Notifications You must be signed in to change notification settings

walidzitouni/Link_dumper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Link Dumper - Web Crawler for Extracting Links and JavaScript Files

Alt Text

📌 Overview

Link Dumper is a powerful Python-based web crawler designed for pentesting and reconnaissance. It scans websites for URLs and extracts JavaScript (.js), text (.txt), JSON (.json), and XML (.xml) files. This is useful for subdomain enumeration, API key discovery, and security analysis.

🚀 Features

  • Extracts URLs from <a> and <script> tags
  • Finds JavaScript files that might contain sensitive data
  • Crawls additional links recursively for deeper analysis
  • Extracts API keys & version numbers from JavaScript files
  • Saves results in JSON format for further processing
  • Handles relative and absolute links automatically
  • Multi-threaded crawling for fast performance

🛠️ Installation

1️⃣ Clone the Repository

git clone https://github.com/walidzitouni/Link_dumper.git
cd Link-Dumper

2️⃣ Install Dependencies

pip install -r requirements.txt

📌 Usage

Basic Usage

Run the script and enter a target URL and Select [1]:

bash main.sh

or

python3 Link_dumper.py

For version 2, you can also provide a file with a list of URLs and Select [2]:

bash main.sh 

Example Output

The tool will generate:

  • file_links.json → Contains extracted links
  • file_js.json → Contains extracted JavaScript file URLs

🔥 Features in Detail

1️⃣ Crawling & Link Extraction

  • Parses all <a> and <script> tags on a given website.
  • Extracts internal & external URLs.
  • Saves extracted URLs in file_links.json.

2️⃣ JavaScript File Detection

  • Finds all JavaScript files (.js).
  • Useful for discovering API endpoints, hidden parameters, and sensitive data.
  • Saves .js file URLs in file_js.json.

3️⃣ Recursive Crawling

  • Reads URLs from file_links.json.
  • Recrawls found pages for deeper enumeration.
  • Avoids duplicate links and saves unique results.

4️⃣ Multi-Threading

  • Uses concurrent.futures for parallel processing.
  • Makes scanning large websites much faster.

⚠️ Legal Disclaimer

This tool is intended for educational and ethical security testing purposes only.
Do not use it on websites without explicit permission.

The author takes no responsibility for any misuse of this tool.


📜 License

MIT License - Feel free to use and modify this tool.


👤 Author


What’s Included:

✅ Proper Markdown formatting
✅ Installation, usage, and advanced examples
✅ Legal disclaimer
✅ License information

Would you like badges (Python version, license, etc.) or a contribution guide added? 🚀

About

Link Dumper is a powerful Python-based web crawler designed for pentesting and reconnaissance.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published