A desktop application to scan your drive and find duplicate files. It helps you identify and manage duplicate files to free up disk space.
- Scan any directory for duplicate files
- Filter by file extensions (e.g., .jpg, .pdf, .mp3)
- Set minimum file size to ignore small files
- Choose between different comparison methods:
- Content Hash (MD5): Most accurate, compares actual file content
- File Size: Fast but less accurate
- Filename: Quick check for identically named files
- View detailed information about duplicate groups
- Export results to CSV for further analysis
- Python 3.11 or higher
- PyQt5
- Clone the repository:
git clone https://github.com/yourusername/duplicate-file-finder.git
cd duplicate-file-finder
- Create a virtual environment:
python -m venv venv
# On Windows:
venv\Scripts\activate
# On macOS/Linux:
source venv/bin/activate
- Install dependencies:
pip install -r requirements.txt
Run the application:
python duplicate_finder.py
- Enter the directory path you want to scan or use the quick access buttons (Home, Documents, Downloads)
- Optionally set file extensions to filter (e.g.,
.jpg,.png,.pdf
) - Set the minimum file size to skip small files
- Choose your preferred scan method (Content Hash recommended for accuracy)
- Click "Start Scan"
- View details of each duplicate group by clicking the "View" button
- Export results to CSV for record-keeping or further analysis
You can create a standalone executable using PyInstaller:
pip install pyinstaller
pyinstaller --onefile --windowed duplicate_finder.py
The executable will be created in the dist
directory.
Demo Screencast from 2025-03-16 20-01-20.webm
Contributions are welcome! Please feel free to submit a Pull Request.
This project is licensed under the MIT License - see the LICENSE file for details.