This project focuses on building a scalable CAPTCHA-solving system that identifies text-based CAPTCHAs with varying character lengths (1 to 6) and diverse fonts. Using a modular approach, we implemented and trained seven deep learning models: one that predicts the CAPTCHA length and six specialized classifiers, one per length.
The project emphasizes scalability by leveraging TensorFlow Lite (TFLite) for efficient deployment on resource-constrained devices like Raspberry Pi, ensuring fast and accurate CAPTCHA classification.
- Scalable Design: Utilizes a modular seven-model architecture to handle variable CAPTCHA lengths efficiently.
- Font Robustness: Models trained on datasets containing diverse fonts to improve generalization.
- Efficient Deployment: Conversion of TensorFlow models to TFLite format ensures compatibility with edge devices like Raspberry Pi.
- High Performance: Employs preprocessing and optimized training configurations to achieve high accuracy and efficient resource utilization.
The system is based on a divide-and-conquer approach (a minimal routing sketch follows the list below), where:
- Length Prediction Model: Predicts the length of the CAPTCHA.
- Classification Models: Six specialized models classify CAPTCHAs for lengths 1 through 6.
- Data Preprocessing: We explored various preprocessing techniques; the best configuration improved image clarity and model performance.
- Training Models: Each model was trained on a dataset specific to its target length for optimized performance.
- Font Handling: Training datasets included images in all font variations to ensure robustness.
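To make the routing above concrete, here is a minimal inference sketch assuming the seven models have been exported to TFLite; the file names, input layout, and the assumption that the length model outputs classes 1–6 are illustrative, not the exact conventions used in this repo.

```python
import numpy as np
import tensorflow as tf

# Illustrative file names; adjust to the actual model files in this repo.
LENGTH_MODEL_PATH = "length_model.tflite"
CLASSIFIER_PATHS = {n: f"captcha_model_{n}.tflite" for n in range(1, 7)}

def load_interpreter(path):
    interpreter = tf.lite.Interpreter(model_path=path)
    interpreter.allocate_tensors()
    return interpreter

def run(interpreter, image):
    """Run one preprocessed image (with batch dimension) through a TFLite model."""
    inp = interpreter.get_input_details()[0]
    out = interpreter.get_output_details()[0]
    interpreter.set_tensor(inp["index"], image.astype(np.float32))
    interpreter.invoke()
    return interpreter.get_tensor(out["index"])

def solve_captcha(image, length_interp, classifier_interps):
    # Stage 1: predict the CAPTCHA length (assumed encoded as classes 0..5 -> lengths 1..6).
    length = int(np.argmax(run(length_interp, image))) + 1
    # Stage 2: route the image to the classifier specialized for that length.
    return length, run(classifier_interps[length], image)

# Usage: load the length model and the six classifiers once, then call solve_captcha()
# per image. Decoding the classifier output into characters depends on how each
# per-length model encodes its labels, which is not specified here.
```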
- Epochs: 20
- Batch Size: 64
- Early Stopping: Enabled to prevent overfitting.
- Generated 64,000 images for training each model (90/10 train-test split).
- Data preprocessing included thresholding and noise removal to enhance input quality (a sketch of the preprocessing and training setup follows this list).
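As a hedged illustration of the configuration above (thresholding plus noise removal, a 90/10 split, 20 epochs, batch size 64, early stopping), the sketch below uses OpenCV and Keras; the image size, alphabet size, CNN architecture, and the `load_dataset` helper are assumptions for illustration only, not the repo's actual code.

```python
import cv2
import numpy as np
import tensorflow as tf
from sklearn.model_selection import train_test_split

IMG_SIZE = (64, 128)   # (height, width); assumed, adjust to the dataset
NUM_CLASSES = 36       # assumed per-character alphabet size (digits + letters)

def preprocess(path):
    """Grayscale -> Otsu threshold -> median blur to suppress speckle noise."""
    img = cv2.imread(path, cv2.IMREAD_GRAYSCALE)
    _, img = cv2.threshold(img, 0, 255, cv2.THRESH_BINARY + cv2.THRESH_OTSU)
    img = cv2.medianBlur(img, 3)
    img = cv2.resize(img, (IMG_SIZE[1], IMG_SIZE[0]))
    return img.astype(np.float32) / 255.0

def build_model():
    """Small CNN classifier; a stand-in for the actual per-length architecture."""
    return tf.keras.Sequential([
        tf.keras.layers.Input(shape=(*IMG_SIZE, 1)),
        tf.keras.layers.Conv2D(32, 3, activation="relu"),
        tf.keras.layers.MaxPooling2D(),
        tf.keras.layers.Conv2D(64, 3, activation="relu"),
        tf.keras.layers.MaxPooling2D(),
        tf.keras.layers.Flatten(),
        tf.keras.layers.Dense(128, activation="relu"),
        tf.keras.layers.Dense(NUM_CLASSES, activation="softmax"),
    ])

paths, labels = load_dataset()  # hypothetical helper returning file paths and integer labels

X = np.expand_dims(np.stack([preprocess(p) for p in paths]), -1)
y = np.array(labels)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.1)  # 90/10 split

model = build_model()
model.compile(optimizer="adam", loss="sparse_categorical_crossentropy", metrics=["accuracy"])
model.fit(
    X_train, y_train,
    validation_data=(X_test, y_test),
    epochs=20,
    batch_size=64,
    callbacks=[tf.keras.callbacks.EarlyStopping(patience=3, restore_best_weights=True)],
)
```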
To ensure scalability and compatibility with resource-constrained environments:
- Model Conversion: TensorFlow models were converted to the TFLite format (see the conversion sketch below).
- Quantization (optional): We experimented with quantized models for faster inference, though at the cost of reduced accuracy.
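A minimal conversion sketch, assuming the trained models are saved as Keras models; the file names are placeholders, and the dynamic-range quantization shown is one common option that may differ from the exact quantization experiments mentioned above.

```python
import tensorflow as tf

# Placeholder path; substitute the actual trained model file.
model = tf.keras.models.load_model("length_model.h5")

# Plain float32 TFLite conversion.
converter = tf.lite.TFLiteConverter.from_keras_model(model)
tflite_model = converter.convert()
with open("length_model.tflite", "wb") as f:
    f.write(tflite_model)

# Optional: dynamic-range quantization for faster inference on the Pi,
# at the cost of some accuracy.
converter = tf.lite.TFLiteConverter.from_keras_model(model)
converter.optimizations = [tf.lite.Optimize.DEFAULT]
quantized_model = converter.convert()
with open("length_model_quant.tflite", "wb") as f:
    f.write(quantized_model)
```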
The TFLite models were deployed on a Raspberry Pi, achieving efficient classification within acceptable time limits.
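A possible shape of the Pi-side prediction loop, assuming the lightweight `tflite_runtime` package is installed on the Raspberry Pi (the full TensorFlow package's `tf.lite.Interpreter` works the same way); the paths and the `load_and_preprocess` helper are hypothetical.

```python
import glob
import time
import numpy as np
from tflite_runtime.interpreter import Interpreter  # lightweight TFLite runtime for the Pi

interpreter = Interpreter(model_path="length_model.tflite")  # placeholder path
interpreter.allocate_tensors()
inp = interpreter.get_input_details()[0]
out = interpreter.get_output_details()[0]

start = time.time()
for path in glob.glob("test_images/*.png"):
    image = load_and_preprocess(path)  # hypothetical helper, same preprocessing as training
    interpreter.set_tensor(inp["index"], image.astype(np.float32))
    interpreter.invoke()
    prediction = np.argmax(interpreter.get_tensor(out["index"]))
print(f"Classified images in {time.time() - start:.1f} s")
```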
- Local vs. Edge Computing: Training was performed on local machines with GPU acceleration, while inference was executed on Raspberry Pi using TFLite models.
- Performance Metrics: The system demonstrated high computational efficiency on edge devices:
  - Classification Time: ~900 seconds for 4,000 images.
  - Resource Utilization: Optimal use of CPU cores and memory during inference.
  - Score: 2,249 out of 4,000.
- Deployment Metrics:
  - Classification time on Raspberry Pi: ~900 seconds.
  - An efficient model-loading and prediction pipeline reduced resource usage and processing time.
- Optimized Multithreading: Enhance classification speed on the Raspberry Pi (a rough sketch follows this list).
- Advanced Preprocessing: Explore image segmentation and additional preprocessing techniques.
- Quantization-aware Training: Improve TFLite model performance without significant accuracy loss.
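One possible direction for the multithreading item above, sketched under the assumption that the test images can be split across worker processes with one TFLite interpreter per process; the model path and the `load_and_preprocess` helper are hypothetical, and `num_threads` requires a reasonably recent TensorFlow/TFLite version.

```python
import glob
from multiprocessing import Pool
import numpy as np
import tensorflow as tf

MODEL_PATH = "length_model.tflite"  # placeholder

def classify_chunk(paths):
    # One interpreter per worker process; interpreters should not be shared between workers.
    interpreter = tf.lite.Interpreter(model_path=MODEL_PATH, num_threads=2)
    interpreter.allocate_tensors()
    inp = interpreter.get_input_details()[0]
    out = interpreter.get_output_details()[0]
    results = []
    for path in paths:
        image = load_and_preprocess(path)  # hypothetical helper, same preprocessing as training
        interpreter.set_tensor(inp["index"], image.astype(np.float32))
        interpreter.invoke()
        results.append(int(np.argmax(interpreter.get_tensor(out["index"]))))
    return results

if __name__ == "__main__":
    paths = sorted(glob.glob("test_images/*.png"))
    chunks = [paths[i::4] for i in range(4)]  # four workers, one per Raspberry Pi core
    with Pool(processes=4) as pool:
        predictions = pool.map(classify_chunk, chunks)
```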
- Run length_model_train.py to train the length prediction model.
- Run cpatha_model/process.sh to automatically generate the models for all six lengths.
- The Pi/ folder contains the TensorFlow-to-TFLite converter and the classifier files.