Words with Cheaters

A wordfeud solver that uses a fine-tuned tesseract model to parse the board and rack from a screenshot and then solve for the highest scoring move.

Requirements

Python 3.12
Tessaract OCR (brew install tesseract)

Installation

Install dependencies:

python3 -m venv .venv
source .venv/bin/activate
pip install --upgrade pip
pip install -r requirements.txt

Set the TESSDATA_PREFIX environment variable to the dataset directory:

export TESSDATA_PREFIX=$(pwd)/dataset

Usage

Take screenshot of your wordfeud board (only works on iOS), place it in it's own directory in the screenshots directory i.e:

├── screenshots
│   ├── IMG_0083
│   │   └── screenshot.png

Run the main script to parse (with the model you copied in the installation, otherwise defaults to eng) and solve all the screenshots in the screenshots directory.

python main.py -m words-with-cheaters --solve

main.py will set up the json files for you, but you will need to validate they are accurate.

├── screenshots
│   ├── IMG_0083
│   │   ├── board.json
│   │   ├── rack.json
│   │   └── screenshot.png

The algorithm could be a lot faster but it generally solves for all possible words in <10 seconds for a 15x15 board with 7 tiles, including wild cards on an M2 in a single thread.

The way it works is to check every valid series on the board (a valid series includes exists if it touches another tile) for every length of word at and below the rack length as a pattern in the dictionary. It then checks if the rack can satisfy the resulting words before checking the whole board for validty and scoring the placement.

OCR Training

To improve the OCR training, first prepare a dataset for the OCR trainer:

Set up enough screenshot.png files with accurate board.json and rack.json files in the screenshots directory. Then run the prepare_dataset.py script to generate the training data:

python prepare_dataset.py

Clone tesstrain next to this project:

cd .. & git clone [email protected]:tesseract-ocr/tesstrain.git

Then run the following command to generate the training data:

make training MODEL_NAME=words-with-cheaters \
START_MODEL=words-with-cheaters \
TESSDATA=../words-with-cheaters/dataset \
GROUND_TRUTH_DIR=../words-with-cheaters/dataset/training

And update the model in the repository:

cp data/words-with-cheaters.traineddata ../words-with-cheaters/datasets

Finally, use your trained model to OCR the screenshots.

python main.py -m words-with-cheaters --solve

Note: The OCR will not run if there are already board.json and rack.json files in the screenshot directory.

TODO

Serve the solver as an API, running this on a smaller machine might show that a optimized algorithm is necessary.
Implement a strategy algorithm to consider:
- Word length (as there is a significant bonus to finishing as fast as possible).
- Availability of multipliers produced by the move.
- Holding high value tiles if their value isn't being maximized by multipliers.
The dictionary is not complete.
Previously used wild cards should not count for points in any future moves. This will need to be encoded in the board state.
Parse a screenshot of the board and rack to get the board state and rack, this could also solve the above.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
.github/workflows		.github/workflows
dataset		dataset
screenshots/example		screenshots/example
tests		tests
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
add_to_dictionary.py		add_to_dictionary.py
board.py		board.py
cell.py		cell.py
dictionary.py		dictionary.py
dictionary.txt		dictionary.txt
game.py		game.py
main.py		main.py
parser.py		parser.py
prepare_dataset.py		prepare_dataset.py
profile_main.py		profile_main.py
pyproject.toml		pyproject.toml
rack.py		rack.py
requirements.txt		requirements.txt
tile.py		tile.py
word.py		word.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Words with Cheaters

Requirements

Installation

Usage

OCR Training

TODO

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

jvm986/words-with-cheaters

Folders and files

Latest commit

History

Repository files navigation

Words with Cheaters

Requirements

Installation

Usage

OCR Training

TODO

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages