Skip to content

Latest commit

 

History

History
22 lines (11 loc) · 866 Bytes

README.md

File metadata and controls

22 lines (11 loc) · 866 Bytes

Arjuna: Generate Indonesian Poets and Poems using python base NLP model

Arjuna is a NLP Project to generate poets and poems based on Bahasa language. This repository is sources that used at PyCon ID 2021 talks created by @miqdude and @veronicaads.

This repository consist two main folder: scraper and model.

Scraper

This folder contains python script that used to do web scraping to https://www.kompas.id/kategori/sastra/ as dataset for the model.

You need to register to https://www.kompas.id first before use this scraper to fill the email and password parameter.

Please install all the requirements library at requirements folder : pip install -r requirements.txt

python kompas_sastra_scraper.py --user_email [email protected] --password xxxxxxxxxx --depth 5

Model