hispavista/snowball
This repository contains the source code for the Snowball compiler and the
stemming algorithms published on the Snowball website.

Generate header/source files
----------------------------
Go to the libstemmer folder and run:

./mkmodules.pl modules.h src_c modules.txt ../mkinc.mak
./mkmodules.pl modules_utf8.h src_c modules_utf8.txt ../mkinc_utf8.mak

Then go to the root folder of the project and run make.

If the commands finish without errors, you should now have a src_c folder containing the generated stemmers.
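
The generation steps above can be wrapped in a small shell function, sketched here as a convenience. The function name generate_stemmers is hypothetical and not part of the repository, and the final check assumes that src_c ends up directly under the project root; adjust the path if it lands elsewhere in your checkout.

```shell
#!/bin/sh
# Sketch only: generate_stemmers is a hypothetical helper, not shipped with the project.
# Usage: generate_stemmers /path/to/project/root
generate_stemmers() {
    root="${1:-.}"   # project root; assumed to contain the libstemmer folder
    (
        # Run the two mkmodules.pl commands from libstemmer, then make from the root.
        # The && chain stops at the first command that fails.
        cd "$root/libstemmer" &&
        ./mkmodules.pl modules.h src_c modules.txt ../mkinc.mak &&
        ./mkmodules.pl modules_utf8.h src_c modules_utf8.txt ../mkinc_utf8.mak &&
        cd .. &&
        make
    ) || { echo "generation failed" >&2; return 1; }

    # Assumption: a successful run leaves src_c directly under the project root.
    if [ -d "$root/src_c" ]; then
        echo "stemmers generated in $root/src_c"
    else
        echo "src_c not found under $root" >&2
        return 1
    fi
}
```

Calling generate_stemmers . from the project root runs all three steps and reports whether the src_c folder exists afterwards.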

To add another language, create a folder named after the language inside the "algorithms" folder and put the language's sbl file in it (stem_ISO_8859_1.sbl). Then edit the modules*.txt files in the libstemmer folder to add your language to the configuration (language UTF_8 langiso).
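
As a rough illustration of the steps for adding a language, the following shell function creates the folder, copies the sbl file, and appends a configuration line. The function name add_language and its arguments are hypothetical. The appended line follows the "language UTF_8 langiso" pattern literally, on the assumption that langiso stands for the language's ISO code; check the existing entries in the modules*.txt files before relying on it.

```shell
#!/bin/sh
# Sketch only: add_language is a hypothetical helper, not shipped with the project.
# Usage: add_language <project-root> <language-name> <path-to-sbl-file> <iso-code>
add_language() {
    root="$1"; lang="$2"; sbl="$3"; iso="$4"

    # Refuse to continue without a source .sbl file to copy.
    [ -f "$sbl" ] || { echo "no such sbl file: $sbl" >&2; return 1; }

    # Create algorithms/<language> and store the file under the expected name.
    mkdir -p "$root/algorithms/$lang" || return 1
    cp "$sbl" "$root/algorithms/$lang/stem_ISO_8859_1.sbl" || return 1

    # Append one line per modules*.txt file in libstemmer.
    # Assumption: the line format is "<language> UTF_8 <iso-code>".
    for list in "$root"/libstemmer/modules*.txt; do
        [ -f "$list" ] || continue
        printf '%s UTF_8 %s\n' "$lang" "$iso" >> "$list"
    done
}
```

After adding a language this way, regenerate the header/source files and run make again so that the new stemmer is built.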

See http://snowball.tartarus.org/ for more details.
