hfst_lookup.py gets stuck #2

Open
hfst-importer opened this issue Feb 7, 2013 · 1 comment

Comments

@hfst-importer
If I try to use the attached script to analyse the attached tokenised piece of news from hs.fi, the lookup gets stuck at:
{u'POS': [u'ADVERB'], u'WORD_ID': [u'ennen']}
{u'CASE': [u'NOM', u'PAR'], u'GUESS': [u'COMPOUND'], u'ALLO': [u'IA'], u'POS': [u'NOUN', u'NOUN'], u'NUM': [u'SG', u'PL'], u'BOUNDARY': [u'COMPOUND'], u'WORD_ID': [u'kerta', u'er\xe4']}
{u'CASE': [u'NOM'], u'NUM': [u'SG'], u'SUBCAT': [u'CARD'], u'POS': [u'NUMERAL'], u'WORD_ID': [u'155']}
{u'CASE': [u'ILL'], u'ALLO': [u'VN'], u'NUM': [u'SG'], u'POS': [u'NOUN'], u'WORD_ID': [u'miljoona']}
{u'CASE': [u'ILL'], u'ALLO': [u'VN'], u'NUM': [u'SG'], u'POS': [u'NOUN'], u'WORD_ID': [u'euro']}

That is:
"ennen kertaeriä 155 miljoonaan euroon loka-joulukuussa."

The Omorfi version used is the googlecode git master with default settings.
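
The output above covers the first five tokens of the sentence, so the hang presumably starts at one of the tokens after "euroon". A rough way to narrow it down would be to run each token through the analyser in its own process with a timeout. The sketch below is only an illustration: how hfst_lookup.py takes its input is not stated here, so the command is a parameter and is assumed to read one token from stdin. `FAKE_LOOKUP` is a made-up stand-in that pretends to hang on hyphenated tokens; it is not the real script, and the real cause of the hang is unknown.

```python
import subprocess
import sys

# Hypothetical stand-in for the real analyser command. It reads one token
# from stdin and simulates a hang on hyphenated tokens. This is an assumption
# for demonstration only, not the behaviour or interface of hfst_lookup.py.
FAKE_LOOKUP = [
    sys.executable, "-c",
    "import sys, time\n"
    "token = sys.stdin.read().strip()\n"
    "if '-' in token:\n"
    "    time.sleep(30)\n"
    "print(token)\n",
]


def find_stuck_tokens(tokens, command, timeout=5.0):
    """Return the tokens for which `command` did not finish within `timeout` seconds."""
    stuck = []
    for token in tokens:
        try:
            # Each token gets a fresh process, so one hang cannot block the rest.
            subprocess.run(
                command,
                input=token + "\n",
                capture_output=True,
                text=True,
                encoding="utf-8",
                timeout=timeout,
            )
        except subprocess.TimeoutExpired:
            # subprocess.run kills the child before raising on timeout.
            stuck.append(token)
    return stuck


if __name__ == "__main__":
    sentence = "ennen kertaeriä 155 miljoonaan euroon loka-joulukuussa ."
    print(find_stuck_tokens(sentence.split(), FAKE_LOOKUP, timeout=3.0))
```

To try this against the real script, `FAKE_LOOKUP` would be replaced with whatever command line actually runs the lookup on a single token.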

Reported by: flammie

@hfst-importer
Here is the tokenised HS text, since sf.net of course cannot upload two files at once :-\

Original comment by: flammie
