Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Unicode titles #107

Closed
egonelbre opened this issue Mar 25, 2015 · 1 comment
Closed

Unicode titles #107

egonelbre opened this issue Mar 25, 2015 · 1 comment

Comments

@egonelbre
Copy link

The current scheme of simply removing non alphanumeric letters doesn't work for titles containing only non-ansi letters. For example Öö and 世界 would end up as the same slug. Similarly @ sign and # sign.

My current approach is this https://github.com/egonelbre/fedwiki/blob/master/slug.go#L36 and examples of how it behaves https://github.com/egonelbre/fedwiki/blob/master/slug_test.go#L9.

Essentially keep all unicode letters/numbers replace common symbols with their entity equivalents, except -, ,, ., ,_, which are replaced by dashes. Also it's formatted such that there won't be double dashes or dashes around separator. Of course there's a problem, this scheme isn't backwards compatible.

Any thoughts regarding this issue? Probably it's been discussed before, but I didn't find the appropriate page.

@WardCunningham
Copy link
Member

We are deeply interested in how the slug-creation process can be made more international without breaking any interoperability between existing servers and the client codes they serve.

Please join the discussion at #103.

@paul90 paul90 closed this as completed Jul 19, 2016
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants