Skip to content

id: url is unique, title is not.#1

Open
Amallyn wants to merge 1 commit intomanticoresoftware:masterfrom
Amallyn:patch-1
Open

id: url is unique, title is not.#1
Amallyn wants to merge 1 commit intomanticoresoftware:masterfrom
Amallyn:patch-1

Conversation

@Amallyn
Copy link
Copy Markdown

@Amallyn Amallyn commented Jan 10, 2021

crc32($match['url']) is unique.
crc32($title) - Different pages can have the same title or no title at all.
Would crc32(md5($match['url'])) also be better?

Previous crawled websites would need a new database and to be crawled again to update ids.

crc32($match['url']) is unique.
crc32($title) - Different pages can have the same title or no title at all
Would crc32(md5($match['url'])) also be better?
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant