-
I have experimented with both extractive and abstractive summarization methods to let an LLM process whole webpages, but classic extractive summarization algorithms often "summarize away" important parts of the page and rip sentences out of context, while using a language model for abstractive summarization brings us back to square one: now the language model that is supposed to summarize the web page needs a context limit large enough to fit the whole page, and that context needs to fit in memory as well!

This is why I have focused on web searches, where the model mostly looks for specific pieces of information rather than broad concepts, which it can often recall from its training data. Ideally, the model would keep searching iteratively until the retrieved information is deemed sufficient to answer the user's question (like Bing®™ does), but I suspect that doing this against the DuckDuckGo API would run into rate limiting very quickly.

Appreciate you sharing the idea though, and thank you for coming to my TED talk. PS: Your Tokyo Night color theme for KDE is awesome, imma use that
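As a rough sketch (not what the project actually implements), that iterative loop could look something like the following, assuming the `duckduckgo_search` package's `DDGS.text` call (which, in the versions I've seen, returns result dicts with a `"body"` snippet field) and a hypothetical `llm` helper standing in for whatever model is answering. The sleep between rounds is a crude guard against the rate limiting mentioned above:

```python
import time
from duckduckgo_search import DDGS  # pip install duckduckgo-search


def llm(prompt: str) -> str:
    """Hypothetical stand-in for a call to the local language model."""
    raise NotImplementedError


def answer_by_searching(question: str, max_rounds: int = 3, delay_s: float = 2.0) -> str:
    """Search iteratively until the model judges the gathered snippets sufficient."""
    snippets: list[str] = []
    query = question
    for _ in range(max_rounds):
        with DDGS() as ddgs:
            results = ddgs.text(query, max_results=5)
        snippets.extend(r["body"] for r in results)
        verdict = llm(
            f"Question: {question}\n"
            "Snippets so far:\n" + "\n".join(snippets) + "\n"
            "Reply ENOUGH if these suffice, otherwise reply with a refined search query."
        )
        if verdict.strip() == "ENOUGH":
            break
        query = verdict.strip()  # model proposed a refined query
        time.sleep(delay_s)      # crude guard against rate limiting
    return llm(
        "Answer the question using only these snippets:\n"
        + "\n".join(snippets) + f"\nQuestion: {question}"
    )
```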
-
I think a good workaround for the context limit could be to have another instance of the LLM summarize the first page, then the next, and so on, until you finally ask it to condense all the summaries into one, removing duplicate information, or presenting both versions as possible answers if there is conflicting information; something like the sketch below. Not sure how well this would work, but it might do decently.
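A minimal sketch of that map-reduce style idea, assuming a hypothetical `summarize` helper that wraps a single LLM call (not any existing API):

```python
def summarize(text: str, instruction: str) -> str:
    """Hypothetical stand-in for one LLM call that follows `instruction` over `text`."""
    raise NotImplementedError


def summarize_pages(pages: list[str]) -> str:
    # Map step: summarize each page on its own, so no single call has to
    # fit more than one page into the model's context window.
    partials = [summarize(p, "Summarize this page in a few sentences.") for p in pages]
    # Reduce step: merge the per-page summaries, dropping duplicates and
    # keeping conflicting claims visible as alternative answers.
    return summarize(
        "\n\n".join(partials),
        "Merge these summaries into one. Remove duplicated information; "
        "if two summaries conflict, present both as possible answers.",
    )
```

If the joined summaries themselves overflow the context window, the reduce step could presumably be applied recursively until everything fits.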