Replies: 1 comment
-
There is an experimental memory summarization feature inside Lite that only works for instruct models. Click on the Memory tab and you will see a button for it there. You can press it to add content from your context into memory.
-
Idea for a feature. Context lengths are usually 2048 or 4096 (for now). How about a feature that splits longer inputs into 2048-token chunks? Each chunk is then summarized by the LLM down to, let's say, 256 tokens and fed back into the chat input. So if you have something like a 10000-token chat history (not counting the user input and a user-determined recent range of up to, say, 1024 tokens), the older tokens can be summarized and included in the prompt in around 1000 tokens. It's not perfect, but it could be the start of an active "long term memory" or "extended short term memory".
I've seen that the new Kobold UI only uses the most recent chat tokens and never goes back to the early chat at all. Once the chat grows past the max context length, that's it, Kobold ignores it. This rudimentary memory idea might be a solution for that (a rough code sketch follows the breakdown below).
(Breakdown of how the input would look when fed to the LLM; the token ranges depend on the model's context length and the VRAM available, in line with current context lengths.)
[Summarization of previous chat: 128 - 1024 tokens]
[Current chat: 1024 - 2048 tokens]
[User input: 256 - 1024 tokens]
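Rough sketch of how this could work (Python). `count_tokens`, `llm_generate`, and the token budgets below are placeholders/assumptions, not actual KoboldAI functions; a real version would call the existing tokenizer and generation endpoint instead.

```python
# Rough sketch of the proposed rolling-summarization memory.
# NOTE: count_tokens() and llm_generate() are placeholders, not real KoboldAI
# APIs, and the token budgets below are only illustrative defaults.

CHUNK_TOKENS = 2048    # size of each chunk of old chat to be summarized
SUMMARY_TOKENS = 256   # target length of each chunk summary
RECENT_TOKENS = 2048   # most recent chat kept verbatim
SUMMARY_BUDGET = 1024  # max tokens of summaries allowed into the prompt


def count_tokens(text: str) -> int:
    """Placeholder tokenizer: assumes roughly 4 characters per token."""
    return max(1, len(text) // 4)


def llm_generate(prompt: str, max_tokens: int) -> str:
    """Stand-in for a real generation call (e.g. an HTTP request to the
    backend). Here it just echoes a truncated tail so the sketch runs."""
    return prompt[-max_tokens * 4:]


def split_into_chunks(text: str, chunk_tokens: int) -> list[str]:
    """Greedily pack whole lines into chunks of roughly chunk_tokens tokens."""
    chunks, current, current_len = [], [], 0
    for line in text.splitlines(keepends=True):
        n = count_tokens(line)
        if current and current_len + n > chunk_tokens:
            chunks.append("".join(current))
            current, current_len = [], 0
        current.append(line)
        current_len += n
    if current:
        chunks.append("".join(current))
    return chunks


def build_prompt(full_chat: str, user_input: str) -> str:
    """Assemble [summary of old chat] + [recent chat] + [user input]."""
    # Keep the most recent RECENT_TOKENS of chat verbatim.
    lines = full_chat.splitlines(keepends=True)
    recent, used = [], 0
    while lines and used + count_tokens(lines[-1]) <= RECENT_TOKENS:
        used += count_tokens(lines[-1])
        recent.insert(0, lines.pop())
    old_chat = "".join(lines)

    # Summarize everything older, one 2048-token chunk at a time.
    summaries = []
    for chunk in split_into_chunks(old_chat, CHUNK_TOKENS):
        summaries.append(llm_generate(
            f"Summarize this chat excerpt in under {SUMMARY_TOKENS} tokens:\n"
            f"{chunk}\n\nSummary:",
            max_tokens=SUMMARY_TOKENS,
        ))

    # Drop the oldest summaries until they fit the summary budget.
    while summaries and sum(count_tokens(s) for s in summaries) > SUMMARY_BUDGET:
        summaries.pop(0)

    summary_block = ""
    if summaries:
        summary_block = "[Summary of previous chat]\n" + "\n".join(summaries) + "\n"
    return summary_block + "[Recent chat]\n" + "".join(recent) + user_input


if __name__ == "__main__":
    chat = "\n".join(f"User: message {i}\nBot: reply {i}" for i in range(3000))
    prompt = build_prompt(chat, "User: what did we talk about earlier?\n")
    print(count_tokens(prompt), "tokens in the assembled prompt")
```

When the summaries themselves exceed the budget, the oldest ones are dropped first, so the assembled prompt always stays within the model's context length.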