Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[klmc2] per-prompt momentum #5

Open
dmarx opened this issue Jan 14, 2023 · 2 comments
Open

[klmc2] per-prompt momentum #5

dmarx opened this issue Jan 14, 2023 · 2 comments

Comments

@dmarx
Copy link
Owner

dmarx commented Jan 14, 2023

No description provided.

@dmarx
Copy link
Owner Author

dmarx commented Jan 14, 2023

i think it'll start with breaking out the initialization of v into a separate v for each prompt, just before i in trange(n)

    v = torch.randn_like(x) * sigma
    vs = [v.clone() for _ in prompts] # per-prompt momentum

the complexity comes from taking the step, which generates a new velocity. maybe i need to project the new velocity onto the old one as a component? rotate the velocity with PCA and use the principal basis?

@dmarx dmarx closed this as completed Jan 14, 2023
@dmarx dmarx reopened this Jan 14, 2023
@dmarx
Copy link
Owner Author

dmarx commented Jan 14, 2023

or maybe the hessian is what I need? maybe I could even cheat some and rotate the hessian towards the prompts proportionate to their weights? orient the second order curvature?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant