Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Applying Block Movement Pruning for BART #40

Open
apurvnagvenkar opened this issue Jul 21, 2022 · 1 comment
Open

Applying Block Movement Pruning for BART #40

apurvnagvenkar opened this issue Jul 21, 2022 · 1 comment

Comments

@apurvnagvenkar
Copy link

Hi,
I am working to prune BART model for seq2seq purpose. Currently, I have replaced this code with BART based functionalities. After executing I am getting drop in number of parameters for both attention and FFN but dimension reduction happens only for FFN which results in slowness. My questions are following:

  1. Is this right code to refer to or should I follow this command_line.py?
  2. Is there any existing code which works for BART based models for Conditonal Generation or Seq2Seq?
@robotsp
Copy link

robotsp commented Mar 8, 2023

Hi, I am working to prune BART model for seq2seq purpose. Currently, I have replaced this code with BART based functionalities. After executing I am getting drop in number of parameters for both attention and FFN but dimension reduction happens only for FFN which results in slowness. My questions are following:

  1. Is this right code to refer to or should I follow this command_line.py?
  2. Is there any existing code which works for BART based models for Conditonal Generation or Seq2Seq?

I am doing the same thing as you. Did you fix the problem? @apurvnagvenkar

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants