Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
모델 동작까지 확인하였습니다.
다만 성능이 예상보다 많이 떨어집니다.
기존 monologg님의 distilkobert를 encoder와 decoder로 사용하였습니다.
encoder layer 3개, decoder layer 3개 로써 기존 distilbert 모델 그대로 사용하였습니다.
5에폭에 rougelsum은 12가 약간 넘는 수치였습니다.
대신 eval time이 기존 bart모델의 encoder layer 3, decoder layer 3 보다는 빨랐습니다.
https://wandb.ai/final_project/optimization/runs/z88c6i2r?workspace=user-abbymark