Transformer²: Self-adaptive LLMs 🐙

📚 [Paper] | 📄 [Blog]

Self-adaptive large language models (LLMs) address the limitations of traditional fine-tuning, which is computationally intensive and static in its ability to handle diverse tasks.

We are excited to introduce Transformer², a novel self-adaptation framework that adapts LLMs for unseen tasks in real-time by selectively adjusting only the singular components of their weight matrices. During inference, Transformer² employs a two-pass mechanism: first, a dispatch system identifies the task properties, and then task-specific "expert" vectors, trained using reinforcement learning, are dynamically mixed to obtain targeted behavior for the incoming prompt.
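The two-pass mechanism can be sketched in a few lines of NumPy. Everything below is illustrative, not the repository's implementation: the dispatch logits, expert vectors, and matrix sizes are made up, and in the real system the expert vectors come from RL training and the weights belong to the LLM. The sketch only shows the core idea of mixing expert vectors and rescaling a weight matrix's singular values.

```python
import numpy as np

rng = np.random.default_rng(0)

def softmax(v):
    e = np.exp(v - v.max())
    return e / e.sum()

def adapt(W, expert_zs, alphas):
    """Second pass: rescale only the singular values of W."""
    U, S, Vt = np.linalg.svd(W, full_matrices=False)
    z = sum(a * zk for a, zk in zip(alphas, expert_zs))  # mixed expert vector
    return U @ np.diag(S * z) @ Vt

# First pass: a dispatcher scores how well the prompt matches each task
# (made-up logits for three hypothetical tasks, e.g. math / code / reasoning).
task_logits = np.array([2.0, 0.5, -1.0])
alphas = softmax(task_logits)

# Three toy task-specific expert vectors, one scale per singular value.
W = rng.standard_normal((8, 4))
experts = [rng.uniform(0.5, 1.5, size=4) for _ in range(3)]

W_adapted = adapt(W, experts, alphas)
print(W_adapted.shape)  # (8, 4)
```

Note that with all-ones expert vectors the adapted matrix reconstructs W exactly, since only the singular values are touched.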

Installation

1. Clone the Repo

git clone https://github.com/SakanaAI/self-adaptive-llms
cd self-adaptive-llms

2. Install Libraries

conda create -n t2 python=3.11 -y
conda activate t2
pip install --upgrade pip
pip install -r requirements.txt

3. Install the Task Evaluator

cd evaluation/fishfarm
pip install -e .

Usage

We provide example scripts for both training and evaluation.

Edit the arguments in the provided scripts to choose among models and tasks.

Training

bash scripts/train_task_expert.sh
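To give a feel for what training an expert vector means, here is a toy REINFORCE-style (evolution-strategies-flavoured) sketch in NumPy. It is an assumption-laden stand-in, not the repository's training loop: the reward, matrix sizes, and the "true" expert vector are synthetic, whereas the real system optimizes task performance of the full LLM with reinforcement learning.

```python
import numpy as np

rng = np.random.default_rng(0)

def svf_forward(W, z, x):
    # Apply an expert vector z as per-singular-value scales of W.
    U, S, Vt = np.linalg.svd(W, full_matrices=False)
    return U @ ((S * z) * (Vt @ x))

# Toy reward: negative distance to the output under a "true" expert vector.
W = rng.standard_normal((6, 4))
x = rng.standard_normal(4)
target = svf_forward(W, np.array([1.5, 0.5, 1.0, 1.0]), x)

def reward(z):
    return -np.linalg.norm(svf_forward(W, z, x) - target)

z = np.ones(4)          # start from identity scaling (no adaptation)
sigma, lr = 0.1, 0.3
initial = reward(z)
for _ in range(300):
    eps = rng.standard_normal(4) * sigma
    # Antithetic REINFORCE-style estimate of the reward gradient.
    z += lr * (reward(z + eps) - reward(z - eps)) * eps
final = reward(z)
```

After training, the learned expert vector should improve the toy reward relative to the unadapted starting point.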

Evaluation

Prompt-based evaluation

Classification experts can be loaded by specifying the CLS_EXPERT_PATH in the script.

bash scripts/eval_prompt_based.sh

Few-shot evaluation

bash scripts/eval_few_shot.sh

Citation

If you find Transformer² useful for your research, please cite using this BibTeX:

@misc{sun2025transformersquaredselfadaptivellms,
      title={Transformer-Squared: Self-adaptive LLMs}, 
      author={Qi Sun and Edoardo Cetin and Yujin Tang},
      year={2025},
      eprint={2501.06252},
      archivePrefix={arXiv},
      primaryClass={cs.LG},
      url={https://arxiv.org/abs/2501.06252}, 
}
