English | 中文
Using IRL (Inverse Reinforcement Learning) to train a reward function for Rubik's Cube and other puzzles' states.
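The core idea is to learn a scalar reward (preference) function over scramble states from pairwise human preferences. The snippet below is only a rough illustration of that idea, not this repository's actual model or training code; the `RewardModel` class, the 54-dimensional state encoding, and all hyperparameters are assumptions made for the example.

```python
import torch
import torch.nn as nn

class RewardModel(nn.Module):
    """Scores an encoded puzzle state; higher means a more preferred scramble."""
    def __init__(self, state_dim: int, hidden: int = 64):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(state_dim, hidden),
            nn.ReLU(),
            nn.Linear(hidden, 1),
        )

    def forward(self, state: torch.Tensor) -> torch.Tensor:
        return self.net(state).squeeze(-1)

def preference_loss(model, preferred, rejected):
    """Bradley-Terry style pairwise loss: preferred scrambles should score higher."""
    return -torch.nn.functional.logsigmoid(model(preferred) - model(rejected)).mean()

# Toy usage with random vectors standing in for encoded scrambles (hypothetical encoding).
model = RewardModel(state_dim=54)
opt = torch.optim.Adam(model.parameters(), lr=1e-3)
preferred, rejected = torch.randn(32, 54), torch.randn(32, 54)
opt.zero_grad()
loss = preference_loss(model, preferred, rejected)
loss.backward()
opt.step()
```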
Collect human preference data through the custom cstimer
After solving a scramble, click prefer
to toggle the preference against the last scramble, or use the shortcuts Ctrl + J
and Ctrl + K
to mark it as preferred or not preferred.
An up arrow will appear beside the result, showing that this scramble is preferred over the last one.
All timing methods should be supported; if you run into any problem using the custom cstimer, please open an issue here.
Use cstimer's built-in Export to file
function to export the data file, which will be used in the training step.
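For the training step, the exported file has to be parsed back into consecutive-solve pairs with the recorded preference. The loader below is only a hedged sketch: the exact layout of the custom cstimer export (session keys, solve record fields, where the preference flag is stored) and the file name are assumptions and should be checked against a real export.

```python
import json
from pathlib import Path

def load_solve_pairs(path: str):
    """Hypothetical loader for the exported preference data."""
    data = json.loads(Path(path).read_text(encoding="utf-8"))
    pairs = []
    for key, solves in data.items():
        if not isinstance(solves, list):
            continue  # skip non-session entries such as settings/metadata
        # Assumption: consecutive solves in a session form (previous, current) pairs,
        # and the custom build stores a per-solve preference flag inside each record.
        pairs.extend(zip(solves, solves[1:]))
    return pairs

pairs = load_solve_pairs("cstimer_export.txt")  # hypothetical file name
print(f"loaded {len(pairs)} consecutive-solve pairs")
```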
First, create a config.py
from the template:
cp config.py.template config.py
Then customize your settings in config.py.
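As an illustration, a filled-in config.py might look like the following; the field names and values here are made up for the example, and the real options are whatever config.py.template defines.

```python
# config.py -- illustrative values only; copy config.py.template and
# adjust the options that the template actually defines.
data_path = "cstimer_export.txt"    # exported preference data (hypothetical name)
batch_size = 32
learning_rate = 1e-3
epochs = 50
onnx_output = "reward_model.onnx"   # where the exported ONNX model would be written
```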
# main.py
from IPython import embed

workspace = train.Workspace(cfg, args)
embed()
# instead of rigid sequential calls:
# workspace.train()
# workspace.infer()
# workspace.to_onnx()
By using IPython.embed()
to start an interactive shell, together with techniques like reloading the Workspace
class instance, we have made interactive training and ONNX export much more convenient. It also allows modifying training hyperparameters at runtime, changing the implementation of workspace interfaces, and so on; a rough sketch of this workflow is shown below.
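For example, one way to pick up edited Workspace code from inside the embed() shell is to reload the module and rebind the live instance. This is only a sketch under the naming used in the main.py snippet above; cfg.lr is a hypothetical hyperparameter field.

```python
# Inside the embed() shell (sketch; assumes the names from main.py above).
import importlib
import train

train = importlib.reload(train)         # re-import train.py after editing it
workspace.__class__ = train.Workspace   # rebind the live instance to the reloaded class
cfg.lr = 1e-4                           # hypothetical: tweak a hyperparameter at runtime
workspace.train()                       # continue training with the new code/settings
```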
Currently, in our custom cstimer, we have implemented preference value inference for the Rubik's clock state. Special thanks to ZZY and JYH from Northwestern Polytechnical University (NWPU) for providing the initial data.