Ferule

Ferule is a pun that explains the role of this library well. Like the "motivating ruler" that once stood for rigor in education, it brings together tools for measuring and comparing the performance of neural networks.

The project builds on the functionality of the TVM engine. The library uses an RPC connection to deliver data to a mobile device and take measurements on it. The following tools are currently available:

cross-compile lets you tune a network and get the source object for one device, then measure the performance of the compiled network on another;
config-overlay finds the best configuration for multiple devices.

Quickstart

Run the following commands inside the repository:

foo@bar:~$ python -m pip install --upgrade pip build
foo@bar:~$ python -m build --sdist --wheel
foo@bar:~$ python -m pip install . --prefer-binary --force-reinstall --find-links dist/

The tools are now available from the command line:

foo@bar:~$ cross-compile
Usage: cross-compile [OPTIONS] COMMAND [ARGS]...

You can also use the library tools without installing them (preferable if you want to change tuning settings manually):

foo@bar:~$ export PYTHONPATH="${PYTHONPATH}:${HOME}/Ferule/src"
foo@bar:~$ python -m ferule.cross_compile

cross-compile

A program for cross-compilation and for collecting inference statistics. It works in two modes: tune and exec.

cross-compile tune tunes the neural network with the selected autotuner, compiles the model, and measures its run time. Detailed information is available via the --help flag.

foo@bar:~$ cross-compile tune atvm mace_mobilenet_v1 -p 9090 -k sd888

cross-compile exec measures the execution time of a source object on the given device. It can also accept the JSON produced by tuning, in which case it compiles the model first (specify --target and --target_host if needed).

foo@bar:~$ cross-compile exec atvm sd888.mace_mobilenet_v1.float32.atvm.so -p 9090 -k kirin710
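The file names used in the examples of this README appear to follow the pattern device key, model, data type, tuner, extension, separated by dots. That pattern is inferred from the examples only and is not documented behavior. A minimal Python sketch under that assumption (parse_artifact_name and ArtifactName are hypothetical helpers, not part of Ferule):

```python
from typing import NamedTuple


class ArtifactName(NamedTuple):
    """Parts of an artifact file name (assumed layout, inferred from examples)."""
    device_key: str
    model: str
    dtype: str
    tuner: str
    extension: str


def parse_artifact_name(filename: str) -> ArtifactName:
    """Split '<key>.<model>.<dtype>.<tuner>.<ext>' into its five parts.

    Assumption: none of the parts contains a dot. A model name with dots
    would need a different parsing rule.
    """
    parts = filename.split(".")
    if len(parts) != 5:
        raise ValueError(f"unexpected artifact name: {filename!r}")
    return ArtifactName(*parts)


if __name__ == "__main__":
    name = parse_artifact_name("sd888.mace_mobilenet_v1.float32.atvm.so")
    print(name.device_key, name.model, name.dtype, name.tuner, name.extension)
```

In the command above, the object tuned for the sd888 key is measured on the device registered under the kirin710 key, so the device key in the file name and the -k value may differ.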

Additionally, the cross-compile view command displays the path to the folder where models, logs, and source objects are saved.

config-overlay

This command finds the configuration that performs best across multiple devices for the selected layers. For this, the following formula is used:

where $N$ is the number of devices, $t_i$ is the inference time of the selected configuration on the $i$-th device, and $\theta_i$ is the best time of the $i$-th device on the current layer.

foo@bar:~$ config-overlay meizu.mace_mobilenet_v1.float32.atvm.json htc.mace_mobilenet_v1.float32.atvm.json -k meizu -k htc -p 9090 --layer 6
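The formula itself does not appear in this text, so the following Python sketch only illustrates how a metric of this kind could be used to pick a shared configuration. It is not Ferule's implementation. It assumes the metric is the mean of $t_i / \theta_i$ over the $N$ devices, which is a guess consistent with the symbol definitions above. The function names and all timing numbers are made up for the example.

```python
from typing import Mapping, Sequence


def mean_relative_slowdown(times: Sequence[float],
                           best_times: Sequence[float]) -> float:
    """Assumed metric: average of t_i / theta_i over all devices.

    A value of 1.0 means the configuration matches the best known time
    on every device; larger values mean a larger average slowdown.
    """
    if not times or len(times) != len(best_times):
        raise ValueError("need one time and one best time per device")
    ratios = [t / theta for t, theta in zip(times, best_times)]
    return sum(ratios) / len(ratios)


def pick_shared_config(candidates: Mapping[str, Sequence[float]],
                       best_times: Sequence[float]) -> str:
    """Return the candidate name with the lowest assumed metric."""
    return min(candidates,
               key=lambda name: mean_relative_slowdown(candidates[name],
                                                       best_times))


if __name__ == "__main__":
    # Illustrative per-device times (ms) for one layer on two devices.
    best_times = [1.0, 2.0]          # theta_i: best time on each device
    candidates = {
        "cfg_a": [1.0, 3.0],         # best on device 1, slow on device 2
        "cfg_b": [1.2, 2.2],         # close to the best on both devices
        "cfg_c": [2.0, 2.0],         # best on device 2, slow on device 1
    }
    print(pick_shared_config(candidates, best_times))
```

Under this assumed metric, a configuration that is slightly slower than the best on every device can win over one that is the best on a single device and much slower elsewhere.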

Running the command produces logs and the optimal configuration for the specified layers. Graphs for visual evaluation are also built.

[Figures: "best config" (left), "configs sorted by metrics" (right)]

The first graph marks the best configuration among all configurations, which are sorted by inference time on the first device. The second graph shows the configurations sorted by the previously defined metric.
