Guitar to Multi Hot Piano #6

Open
anthonio9 opened this issue Jan 27, 2024 · 5 comments
@anthonio9 (Owner)
Instead of using 6 * 1440 bins on the output, use one vector that represents all strings together. With that in mind, apply the following as well:

  • use sigmoid instead of softmax
  • use binary cross-entropy instead of categorical cross-entropy

Work work work!
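A minimal sketch of what the single-vector head could look like, assuming 1440 pitch bins (from the 6 * 1440 figure above); `MultiHotHead` and `embedding_dim` are hypothetical placeholders, not the actual model code:

```python
import torch.nn as nn

PITCH_BINS = 1440  # assumed from the 6 * 1440 figure above

class MultiHotHead(nn.Module):
    """One output vector for all six strings (hypothetical sketch)."""

    def __init__(self, embedding_dim=512):  # embedding_dim is a placeholder
        super().__init__()
        self.projection = nn.Linear(embedding_dim, PITCH_BINS)

    def forward(self, embedding):
        # Return raw logits: sigmoid (not softmax) is applied at inference,
        # and training uses binary cross-entropy on these logits.
        return self.projection(embedding)
```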

@anthonio9 anthonio9 self-assigned this Jan 27, 2024
@anthonio9 (Owner) commented Jan 27, 2024

There are multiple ways to achieve this:

  • take the current 6 one-hot vectors describing the pitch and merge them into one vector. This approach seems fine, except that every string with no pitch gets a randomized bin.
  • combine all one-hot vectors into one, but randomize a bin only if all strings are mute. A bit harder to achieve in code, but it may give better end results (see the sketch below).

Let's try both.

But then, how should this be evaluated? The training part is easy, because the network will progress with the loss function, but real evaluation is heavier. At first, all evaluation should be dropped; only the training loss will matter, together with the plots of the logits and the ground truth.
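A sketch of how the merged target could be built, assuming the per-string labels come as a (6, PITCH_BINS) one-hot tensor with all-zero rows for mute strings; the function name is hypothetical. It covers the plain union plus the all-mute randomization of the second approach (the per-string randomization of the first approach would happen before the merge):

```python
import torch

def multi_hot_target(one_hot_per_string, randomize_if_all_mute=False):
    """Collapse six per-string one-hot vectors (6, PITCH_BINS) into one
    multi-hot vector (PITCH_BINS,). Hypothetical sketch of both variants."""
    # Union of active bins; mute strings (all-zero rows) contribute nothing
    target = one_hot_per_string.amax(dim=0)
    # Second approach: randomize a bin only when every string is mute
    if randomize_if_all_mute and target.sum() == 0:
        target[torch.randint(target.shape[-1], (1,))] = 1.0
    return target
```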

@anthonio9 (Owner) commented Jan 27, 2024

This log-sum-exp trick may be helpful for enhancing the loss function with sigmoid. But first, how to apply sigmoid with binary_cross_entropy so that cuda.amp.GradScaler works fine?

EDIT: it seems that binary_cross_entropy_with_logits is good enough as a replacement for sigmoid plus BCE.
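For context: torch disallows F.binary_cross_entropy inside an autocast region because sigmoid followed by BCE can overflow in fp16, while binary_cross_entropy_with_logits fuses the two via the log-sum-exp trick and is autocast-safe. A minimal sketch of the training step, with `model`, `batch`, `targets`, and `optimizer` as placeholders:

```python
import torch
import torch.nn.functional as F

scaler = torch.cuda.amp.GradScaler()

with torch.autocast(device_type='cuda', dtype=torch.float16):
    logits = model(batch)  # raw, pre-sigmoid outputs
    # Fused sigmoid + BCE; numerically stable under fp16 autocast
    loss = F.binary_cross_entropy_with_logits(logits, targets)

scaler.scale(loss).backward()
scaler.step(optimizer)
scaler.update()
```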

anthonio9 added a commit that referenced this issue Jan 27, 2024
No ground truth plotting and no real metrics yet. Just loss.

related to: #6
@anthonio9 (Owner) commented

The 1st approach is partially implemented: plotting does not show any ground truth yet, and the only metric tracked is the loss from the loss function. It's a small step forward!

@anthonio9 (Owner) commented

So how should I go about the post-processing? Here's one way: simply find up to 6 peaks, because torch.topk() does not really work well for this application: https://discuss.pytorch.org/t/pytorch-argrelmax-or-c-function/36404/2
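A sketch of that idea, assuming a 1-D logits vector and treating a bin as a peak when it beats both neighbors and a probability threshold, then keeping at most 6 of the strongest (one per string); the function name and threshold are assumptions:

```python
import torch
import torch.nn.functional as F

def find_peaks(logits, max_peaks=6, threshold=0.5):
    """Argrelmax-style peak picking on sigmoid probabilities (sketch)."""
    probs = torch.sigmoid(logits)            # shape: (PITCH_BINS,)
    left = F.pad(probs, (1, 0))[:-1]         # left neighbor of each bin
    right = F.pad(probs, (0, 1))[1:]         # right neighbor of each bin
    is_peak = (probs > left) & (probs > right) & (probs > threshold)
    peak_bins = is_peak.nonzero(as_tuple=True)[0]
    # Keep at most `max_peaks` strongest peaks (one per guitar string)
    if peak_bins.numel() > max_peaks:
        keep = probs[peak_bins].topk(max_peaks).indices
        peak_bins = peak_bins[keep]
    return peak_bins
```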

anthonio9 added a commit that referenced this issue Feb 15, 2024
@anthonio9 (Owner) commented

Finding peaks was a struggle, and the result is far from perfect. I think now is a good time to abandon this idea.

[image attachment]

anthonio9 added a commit that referenced this issue Feb 16, 2024