Fast Realtime Pitch Detection

v1 Baseline v2 Worklet v3 Worklet+Worker+Gate

This demo runs a neural network specialized for fast pitch detection of human voices. It is based on the FCPE model, with the audio extractor rewritten in browser-compatible WASM and the ONNX model quantised to FP16 to minimise latency.

Paper: https://arxiv.org/abs/2509.15140

Model: nn/fcpe.single.fp16.onnx

(not initialized) Backend WASM threads: -- Force 1 thread MIDI out (quantized semitones)

Voicing threshold thr=0.0060 conf=0.0000 (below)

Sample demos

Choose a sample, run FCPE, and inspect note + Hz directly on the waveform.

Choose a sample to detect pitches