fix(classifier): refuse to guess kits + source-named folders; fix sept1 morpher
PLN-flagged chain of labeling errors, traced to the SSOT classifier: - 'jazz' was matched to BREAK, but jazz is a multisample KIT (jazz:0=kick, :1=snare/hat…). A folder name is not a reliable family signal: it may be one family, a heterogeneous kit, or a demucs grab named after a SOURCE song (wap, take5, the_revolution, xplosive, rample*). classify_sample_family now fires ONLY on names that lexically encode an instrument; everything else is None (= needs per-sample analysis). No 'kit registry' (that's name-guessing too). - removed over-reaching genre/source tokens: jazz, dnb, jungle, loop from break; drum from perc. This also FIXES jungle_pads (→pad, was break) and jungle_vocals (→vox). amen kept (amencutup genuinely is the Amen break). - tempo: strip Tidal '--' line comments before parsing cps (ton_numero's commented-out morpher no longer counts); a track with a live 'cps (range …)' is now flagged morph even when it also declares a fixed setcps. morphing=1 (Septembre 1er, 60→180), was 0. - report: + stage_tempo_by_year, sources/roadmap, recurrence gig_slugs, classified/unclassified coverage (21% of palette uses need analysis, honest). - tests: classifier refuses kit/source names; jungle_pads→pad guard. 60 green.
Showing
This diff is collapsed.
Click to expand it.
Please
register
or
sign in
to comment