https://github.com/Music-and-Culture-Technology-Lab/omnizart and https://basicpitch.spotify.com/
They work better if you apply some source separation before (e.g, https://github.com/sigsep/open-unmix-pytorch, https://github.com/facebookresearch/demucs, or https://mvsep.com)
Still, I think the best results are from proprietary models (specifically https://www.ableton.com/en/manual/converting-audio-to-midi/ and https://www.celemony.com/en/melodyne/what-is-melodyne)
https://github.com/Music-and-Culture-Technology-Lab/omnizart and https://basicpitch.spotify.com/
They work better if you apply some source separation before (e.g, https://github.com/sigsep/open-unmix-pytorch, https://github.com/facebookresearch/demucs, or https://mvsep.com)
Still, I think the best results are from proprietary models (specifically https://www.ableton.com/en/manual/converting-audio-to-midi/ and https://www.celemony.com/en/melodyne/what-is-melodyne)