Optical music recognition

Optical music recognition (OMR) or Music OCR is the application of optical character recognition to interpret sheet music or printed scores into editable or playable form. Once captured digitally, the music can be saved in commonly used file formats, e.g. MIDI (for playback) and MusicXML (for page layout).

History

Early research into recognition of printed sheet music was performed at the graduate level in the late 1960s at MIT and other institutions.[1] Successive efforts were made to localize and remove musical staff lines leaving symbols to be recognized and parsed. The first commercial music-scanning product, MIDISCAN (now SmartScore), was released in 1991 by Musitek corporation.

Unlike OCR of text, where words are parsed sequentially, music notation involves parallel elements, as when several voices are present along with unattached performance symbols positioned nearby. Therefore, the spatial relationship between notes, expression marks, dynamics, articulations and other annotations is an important part of the expression of the music.

Proprietary software

  • capella-scan[2]
  • ForteScan Light by Fortenotation[3]
  • MIDI-Connections Scan by MIDI-Connections[4]
  • MP Scan by Braeburn.[5] Uses SharpEye SDK.
  • NoteScan bundled with Nightingale[6]
  • OMeR (Optical Music easy Reader) Add-on for Harmony Assistant and Melody Assistant: Myriad Software[7] (ShareWare)
  • PhotoScore by Neuratron.[8] The Light version of PhotoScore is used in Sibelius. PhotoScore uses the SharpEye SDK.
  • ScoreMaker by Kawai[9]
  • Scorscan by npcImaging.[10] Based on SightReader(?)
  • SharpEye By Visiv[11]
    • VivaldiScan (same as SharpEye)[12]
  • SmartScore By Musitek.[13] Formerly packaged as "MIDISCAN". (SmartScore Lite is used in Finale).

Free/open source software

Similar but different

PDFtoMUSIC by Myriad is often seen as a Music OCR software, but it does actually no optical character recognition. The program simply reads PDF files which have been created by some scorewriter, locates the musical glyphs which have been written directly as characters of a music notation font. The optical recognition consists of concluding the musical relationship of those glyphs from their relative position in space, i.e. on the logical page of the PDF document, and combine those to a musical score. Only the PRO version can export this to a MusicXML file, while the standard version works only for the scorewriters by Myriad.[16]

See also

  • Music information retrieval (MIR) is the broader problem of retrieving music information from media including music scores and audio.
  • Optical character recognition (OCR) is the recognition of text which can be applied to document retrieval, analogously to OMR and MIR. However, a complete OMR system must faithfully represent text that is present in music scores, so OMR is in fact a superset of OCR.[17]

References

  1. Pruslin, Dennis Howard (1966). "Automatic Recognition of Sheet Music" (PDF). Retrieved 2007-01-24.
  2. Info capella-scan
  3. FORTE Scan Light Archived 2013-09-22 at the Wayback Machine.
  4. MIDI-Connections SCAN 2.0 Archived 2013-12-20 at the Wayback Machine.
  5. Music Publisher Scanning Edition
  6. NoteScan
  7. OMeR
  8. PhotoScore Ultimate 7
  9. Scoremaker (Japanese) Archived 2013-10-08 at Archive.is
  10. ScorScan
  11. SharpEye
  12. VivaldiScan
  13. SmartScore Archived 2012-04-17 at the Wayback Machine.
  14. Audiveris - Github page
  15. OpenOMR
  16. "PDFtoMusic Pro". myriad-online.com. 2015. Retrieved 13 November 2015.
  17. Bainbridge, David; Bell, Tim (2001). "The challenge of optical music recognition" (PDF). Computers and the Humanities. 35.2: 95–121. Retrieved 23 February 2017.
This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.