Xuedong Huang

Xuedong Huang
Xuedong Huang
Born	October 20, 1962; Hunan, China
Citizenship	American (since 1995)
Alma mater	Hunan University; Tsinghua University; Edinburgh University
Awards	Wired Magazine Next List 2016 ; 2011 Asian American Engineer of the Year ; IEEE 1993 Paper Award; Allen Newell Research Excellence Medal
	Scientific career
Fields	Speech Recognition; CNTK; Natural Language Processing; Software Development
Institutions	Microsoft; Carnegie Mellon University
Doctoral advisor	Mervyn Jack

Xuedong Huang (Simplified Chinese: 黄学东, born October 20, 1962) is a Chinese-American computer scientist and the key person behind Microsoft's spoken language processing technologies. He is a Microsoft Technical Fellow and company's Chief Speech Scientist. Wired magazine named him one of 25 Geniuses in Next List 2016. ^[1]

Background

Huang grew up in Hunan, China and became a US citizen in 1995. He received his B.S. degree in computer science from Hunan University in 1982, his MS in computer science from Tsinghua University in 1984, and his PhD in Electrical Engineering from the University of Edinburgh in 1989.

Career

He joined Carnegie Mellon University in 1989 and worked with Raj Reddy and Kai-Fu Lee on speech recognition. At CMU, Huang directed Sphinx-II speech system research and achieved the best performance in every category of DARPA's 1992 benchmarking. He received the Allen Newell research excellence medal for his leadership in speech recognition in 1992, and IEEE Speech Processing Best Paper Award in 1993. He was elected to be a Fellow of IEEE in 2000, and a Fellow of ACM in 2017.^[2]

Huang has co-authored over 100 papers and two books: Hidden Markov Models for Speech Recognition, (1987) and Spoken Language Processing, Prentice Hall](2000). In 2014 he coauthored a historical speech recognition review with Raj Reddy and James K. Baker for Communications of the ACM that reflected several generations of speech research.^[3] In 2016, he led his team reaching a historical human parity milestone in transcribing conversational speech on the Switchboard task. ^[4] In 2018, he led his teams achieving more historical human parity milestones in Chinese to English Machine Translation on the WMT-2017 task ^[5] and real-time natural Text-to-Speech services on Azure. ^[6]

He is best known for founding and leading Microsoft's speech and language initiatives as well as his pioneering work on Microsoft's multimodal interactive MiPad prototype ^[7] as Bill Gates demonstrated at the Consumer Electronics Show in 2001. Huang was instrumental in introducing Microsoft's Speech Application Programming Interface (SAPI) in 1995 and numerous speech and language services since then. From 2000 to 2004, Huang served as the general manager of Microsoft's Speech Platforms Group and shipped Microsoft Speech Server and other voice technologies used in Microsoft Windows, Microsoft Office, Windows Mobile and Microsoft Exchange Server. Microsoft Response Point received 2009's Technology of the Year Awards as the best VOIP phone system from the InfoWorld Magazine.^[8] From 2009 to 2014, he served as the Chief Architect for Bing. He is currently a CVP leading Microsoft's world-wide Speech and Language responsible for Microsoft Cortana, Microsoft Translator, Office 365, Microsoft Windows, Microsoft Azure, and many 3rd parties' speech and translation services.

TV and books

Robert MacNeil, William Cran, Robert McCrum (2005). Do You Speak American? page 191-197, Harcourt Trade
PBS TV: Do You Speak American? 2005
Xuedong Huang, Alex Acero, Hsiao-Wuen Hon (2001). Spoken Language Processing: a guide to theory, algorithm, and system development, page 1-980. Prentice Hall
Xuedong D Huang, Yasuo Ariki, Mervyn A Jack (1990). Hidden Markov Models for Speech Recognition, Edinburgh University Press

References

↑ https://www.wired.com/2016/04/wired-nextlist-2016/
↑ "ACM Recognizes New Fellows", Communications of the ACM, 60 (3): 23, March 2017, doi:10.1145/3039921 .
↑ http://cacm.acm.org/magazines/2014/1/170863-a-historical-perspective-of-speech-recognition/fulltext
↑ https://blogs.microsoft.com/next/2016/10/18/historic-achievement-microsoft-researchers-reach-human-parity-conversational-speech-recognition/
↑ https://blogs.microsoft.com/ai/machine-translation-news-test-set-human-parity/
↑ https://azure.microsoft.com/en-us/blog/microsoft-s-new-neural-text-to-speech-service-helps-machines-speak-like-people/
↑ http://www.microsoft.com/en-us/news/features/2000/05-22mipad.aspx
↑ "Archived copy". Archived from the original on February 20, 2009. Retrieved February 9, 2009.

External links

Thoughts on Microsoft's Response Point, "ZDNet", Sept. 24, 2008
Microsoft's Xuedong Huang Delivers Keynote at ITEXPO West, Emphasizes Telephony Ease-of-Use and Describes Future Natural User Interface, Sept. 17, 2008
Microsoft Bolsters Its VoIP Strategy With Response Point Service Pack, "Information Week", March 24, 2008
Innovation Spotlight: Q&A With Xuedong Huang On Microsoft Response Point 2007
Microsoft Launches Breakthrough PBX for SMBs, "EWEEK", March, 2007
Huang appointed as Hunan University's Honorary Dean of Software School "Hunan University News", Nov 5, 2005 (in Chinese)
Microsoft Gets Futuristic With Speech Server 2004 March, 2004
Microsoft Peppers the Market with Integrated Voice Solutions March, 2004
Why it's getting easier to talk to your PC "ZDNET", Nov 14, 2002
The Year Ahead for Microsoft, Microsoft PressPass, Dec 18, 2001
Speech Recognition in Microsoft Office XP Increases Productivity, Efficiency for Chinese and Japanese Users, Microsoft PressPass, Aug. 16, 2001
MiPad: Speech Powered Prototype Listens and Learns Microsoft PressPass, May 22, 2000
Gates demos Microsoft MiPad "ZDNET", Oct 13, 2000
Microsoft's speech guru Xuedong Huang says that a "conversational interface" will make computing dramatically easier early next century. Newsweek, March 9, 1998
BEYOND TALKING BARNEY BusinessWeek, Feb 23, 1998
Talking Face To Interface. Seattle Times, June 1, 1997

This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.

[1] ttps://www.wired.com/2016/04/wired-nextlist-2016/

[2] "ACM Recognizes New Fellows", Communications of the ACM, 60 (3): 23, March 2017, doi:10.1145/3039921 .

[3] ttp://cacm.acm.org/magazines/2014/1/170863-a-historical-perspective-of-speech-recognition/fulltext

[4] ttps://blogs.microsoft.com/next/2016/10/18/historic-achievement-microsoft-researchers-reach-human-parity-conversational-speech-recognition/

[5] ttps://blogs.microsoft.com/ai/machine-translation-news-test-set-human-parity/

[6] ttps://azure.microsoft.com/en-us/blog/microsoft-s-new-neural-text-to-speech-service-helps-machines-speak-like-people/

[7] ttp://www.microsoft.com/en-us/news/features/2000/05-22mipad.aspx

[8] "Archived copy". Archived from the original on February 20, 2009. Retrieved February 9, 2009.

Xuedong Huang

Born	(1962-10-20) October 20, 1962 Hunan, China
Citizenship	American (since 1995)
Alma mater	Hunan University Tsinghua University Edinburgh University
Awards	Wired Magazine Next List 2016 2011 Asian American Engineer of the Year IEEE 1993 Paper Award Allen Newell Research Excellence Medal
Scientific career
Fields	Speech Recognition CNTK Natural Language Processing Software Development
Institutions	Microsoft Carnegie Mellon University
Doctoral advisor	Mervyn Jack