Charles Lynn Wayne
Charles Lynn Wayne (1943 – November 23, 2024) was an American program manager at the Defense Advanced Research Projects Agency (DARPA). He was instrumental in creating the Common Task Method for advancing speech recognition and natural language processing technologies by centering around public benchmarks and datasets, and in establishing Human Language Technology (HLT) initiatives programs at DARPA including TIDES (Translingual Information Detection, Extraction, and Summarization) and EARS (Effective, Affordable, Reusable Speech-to-Text).[1][2]
Career at DARPA
[edit]Automatic Speech Recognition
[edit]Wayne joined DARPA in 1988 as a project manager and managed the speech program and the natural language program, which incorporated efforts from the former Strategic Computing natural language program.[3] Under his leadership, the program made significant progress in automatic speech recognition technology, with systems like Sphinx achieving 94% accuracy for speaker-independent thousand-word vocabulary recognition by 1989, and Dragon Dictate demonstrating 30,000-word capacity after adaptation to a specific speaker.[4]
Wayne led efforts to standardize libraries of speech audio for benchmarking speech recognition systems. He organized annual meetings characterized as "bake-offs" where researchers would test their systems against standard speech samples.[3] By 1991, Wayne had established the DARPA Spoken Language program with two major components: large vocabulary speech recognition and spoken language understanding for interactive problem solving.[5]
Common Task Method
[edit]Wayne is credited with creating the Common Task Method (CTM) during his time at DARPA in the late 1980s. The method established a cycle beginning with ambitious technical challenges and quantitative performance targets, followed by data acquisition and annotation, parallel research efforts, and objective evaluations. Workshops were regularly held for researchers to discuss results, share technical approaches to increasing the task performance, and design future directions.[6][7]
Wayne emphasized the importance of objective performance evaluations, working closely with the National Institute of Standards and Technology (NIST) to administer official performance evaluations for DARPA's Human Language Technology (HLT) research.[8]
His emphasis on standardized evaluation metrics, such as Word Error Rate (WER), and the establishment of large-scale linguistic data resources through the Linguistic Data Consortium, created a technical infrastructure that enabled decades of progress in speech and language technologies.[6][9] His approach to topic detection and tracking demonstrated "the virtue of formal research task definitions, common data, and common evaluations" in driving technological progress.[10]
Linguistic Data Consortium
[edit]Wayne played a role in establishing the Linguistic Data Consortium (LDC). In 1987, Frederick Jelinek met Jacob Schwartz at DARPA concerning the necessity of large datasets in linguistic AI research. Schwartz informed Wayne, who then invited the appropriate people to a meeting at the Lake Mohunk Mountain, resulting in the LDC. The LDC became responsible for acquiring, annotating, and distributing most of the speech and text data used in DARPA's HLT research and evaluations.[11] This initiative addressed the need for vast quantities of data essential for advancing language technology research.[6]
Text Retrieval Conference
[edit]Wayne was instrumental in the creation of the Text Retrieval Conference (TREC) program. In 1990–1991, he asked Donna Harman at NIST to help create a new, large test collection for the TIPSTER Program. This initiative evolved into TREC, which significantly advanced information retrieval research and technologies.[12]
TIDES and EARS Programs
[edit]During his second term as a project manager at DARPA (2001–2005), Wayne led two significant programs in the Information Awareness Office:[13]
- TIDES (Translingual Information Detection, Extraction, and Summarization) – This program aimed to enable English speakers to find and interpret critical information regardless of language or medium. The project addressed natural language processing needs for discovery tools to find information in foreign languages and convert speech to text.[13]
- EARS (Effective, Affordable, Reusable Speech-to-Text) – This program aimed to produce rich, accurate transcripts of natural human-human speech useful to both people and machines. Wayne set ambitious goals for reducing word error rates in English, Chinese, and Arabic speech recognition.
References
[edit]- ^ "Charles Wayne Obituary (1943 - 2024) - Washington, DC - The Washington Post". Legacy.com. Retrieved 2025-03-27.
- ^ "Progress and Prospects". staffwww.dcs.shef.ac.uk. Archived from the original on 9 Jul 2024. Retrieved 2025-03-27.
- ^ a b Roland, Alex; Shiman, Philip (2002). Strategic Computing: DARPA and the Quest for Machine Intelligence, 1983-1993. Cambridge, Mass: MIT Press. p. 307. ISBN 978-0262182256.
- ^ Roland, Alex; Shiman, Philip (2002). Strategic Computing: DARPA and the Quest for Machine Intelligence, 1983-1993. Cambridge, Mass: MIT Press. p. 307. ISBN 978-0262182256.
- ^ Wayne, Charles L. (1991-02-19). "A snapshot of two DARPA speech and natural language programs". Proceedings of the Workshop on Speech and Natural Language. HLT '91. USA: Association for Computational Linguistics: 403–404. doi:10.3115/112405.1138641.
- ^ a b c Liberman, Mark; Wayne, Charles (June 2020). "Human Language Technology". AI Magazine. 41 (2): 22–35. doi:10.1609/aimag.v41i2.5297. ISSN 0738-4602.
- ^ Liberman, Mark (April 1, 2015). "Reproducible Research and the Common Task Method". Simons Foundation. Retrieved 2025-03-27.
- ^ Pallett, David S. (1985). "Performance Assessment of Automatic Speech Recognizers". Journal of Research of the National Bureau of Standards. 90 (5): 371–387. doi:10.6028/jres.090.026. ISSN 0160-1741. PMC 6658420. PMID 34566165.
- ^ Donoho, David (2017-10-02). "50 Years of Data Science". Journal of Computational and Graphical Statistics. 26 (4): 745–766. doi:10.1080/10618600.2017.1384734. ISSN 1061-8600.
- ^ Wayne, Charles L. (2000). "Multilingual Topic Detection and Tracking: Successful Research Enabled by Corpora and Evaluation". LREC.
- ^ Jelinek, Frederick (2009). "ACL Lifetime Achievement Award: The Dawn of Statistical ASR and MT" (PDF). Computational Linguistics. 35 (4): 483–494. doi:10.1162/coli.2009.35.4.35401.
- ^ Tassey, Gregory (2010). Economic impact assessment of NIST's text retrieval conference (TREC) program (PDF) (Report). Gaithersburg, Maryland: National Institute of Standards and Technology.
- ^ a b Wayne, C. (2002). "Information Awareness Office (IAO) Human Language Technology TIDES, EARS, Babylon" (PDF). DARPATech 2002. Archived from the original (PDF) on 2013-02-23.