Ronald Allan Cole

 

 

PERSONAL DATA

 

Office Address: 

Center for Spoken Language Research

University of Colorado, Boulder

Campus Box 594 Boulder, CO.  80302-0258

Telephone:

303-735-5109

Fax:

303-735-5072

Email:

cole@cslr.colorado.edu

 

ACADEMIC TRAINING

 

Institution

Degree

Date

Univ. of Rochester

B.A. with distinction in Psychology

1963-1967

Univ. of California at Riverside

M.A. in Psychology

1967-1969

Univ. of California at Riverside

Ph.D. in Psychology, Doctoral Dissertation: "Phoneme Independence in short-term memory" Thesis Advisor: Dr. Terrence Kenney

1969-1971

 

PROFESSIONAL EXPERIENCE

 

Oct. 1998 - Present

Professor, Dept. of Psychology, Department of Computer Science

Director, Center for Spoken Language Understanding, University of Colorado, Boulder.

Apr. 1992 - Oct. 1998

Professor, Dept. of Computer Science & Engineering

Director, Center for Spoken Language Understanding Oregon Graduate Institute.

Aug. 1988 - Apr. 1992

Associate Professor, Dept. of Computer Science & Engineering, Oregon Graduate Institute.

May 1980 - Aug. 1988

Senior Project Scientist, Department of Computer Science, Carnegie Mellon University.

Jan. 1975 - Apr. 1980

Associate Professor, Department of Psychology, Carnegie Mellon University.

Jan.- June 1974

Visiting Lecturer, Department of Linguistics, Tel-Aviv University, Tel-Aviv, Israel.

July 1974 - Dec. 1974

Associate Professor, Department of Psychology, University of Waterloo.

July 1970 - June 1974

Assistant Professor, Department of Psychology, University of Waterloo.

 


COURSES TAUGHT

 

Undergraduate

Graduate

Short Course

Intro to Psychology

Perception

Computer Speech Recognition:  The State
of the Art

Perception and Cognition

Psycholinguistics

 

Psycholinguistics

Time Perception

 

Psychology of Consciousness

Speech Perception

 

Research Methods in Perception

Acoustic Phonetic

 

Biofeedback

Spectrogram Reading The Structure of Spoken Language

 

 

 

MASTERS THESES SUPERVISED

 

Scott, B.L.

The verbal transformation effect as a function of embedded sounds, University of Waterloo, September, 1971.

Freilich, I.

Cognitive rigidity and the alpha rhythm of the human electro-encephalogram; University of Waterloo, 1972.

Singer, J.

The effects of repetition on perception and response, University of Waterloo, September 1974.

Raz, I.

The extent of invariance for Hebrew consonants, Tel-Aviv University, December 1975.

Gopalakrishnan, M

Segmenting speech into broad phonetic categories using neural networks; Oregon Graduate Institute, August 1990.

Rooker, T.

Formant estimation from a spectral slice using neural networks.  Oregon Graduate Institute, August 1990.

Zhou, L.

Speaker-independent neural network pitch tracker with telephone bandwidth, speech for computer speech recognition.  Oregon Graduate Institute, April 1991.

Roginski, K.

A Neural Network Phonetic Classifier for Telephone Speech, Oregon Graduate Institute, November 1991.

Jain, N.

A New Approach to Voice Dialing, Oregon Graduate Institute, July 1995.

 

DOCTORAL DISSERTATIONS SUPERVISED

 

Scott, B.L.

Speech perception: a theory and application, University of Waterloo, August 1974.

Jakimik, J.

The interaction of sound and knowledge during word recognition from fluent speech, Carnegie Mellon University, June 1979.

Rudnicky, A.

The role of language experience in language perception: An ecogical theory.  Carnegie Mellon University, June 1979.

Muthusamy, Y.

A segmental approach to automatic language identification, Oregon Graduate Institute, July 1993.

 


 

RESEARCH GRANTS

2000 – 2005

Cole, R., Massaro, D., van Santen, J., Movellan, J., “ITR:  Creating the Next Generation of Intelligent Animated Conversational Agents,” $4,000,000, NSF.

1999 - 2002

Cole, R., "CRCD: An Interactive Curriculum in Human Language Technology for Undergraduate and Graduate Education and Research," $400,000, NSF.

1999 - 2001

Cole, R., "Advancing Human Language Technology in Brazil and the United States Through Collaborative Research on Portuguese Spoken Language Systems," $199,481, NSF.

1998 - 2002

Cole, R., "CARE: Accessible Language Resources for Research and Education," $1,200,000, NSF.

1998 – 2000

Cole, R., "Accelerating Research Advances in Human Language Technology through Portable, Accessible Natural Dialogue Systems," $600,000, ONR/DARPA.

1998

Cole, R., “Making Spoken Language Systems Ubiquitous: Long-Term Research Agenda,” $150,000, Intel Corporation.

1997-1999

Cole, R., Y. Yan, “A Phonetic Knowledge-Guided Approach to Speaker Adaption for Large Vocabulary Continuous Speech Recognition,” $94,375, Office of Naval Research.

1997-1998

Cole, R., “Large Vocabulary Continuous Speech Recognition,” $100,000, fonix Corporation.

1997-1998

Cole, R., “Speech Recognition Evaluation,”$72,675, Nortel Corporation.

1997-1998

Fanty, M., R. Cole, Y. Yan, “CISE Research Instrumentation: File Server and Storage Server for Spoken Language Technologies,” $50,000, National Science Foundation.

1997-2000

Cole, R., M. Macon, D. Massaro, A. Waibel, “Creating Conversational Agents for Language Training,” $1,800,000, National Science Foundation.

1997-1998

Cole, R., J. de Villiers, D. Massaro, “Bringing Spoken Language Systems to the Classroom for Learning Language Training with Hearing Impaired People,” $50,000, National Science Foundation.

1997

Cole R., French, Italian, Mexican, and Caribbean Spanish Data Collection, $199,906, AT & T.

1997

Cole, R., “Understanding the role of International Collaboration in Computer Science and Engineering,” (NSF Workshop), $49,946, National Science Foundation.

1997

Cole, R., Intel Equipment Grant, $200,000, (40 Pentium computers).

1997

Cole, R., "Proposal for the Second NSF Grantees Workshop in Interactive Systems," $120,925, National Science Foundation.

1996-1998

Cole, R., G. Whitney, D. Jonassen, B. Moeller, S. Carver, "Conceptualization of Flagship Center for Collaborative Research in Learning and Human Language Technologies,” $50,000, National Science Foundation.

1996-1997

Cole R., Basic Research/Education Support, $45,913, Intel Corp.

1996-1997

Cole, R., S. Sutton, "Advancing Human Language Technology in Mexico and the U.S. Through Collaborative Research on Spoken Language Systems," $100,000, National Science Foundation.

1996-1998

Cole, R., D. Novick, M. Fanty, “Rapid Prototyping of Spoken Language Systems,” $1,300,000, Office of Naval Research.

1996-1998

Cole, R., M. Fanty, B. Oshika, "Human Language Resources for Research in Multilanguage Systems, Robust Recognition, and Speaker Identification," $746,890, National Science Foundation.

1996

Cole, R., "Toward Robust Speech Recognition: Improved Measures of Confidence," $20,000, Texas Instruments.

1994-1995

Cole, R., "Instrumentation for Research in Spoken Language Systems," $40,000, National Science Foundation.

1994-1996

Cole, R., M. Fanty, D. Novick, E. Barnard, H. Hermansky, "Spoken Dialogue Technology for Appointment Scheduling," $1,400,000, U S West

1994-1995

Cole, R., D. Novick, M. Fanty, "Rapid Prototyping and Deployment of Spoken Language Systems," $450,000, U.S. Office of Naval Research/U.S. Census Bureau.

1994-1995

Cole, R., "Joint E.C.-U.S. Survey of the State of the Art in Human Language Technology,” $58,000, National Science Foundation.

1994-1998

Cole, R., "Toward Rapid Development and Deployment of Spoken Language Systems,” $150,646, U.S. Office of Naval Research/AASERT.

1994-1995

Cole, R., D. Novick, M. Fanty, E. Barnard, H. Hermansky, "Toward Robust Spoken-Language Systems,” $829,886, NSF/ARPA.

1993-1996

Cole, R., B. Oshika, E. Barnard, “Linguistic Units for Language Identification,” $697,162, Department of Defense.

1993-1994

Cole, R., "Digital Data Collection Platform and Software,” $41,000, Linguistic Data Consortium.

1993-1995

Cole, R., E. Barnard, T. Leen, “Task-based Analysis and Stochastic Search

in Neural Networks, $325,000, Office of Naval Research.

1993-1994

Cole, R., “Telephone Data Collection,” $65,325, Apple Computer Inc.

1993-1996

Cole, R., "Automatic Language Identification: A Distinctive Feature Approach," $110,323, Office of Naval Research, AASERT Award.

1993-1994

Cole, R., D. Novick, M. Fanty, “Voice questionnaire for the Year 2000 Census,” $125,000, U.S. Bureau of the Census, (grant administered through the U.S. Office of Naval Research).

 

1993-1999

Cole, R., D. Novick, “Graduate Traineeships for Under-Represented Minorities for Research in Spoken-Language Interfaces,” $557,500, National Science Foundation.

1993-1994

Cole, R., D. Novick, "Real Time Voice Response Questionnaire for the Year 2000 Census,” $61,912, Digital Equipment Corporation.

1992

Cole, R., S. Zahorian, L. Hirschman, “Workshop on Spoken Language Understanding,” $35,700, National Science Foundation.

1992-1995

Cole, R., M. Fanty, "Spoken Letter Recognition,” $360,000/3-year, National Science Foundation.

1991-1993

Cole, R., M. Fanty, T. Leen, "Neural Network Approaches to Spoken Letter Recognition,” $230,000, Office of Naval Research.

1992-1993

Cole, R., M. Fanty, T. Leen, "Instrumentation for the Center for Spoken Language Understanding,” $43,000, National Science Foundation.

1991

Fanty, M., R. Cole, "A Portable Interactive Environment for Computer Speech Recognition,” $83,000, National Science Foundation.

1991

Cole R., Equipment donation, $108,000, Digital Equipment Corporation.

1991-1992

Cole, R., M. Fanty, "English Alphabet Recognition Over Telephone Lines," $230,000, US West Advanced Technologies.

1991-1992

Cole, R., "Speaker-Independent Recognition of Arbitrary Sets of Words," $100,000, US West Technologies.

1990-1992

Cole, R., M. Fanty, Cash donation, $40,000, and equipment donation, $43,000, Apple Computer.

1990-1992

Cole, R., "Phonetic Classification of Continuous Speech,” $50,000/year, National Science Foundation.

1990

Cole, R., M. Fanty, "Feasibility of Speech Recognition on a VLSI Neurocomputer,” Matching Funds Grant, $20,000, OASIS.

1985-1987

Cole, R., "Phonetic Classification of Continuous Speech," $580,000, National Science Foundation.

1982-1985

Cole, R., "Knowledge Engineering and Knowledge Acquisition in Speech Understanding Research," $480,000, National Science Foundation.

1972-1975

Cole, R., "What we hear during speech," $30,000, National Research Council of Canada.

1972

Cole, R., "Hearing through the skin," Operating grant, $2000, University of Waterloo Research Grant.

1970-1971

Cole, R., Operating grant, $4000, National Research Council of Canada.

1970-1971

Cole, R., Computing grant, $1300, National Research Council of Canada.

 

PUBLICATIONS

1.      Jiyong Ma, Jie Yan and Ron Cole, in CU Animate:Tools for Enabling Conversations with Animated Characters, in Submitted to: ICSLP-2002, Denver, Colorado, USA,Sept 2002., pp. 4, USA, Sep, 2002.

2.      Hosom, J.P., Cole, R.A., “Burst Detection Based on Measurements of Intensity Discrimination.”  Proceedings of ICSLP 2000.  (pp. IV-564 -- IV-567).  Bejing, China 2000.

 

3.      Shobaki, K., Hosom, J.P., Cole, R., “The OGI Kids’ Speech Corpus and Recognizers.” Proceedings of ICSLP 2000.  (pp. IV-564 -- IV-567).  Bejing China 2000.

 

4.      Cole, R.A., Serridge, B., Hosom, J.P., Cronk, A., and Kaiser, E., "A Platform for Multilingual Reserach in Spoken Dialogue Systems." Workshop on Multi-lingual Interoperability in Speech Technology (MIST), The Netherlands, September 1999.

5.      Massaro, D. W., Cohen, M. M., Daniel, S., & Cole, R. A. (1999). Developing and evaluating conversational agents. In P. A. Hancock (Ed.) Human Factors and Ergonomics: Perceptual and Cognitive Principles. (Handbook of Perception & Cognition, 2nd Edition). (pp. 173-194). San Diego, CA: Academic Press.

6.      Cole, R. A., "Tools for research and education in speech science," In Proceedings of the International Conference of Phonetic Sciences, San Francisco, CA, Aug 1999.

7.      T. Carmell, J.P. Hosom, and R. Cole. A computer-based course in spectrogram reading. In Proceedings of ESCA/SOCRATES Workshop on Method and Tool Innovations for Speech Science Education, London, UK, Apr 1999.

8.      Ron Cole, Dominic W. Massaro, Jacques de Villiers, Brian Rundle, Khaldoun Shobaki, Johan Wouters, Michael Cohen, Jonas Beskow, Patrick Stone, Pamela Connors, Alice Tarachow, and Daniel Solcher. New tools for interactive speech and language training: Using animated conversational agents in the classrooms of profoundly deaf children. In Proceedings of ESCA/SOCRATES Workshop on Method and Tool Innovations for Speech Science Education, London, UK, Apr 1999.

9.      D. Massaro, Cohen, M. M., Beskow, J., Daniel, S., Cole, R., "Developing and Evaluating Conversational Agents," Workshop on Embodied Conversation Characters (WECC), Lake Tohoe, 1998, http://mambo.ucsc.edu:80/psl/wecc3.rtf.

10.   Cosi, P., J.P. Hosom, J. Schalkwyk, S. Sutton, and R. A. Cole, "Connected Digit Recognition Experiments with the OGI Toolkit's Neural Network and HMM-Based recognizers," In Proceedings, 4th IEEE Workshop on Interactive Voice Tehcnology for Telecommunications Applications (IVTTA-ETWR98), Turin, Italy, (September 1998).

11.   Cole, R.A., M. Noel, and V. Noel. The CSLU Speaker Recognition Corpus. In Proceedings of ICSLP, Sydney, Australia, 1998.

12.   Hosom, J. P., R. A. Cole, P. Cosi, "Evaluation and Integration of Neural-Network Training Techniques for Continuous Digit Recognition," In Proceedings of the International Conference on Spoken Language Processing (ICSLP), Sydney, Australia, (November 1998).

13.   Serridge, B., R. A. Cole, A. Barbosa, N. Munive, A. Vargas, "Creating a Mexican Spanish Version of the CSLU Toolkit," In Proceedings of the International Conference on Spoken Language Processing (ICSLP), Sydney, Australia, (November 1998).

14.   Sutton, S., R. A. Cole, J. deVilliers, J. Schalkwyk, P. Vermeulen, M. Macon, Y. Yan, E. Kaiser, B. Rundle, K. Shobaki, P. Hosom, A. Kain, J. Wouters, D. Massaro, M. Cohen, "Universal Speech Tools:  The CSLU Toolkit," In Proceedings of the International Conference on Spoken Language Processing (ICSLP), Sydney, Australia, (November 1998).

15.   Cole, R., T. Carmell, P. Connors, M. Macon, J. Wouters, J. de Villiers, A. Tarachow, D. Massaro, M.Cohen, J. Beskow, J. Yang, U. Meier, A. Waibel, P. Stone, G. Fortier,  A. Davis, C. Soland, “Intelligent Animate Agents for Interactive Language Training,” presented at STILL ’98, Stockholm, Sweden, May 1998.

16.   Yan, Y., X. Wu, J. Schalkwyk, R. A. Cole, "Development of CLSU LVCSR: The 1997 DARPA HUB4 Evaluation System," In DARPA Broadcast News Transcription and Understanding Workshop, (1998).

17.   Cole, R., S. Sutton, Y. Yan, P. Vermeulen, M. Fanty, “Accessible technology for interactive systems: A new approach to spoken language research,” In Proceedings of the International Conference on Acoustics, Speech and Signal Processing, Seattle, WA., 1998.

18.   Cole, R., D. G. Novick, M. Fanty, P. Vermeulen and S. Sutton, "Experiments with a Spoken Dialogue System for Taking the U.S. Census," Special Edition: Speech Communications, (1998, in Press).

19.   Cole, R., & V. Zue, 1997.  Spoken Language Input.  In Cole, R.A., J. Mariani, H. Uszkoriet, A. Zaenen, & V. Zue (eds.), Survey of the State of the Art in Human Language Technology, (1, pp. 1-49), Cambridge University Press.

20.   Cole, R., 1997.  Spoken Output Technologies.  In Cole, R.A., J. Mariani, H. Uszkoriet, A. Zaenen, & V. Zue (eds.), Survey of the State of the Art in Human Language Technology, (5, pp. 165-214), Cambridge University Press.

21.   Cole, R., 1997.  Mathematical Methods.  In Cole, R.A., J. Mariani, H. Uszkoriet, A. Zaenen, & V. Zue (eds.), Survey of the State of the Art in Human Language Technology, (11, pp. 337-369), Cambridge University Press.

22.   Cole, R., 1997.  Language Resources. In Cole, R.A., J. Mariani, H. Uszkoriet, A. Zaenen, & V. Zue (eds.), Survey of the State of the Art in Human Language Technology, (12, pp. 381-403), Cambridge University Press.

23.   Computer Science and Telecommunications Board, National Research Council, “More Than Screen Deep:  Toward Every-Citizen Interfaces to the Nation’s Information Infrastructure,” National Academy Press, Washington, D.C., 1997.

24.   Sutton, S., E. Kaiser, A. Cronk, and R. Cole, "Bringing spoken language systems to the classroom", EUROSPEECH'97, Rhodes, Greece, (1997).

25.   Tu X., Y. Yan, R. Cole, "Matching training and testing criteria in hybrid speech recognition systems", EUROSPEECH'97, Rhodes, Greece (1997).

26.   Cole R., S. Sutton, M. Fanty, E. Kaiser, J. Schalkwyk, J. de Villiers, A. Cronk, Colton, "Cyberspeech: Password to Cyberspace," DAIC Workshop, Seattle, WA, (1997).

27.   Yan, Y., M. Fanty, R. Cole, “Speech recognition using neural networks with forward-backward probability generated targets,” Proceedings of the International Conference on Acoustics Speech and Signal Processing, Munich, (1997).

28.   Hosom, J.P., R. Cole, “A diphone-based digit recognition system using neural networks,” Proceedings of the International Conference on Acoustics Speech and Signal Processing, Munich, (1997).

29.   Cole, R. A., D. G. Novick, M. Fanty, P. Vermeulen, S. Sutton, "Experiments with a spoken dialogue system for taking the U.S. census," Free Speech Journal, (1996).

30.   Yan Y., E. Barnard, R. Cole, “Development of an approach to automatic language identification based on phone recognition,” Computer, Speech & Language, Vol. 10(1), pp. 37-54, January, (1996).

31.   Sutton, S., D. Novick, R. Cole, M. Fanty, “Building 10,000 spoken-dialogue systems,” Proceedings of the International Conference on Spoken Language Processing, Philadelphia, PA, (1996).

32.   Hu, Z., J. Schalkwyk, E. Barnard, R. Cole, “Speech recognition using syllable-like units,” Proceedings of the International Conference on Spoken Language Processing, Philadelphia, PA, (1996).

33.   Cole, R., Y. Yan, T. Bailey, “The influence of bigram constraints on word recognition by humans: Implications for computer speech recognition,” Proceedings of International Conference on Spoken Language Processing, Philadelphia, PA, (1996).

34.   Cole, R., Y. Yan, B. Mak, M. Fanty, T. Bailey, “The contribution of consonants versus vowels in word recognition of fluent speech,” Proceedings of the International Conference on Acoustics, Speech and Signal Processing, Atlanta, Georgia, (1996).

35.   Jain, N., R. Cole, E. Barnard, “Creating speaker-specific phonetic templates with a speaker-independent phonetic recognizer: Implications for voice dialing,” Proceedings of the International Conference on Acoustics, Speech and Signal Processing, Atlanta, Georgia, (1996).

36.   Colton, L. D., R. Cole, D. G. Novick, S. Sutton, “A laboratory course for designing and testing spoken dialogue systems,” Proceedings of the International Conference on Acoustics, Speech and Signal Processing, Atlanta, Georgia, (1996).

37.   Hu, Z., E. Barnard, R. Cole, "Transition-based feature extraction within frame-based recognition," Proceedings of the Fourth European Conference on Speech Communication and Technology, Madrid, Spain, (1995).

38.   Colton, L. D., M. Fanty, R. Cole, "Second pass verification improves N-Way forced choice recognition and out-of-vocabulary rejection," Proceedings of the Fourth European Conference on Speech Communication and Technology, Madrid, Spain, (1995).

39.   Noel, M., R. Cole, T. Durham, T. L. Lander, "New telephone speech corpora at CSLU," Proceedings of the Fourth European Conference on Speech Communication and Technology, Madrid, Spain, (1995).

40.   Lander, T., R. Cole, B. Oshika, M. Noel, "The OGI 22 language telephone speech corpus," Proceedings of the Fourth European Conference on Speech Communication and Technology, Madrid, Spain, (1995).

41.   Lander, T., B. T. Oshika, R. Cole, M. Fanty, "Multi-language speech database: creation and phonetic labeling agreement," Proceedings of the International Congress of Phonetic Science, Stockholm, Sweden, (1995).

42.   Fanty, M., E. Barnard, R. Cole, "Alphabet recognition," by invitation to the Handbook of Neural Computation, (1995).

43.   Barnard, E., R. Cole, M. Fanty, P. Vermeulen, "Real-world speech recognition with neural networks," (invited paper), Proceedings of the International Symposium on Aerospace/Defense Sensing & Control and Dual-Use Photonics, International Society for Optical Engineering, Technical Conference no. 2492, Orlando, FL, (1995).

44.   Sutton, S., B. Hansen, T. Lander, D. G. Novick, R. Cole, "Evaluating the effectiveness of dialogue for an automated spoken questionnaire," AAAI 1995 Spring Symposium Series, Stanford University, (1995).

45.   Cole, R., L. Hirschman et al., "The challenge of spoken language systems: Research directions for the nineties," IEEE Transactions on Speech and Audio Processing, 1, pp. 1-21, (1995).

46.   Cole, R., B. T. Oshika, M. Noel, T. Lander, M. Fanty, "Labeler agreement in phonetic labeling of continuous speech," Proceedings of the 1994 International Conference on Spoken Language Processing, Yokohama, Japan, (1994).

47.   Cole, R., M. Fanty, M. Noel, T. Lander, "Telephone speech corpus development at CSLU," Proceedings of the 1994 International Conference on Spoken Language Processing, Yokohama, (1994).

48.   Cole, R., D. G. Novick, M. Fanty, P. Vermeulen, S. Sutton, D. Burnett and J. Schalkwyk, "A prototype voice-response questionnaire for the U.S. census," Proceedings of the 1994 International Conference on Spoken Language Processing, Yokohama, Japan, (1994).

49.   Schalkwyk, J., E. Barnard, R. Cole, J. R. Sachs, "Detecting an imposter in telephone speech," Workshop on Automatic Speaker Recognition, Identification and Verification, Martigny, Switzerland, (1994).

50.   Cole, R., M. Noel, D. C. Burnett, M. Fanty, T. Lander, B. Oshika, S. Sutton, "Corpus development activities at the Center for Spoken Language Understanding," Proceedings of the ARPA Workshop on Human Language Technology, April 7-11, (1994).

51.   Muthusamy, Y.K., N. Jain, R. Cole, "Perceptual benchmarks for automatic language identification," Proceedings of the 1994 International Conference on Acoustics, Speech and Signal Processing, (1994).

52.   Berkling, K.M., T. Arai, E. Barnard, R. Cole, "Analysis of phoneme-based features for language identification," Proceedings of the 1994 International Conference on Acoustics, Speech and Signal Processing, (1994).

53.   Cole, R., D. G. Novick, D. Burnett, B. Hansen. S. Sutton, M. Fanty, “Towards automatic collection of the U.S. census,” Proceedings of the 1994 International Conference on Acoustics, Speech and Signal Processing, (1994).

54.   Cole, R., D. G. Novick, M. Fanty, S. Sutton, B. Hansen, D. Burnett, “Rapid prototyping of spoken language systems: The year 2000 census project,” Proceedings of the International Symposium on Spoken Dialogue, Tokyo, Japan, (1993).

55.   Muthusamy Y., K. Berkling, T. Arai, R. Cole, E. Barnard, “A comparison of approaches to automatic language identification using telephone speech, “EUROSPEECH '93, Berlin, Germany, (1993).

56.   Schmid, P., R. Cole, M. Fanty, H. Bourlard, M. Haessen, “Real-time, neural network-based, French alphabet recognition with telephone speech,” EUROSPEECH '93, Berlin, Germany, (1993).

57.   Cole, R., Y. K. Muthusamy, “Perceptual studies on vowels excised from continuous speech,” Proceedings of the International Conference on Spoken Language Processing, Banff, Alberta, (1992).

58.   Muthusamy, Y. K., R. Cole, “Automatic segmentation and identification of ten languages using telephone speech,” Proceedings of the International Conference on Spoken Language Processing, Banff, Alberta, (1992).

59.   Fanty, M., J. Pochmara, R. Cole, “An interactive environment for speech recognition research,” Proceedings of the International Conference on Spoken Language Processing, Banff, Alberta, (1992).

60.   Muthusamy, Y. K., R. Cole, B. T. Oshika, “The OGI multi-language telephone speech corpus,” Proceedings of the International Conference on Spoken Language Processing, Banff, Alberta, (1992).

61.   Cole, R., M. Fanty, K. Roginski, “A telephone speech database of spelled and spoken names,” Proceedings of the International Conference on Spoken Language Processing, Banff, Alberta, (1992).

62.   Muthusamy, Y. K., R. Cole, B. T. Oshika, “Automatic language identification,” Voice Systems Worldwide Speech Tech '92 Conference, New York, NY, (1992).

63.   Cole, R., M. Fanty, K. Roginski, “Recognizing spelled names with telephone speech,” Voice Systems Worldwide Speech Tech '92 Conference, New York, NY, (1992).

64.   Creekmore, J., M. Fanty, R. Cole, “A comparative study of five spectral representations for speaker-independent phonetic recognition,” presented at 25th Annual Asilomar Conference on Signals, Systems, and Computers, Pacific Grove, CA, (1991).

65.   Fanty, M., R. Cole, M. Slaney, “A comparison of DFT, PLP and Cochleagram for alphabet recognition,” 25th Annual Asilomar Conference on Signals, Systems, and Computers, Pacific Grove, CA, (1991).

66.   Muthusamy, Y.K., R. Cole, “A segment-based automatic language identification system,” in J.E. Moody, S. J. Hanson, R.P. Lippmann, editors, Advances in Neural Information Processing Systems 4, San Mateo, CA, (1992).  Morgan Kaufmann Publish.

67.   Cole, R., K. Roginski, M. Fanty, “English alphabet recognition with telephone speech,” in J.E. Moody, S. J. Hanson, R.P. Lippmann, editors, Advances in Neural Information Processing Systems 4, San Mateo, CA, (1992).  Morgan Kaufmann Publishers.

68.   Cole, R., M. Fanty, K. Roginski, “Speaker-independent name retrieval from spellings using a database of 50,000 names: Application to telephone speech,” '91, Genova, Italy, Sep. (1991).

69.   Janssen, R.D.T., M. Fanty, R. Cole, “Speaker-independent phonetic classification in continuous english letters,” Proceedings of the International Joint Conference on Neural Networks, Seattle, WA, (1991).

70.   Muthusamy, Y. K., R. Cole, M. Gopalakrishnan, “A segment-based approach to automatic language identification,” Proceedings of the 1991 International Conference on Acoustics, Speech and Signal Processing, May 14-17, Toronto, (1991).

71.   Cole, R., M. Fanty, M. Gopalakrishnan, R. D. T. Janssen, “Speaker-independent name retrieval from spellings using a database of 50,000 names,” Proceedings of the 1991 International Conference on Acoustics, Speech and Signal Processing, May 14-17, Toronto, (1991).

72.   Fanty, M., R. Cole, “Spoken letter recognition,” in R. Lippmann, J. Moody, D. Touretzky (Ed.), Advances in Neural Information Processing Systems, San Mateo, CA: Morgan Kaufmann Publishers, (1991).

73.   Fanty, M., R. Cole, “Speaker-independent english alphabet recognition: Experiments with the E-Set,” Proceedings of the 1990 International Conference on Spoken Language Processing, Kobe, Japan, (1990).

74.   Atlas, L., R. Cole, Y. Muthusamy, A. Lippman, G. Connor, D. Park, M. El-Sharkawi, R. Marks II, “A performance comparison of trained multi-layer perceptrons and trained classification trees,” Proceedings of the IEEE (Special Issue on Neural Networks)  (1990).

75.   Cole, R., M. Fanty, "Spoken letter recognition,” Proceedings of the DARPA Workshop on Speech and Natural Language Processing, Hidden Valley, PA, (1990).

76.   Cole, R., M. Fanty, Y. K. Muthusamy, M. Gopalakrishnan, “Speaker-independent recognition of spoken english letters,” Proceedings of the International Joint Conference on Neural Networks '90, San Diego, CA, (1990).

77.   Muthusamy, Y. K., R. Cole, M. Slaney, “speaker-independent vowel recognition: spectrograms versus cochleagrams,” Proceedings of the IEEE 1990 International Conference on Acoustics, Speech and Signal Processing, Albuquerque, New Mexico, (1990).

78.   Atlas, L., W. Kooiman, P. Loughlin, R. Cole, “New nonstationary techniques for the analysis and display of speech transients,” Proceedings of the IEEE 1990 International Conference on Acoustics, Speech and Signal Processing, Albuquerque, New Mexico, (1990).

79.   Cole, R., Y. K. Muthusamy, L. Atlas, “Speaker-independent vowel recognition: Comparison of backpropagation and trained classification trees,” Proceedings of the IEEE Hawaii International Conference on System Sciences No. 23, Kona-Kailua, Hawaii, (1990).

80.   Barnard, E., R. Cole, M. P. Vea, F. Alleva, “Pitch detection with a neural-net classifier,” IEEE Transactions on Acoustics, Speech & Signal Processing, (1991).

81.   Atlas, L., R. Cole, Y. K. Muthusamy, J. Taylor, E. Barnard, “Performance comparisons between backpropagation networks and classification trees on three real-world applications,” Proceedings of the Conference on Neural Information Processing Systems, Denver, CO, (1989).

82.   Atlas, L. E., J. Connor, D. Park, M. El-Sharkawi, R. Marks II, A. Lippman, R. Cole, Y. K. Muthusamy, “A performance comparison of trained multi-layer perceptrons and trained classification trees,” Proceedings of the IEEE Systems, Man and Cybernetics Society Conference, Cambridge, MA, (1989).

83.   Cole, R., J. W. T. Inouye, Y. K. Muthusamy, M. Gopalakrishnan, “Language identification with neural networks: a feasibility study,” Proceedings of the IEEE Pacific Rim Conference on Communications, Computers and Signal Processing, Victoria B.C., (1989).

84.   Cole, R., L. Hou, “Segmentation and broad classification of continuous speech,” Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, New York, (1988).

85.   Cole, R., “Phonetic classification in new generation speech recognition systems,” Speech Tech 86, pp. 43-46, New York, (1986).

86.   Cole, R., M. P. Phillips, R. A. Brennan, B. Chigier, “The CMU phonetic classification system,” Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Tokyo, (1986).

87.   Jakimik, J. A., R. Cole, A. I. Rudnicky, “Sound and spelling in spoken word recognition,” Journal of Verbal Learning and Verbal Behavior, pp. 165-178, (1985).

88.   Cole, R., R. M. Stern, M. J. Lasry, “Performing fine phonetic distinctions: templates vs. features,” in Invariance and Variability of Speech Processes, ed. J. Perkell and D. Klatt, Lawrence Erlbaum, New York, (1984).

89.   Cole, R., R. M. Stern, M. S. Phillips, S. M. Brill, A. P. Pilant, P. Specker, “Feature-based speaker-independent recognition of isolated english letters,” Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 731-734, (1983).

90.   Cole, R., A. I. Rudnicky, “What's new in speech perception: The research and ideas of William Chandler Bagley 1874-1946,” Psychological Review, (1983).

91.   Bradshaw, G. L., R. Cole, Z. D. Li, “Comparison of learning techniques in speech recognition,” Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 554-557, (1982).

92.   Haber, R. N., R. Cole, “Evidence for direct visual access to letter identities,” Acta Psychologica, 46, pp. 181-192, (1980).

93.   Cole, R., “Perception of fluent speech by children and adults,” in Annals of the New York Academy of Sciences, 379, pp. 92-102, (1981).

94.   Cole, R., C. A. Perfetti, “Listening for mispronunciations in a children's story:  The use of context by children and adults,” Journal of Verbal Learning and Verbal Behavior, 19, pp. 297-315, (1980).

95.   Cole, R., R. N. Haber, “Reaction time to letter name or letter case,” Acta Psychologica, 44, pp. 281-285, (1980).

96.   Cole, R., V. W. Zue, “Speech as eyes see it,” in Attention and Performance, ed. S. Nickerson, Lawrence Erlbaum Associates, Hillsdale, N.J., (1980).

97.   Cole, R., J. Jakimik, W. E. Cooper, “Segmenting speech into words,” Journal of the Acoustical Society of America, 67, pp. 1323-1332, (1980).

98.   Cole, R., J. Jakimik, “How are syllables used to recognize words?,” Journal of the Acoustical Society of America, 67, pp. 965-970, (1980).

99.   Winitz, H., D. Ingram, R. Cole, J. Folkins, “Articulation,” in Annual Abstracts of Speech, Voice, Language and Hearing, ed. I. Goldstein, Little, Brown & Co., Boston, (1979).

100.   Zue, V., R. Cole, “Experiments on spectrogram reading,” Proceedings of the IEEE Conference on Acoustics, Speech and Signal Processing, pp. 116-119, Washington, D.C., (1979).

101.   Cole, R., A. I. Rudnicky, V. Zue, D. R. Reddy, “Speech as patterns on paper,” in Perception and Production of Fluent Speech, ed. R. A. Cole, Lawrence Erlbaum Associates, Hillsdale, NJ, (1980).  Cole, R. A., “Navigating the slippery stream of speech,” Psychology Today, (1979).

102.   Cole, R., J. Jakimik, “A model of speech perception,” in Perception and Production of Fluent Speech, ed. R. A. Cole, Lawrence Erlbaum Associates, Hillsdale, NJ, (1980).

103.   Rudnicky, A. I., R. Cole, “The effect of subsequent context on syllable perception,” Journal of Experimental Psychology: Human Perception and Performance, 4, pp. 638-647, (1978).

104.   Cole, R., J. Jakimik, W. E. Cooper, “Perceptibility of phonetic features in fluent speech,” Journal of the Acoustical Society of America, 64, pp. 44.56, (1978).

105.   Rudnicky, A. I., R. Cole, “Adaptation produced by connected speech,” Journal of Experimental Psychology: Human Perception and Performance, 3, pp. 51-61, (1977).

106.   Cole, R., J. Jakimik, “Understanding speech: How words are heard,” in Information Processing Strategies, ed. G. Underwood, Academic Press, London, (1978).

107.   Cooper, W. E., D. Billings, R. Cole, “Articulatory effects on speech perception: A second report,” Journal of Phonetics, 4, pp. 219-232, (1976).

108.   Cole, R., W. E. Cooper, “Properties of frication analyzers for /j/,” Journal of the Acoustical Society of America, 62, pp. 177-182, (1977).

109.   Cole, R., N. Cummings, “Bilateral alpha rhythm in children during listening and looking,” in Language development and neurological theory, eds. S. Segolowitz & F. Gruber, Academic Press, New York, (1977).

110.   Cole, R., “Invariant features and feature detectors,” in Language development and neurological theory, ed. S. Segolowitz & F. Gruber, Academic Press, New York, (1977).

111.   Cooper, W. E., R. R. Ebert, R. Cole, “Perceptual analysis of stop consonants and glides,” Journal of Experimental Psychology: Human Perception and Performance, 2, pp. 92-104, (1976).

112.   Cooper, W. E., R. R. Ebert, R. Cole, “Speech perception and production of the consonant cluster /st/,” Journal of Experimental Psychology: Human Perception and Performance, 2, pp. 105-1154, (1976).

113.   Cole, R., W. E. Cooper, “Perception of voicing in english affricates and fricatives,” Journal of the Acoustical Society of America, 58, pp. 1280-1287, (1975).

114.   Cole, R., W. E. Cooper, J. Singer, F. Allard, “Selective adaptation of english consonants using real speech,” Perception and Psychophysics, 18, pp. 227-244, (1975).

115.   Cole, R., M. Young, “Effect of subvocalization on memory for speech sounds,” Journal of Experimental Psychology: Human Learning and Memory, 1, pp. 772-779, (1975).

116.   Cole, R., B. Scott, “Toward a theory of speech perception,” Psychological Review, 81, pp. 348-374, (1974).

117.   Cole, R., B. Scott, “The phantom in the phoneme: Invariant cues for stop consonants,” Perception and Psychophysics, 15, pp. 101-107, (1974).

118.   Cole, R., M. Coltheart, F. Allard, “Memory of a speaker's voice: Reaction time to same and different-voiced letters,” Quarterly Journal of Experimental Psychology, 24, pp. 1-7, (1974).

119.   Sales, B. D., R. Cole, R. N. Haber, “Mechanisms of aural encoding: VIII.  Phonetic interference and contest-sensitive coding in short-term memory,” Memory and Cognition, 2, pp. 596-600, (1974).

120.   Cole, R., B. D. Sales, R. N. Haber, “Mechanisms of aural encoding: VII.  Differential decay of consonants and vowels in a Petersen and Petersen STM task,” Memory and Cognition, 2, pp. 211-214, (1974).

121.   Cole, R., B. Scott, “Perception of temporal order in speech.  The role of vowel transitions,” Canadian Journal of Psychology, 27, pp. 441-449, (1973).

122.   Cole, R., “Different memory functions for consonants and vowels,” Cognitive Psychology, 4, pp. 39-54, (1973).

123.   Cole, R., “Listening for mispronunciations: A measure of what we hear during speech,” Perception and Psychophysics, 13, pp. 153-156, (1973).

124.   Cole, R., “Perceiving syllables and remembering phonemes,” Journal of Speech and Hearing Research, 16, pp. 37-47, (1973).

125.   Cole, R., R. Haber, B. Sales, “Mechanisms of aural encoding: VI.  Consonants and vowels are remembered as subsets of distinctive features,” Perception and Psychophysics, 13, pp. 87-92, (1973).

126.   Cole, R., B. Scott, “Distinctive feature control of decision time: Same-different judgments of simultaneously heard phonemes,” Perception and Psychophysics, 12, pp. 91-94, (1972).

127.   Sales, B. D., R. N. Haber, R. Cole, “Mechanisms of aural encoding: V.  Environmental effects of consonants on vowel encoding,” Perception and Psychophysics, 6, pp. 361-365, (1969).

128.   Harley, W. F., Jr., C. C. Wilson, R. Cole, “The influence of perceptual organizing responses on recall,” Psychonomic Science, 11, pp. 135-136, (1968).

129.   Cole, R., B. D. Sales, R. N. Haber, “Mechanisms of encoding the speech sound,” Proceedings of the 76th Annual APA Convention, (1968).

130.   Cole, R., B. D. Sales, R. N. Haber, “Mechanisms of aural encoding: II.  The role of distinctive features in articulation and rehearsal,” Perception and Psychophysics, 6, pp. 343-348, (1969).

131.   Sales, B. D., R. N. Haber, R. Cole, “Mechanisms of aural encoding: III.  Distinctive features for vowels,” Perception and Psychophysics, 4, pp. 321-327, (1968).

132.   Cole, R., R. N. Haber, B. D. Sales, “Mechanisms of aural encoding: I.  Distinctive features for consonants,” Perception and Psychophysics, 3, pp. 281-327, (1968).

 

BOOKS

Cole, R.A. (Ed.), "Perception and Production of Fluent Speech,” Lawrence Erlbaum Associates, Hillsdale, NJ, (1980).

Cole, R. A., J. Mariani, H. Uszkoriet, A. Zaenen and V. Zue (Ed.), "Survey of the State of the Art in Human Language Technology," Cambridge University Press, Cambridge, MA.

 

FILMS

I have produced, along with Alex Rudnicky and Raj Reddy, a professional 16mm sound film entitled "Speech as Eyes See It."  The movie shows the performance of an expert spectrogram reader, Dr. Victor Zue, who is able to determine the phonetic content of an unknown utterance from a speech spectrogram.  The movie shows the problems involved in spectrogram reading and the strategies that are used to overcome these problems. Dr. Zue's achievement has important implications for theories of speech perception, for the use of spectrographic feedback during speech therapy, and for machine recognition of speech.

 

PROFESSIONAL SERVICES

Invited by the National Science Foundation to edit the international survey "State of the Art in Speech and Natural Language Processing” sponsored jointly by the Directorate General XIII of the European Commission and the National Science Foundation, January 1994.

 

Invited to participate in Forum of Federal Information and Communications R&R to provide critical feedback on Strategic Implementation Plan, "America in the Age of Information", July 6-7, 1995.

 

Founder and Editor-in-Chief, Free Speech Journal. (http://www.cse.ogi.edu/CSLU/fsj/html/home.html)

 

Workshop Organizer, NSF Workshop on Spoken Language Understanding, Washington DC, 1992.

 

Workshop Organizer, NSF Interactive Systems Grantees Workshop,  Stevenson, WA, August 1997.

 

Workshop Organizer, NSF Language Resources Workshop, Stevenson, WA,  August 1997.

 

Workshop Organizer, NSF Workshop on International Collaboration in Computer Science, Stevenson, WA, October 1997.

 

Workshop Organizer, NSF Workshop on Western Hemisphere Collaboration in Computer Science, Orlando, FL, February, 1999.

 

Workshop Organizer, NSF Workshop on Western Hemisphere Collaboration in Computer Science,  Manzanillo, Colima, Mexico, August 1999.

 

Workshop Organizer, Workshop on US-Argentina and US-Chile Collaborative Research on Computer Science and Engineering, Buenos Aires, Argentina and Santiago, Chile.  Sponsored by the National Science Foundation.  (40 attendees per workshop), (4/00).

 

INVITED TALKS:

Between 5 and 10 annually at various universities, industry forums, and professional societies.

 

EDITORIAL BOARDS:

Associate Editor, Journal of Experimental Psychology: Human Perception and Performance, 1976-1978.

Assistant Editor, “International Journal of Speech Technology, (Current).

Associate Editor, Speech Communication, (Current).

Associate Editor, Computer Speech & Language, (Current).