Ronald
Allan Cole
PERSONAL
DATA
| Office
Address: |
Center
for Spoken Language Research
University
of Colorado, Boulder
Campus
Box 594 Boulder, CO. 80302-0258 |
| Telephone: |
303-735-5109 |
| Fax: |
303-735-5072 |
| Email: |
cole@cslr.colorado.edu |
ACADEMIC
TRAINING
| Institution |
Degree |
Date |
| Univ.
of Rochester |
B.A.
with distinction in Psychology |
1963-1967 |
| Univ.
of California at Riverside |
M.A.
in Psychology |
1967-1969 |
|
Univ.
of California at Riverside |
Ph.D.
in Psychology, Doctoral Dissertation: "Phoneme Independence
in short-term memory" Thesis Advisor: Dr. Terrence Kenney |
1969-1971 |
PROFESSIONAL
EXPERIENCE
| Oct.
1998 - Present |
Professor,
Dept. of Psychology, Department of Computer Science
Director,
Center for Spoken Language Understanding, University of Colorado,
Boulder. |
| Apr.
1992 - Oct. 1998 |
Professor,
Dept. of Computer Science & Engineering
Director,
Center for Spoken Language Understanding Oregon Graduate Institute. |
| Aug.
1988 - Apr. 1992 |
Associate
Professor, Dept. of Computer Science & Engineering, Oregon
Graduate Institute. |
| May
1980 - Aug. 1988 |
Senior
Project Scientist, Department of Computer Science, Carnegie Mellon
University. |
| Jan.
1975 - Apr. 1980 |
Associate
Professor, Department of Psychology, Carnegie Mellon University. |
| Jan.-
June 1974 |
Visiting
Lecturer, Department of Linguistics, Tel-Aviv University, Tel-Aviv,
Israel. |
| July
1974 - Dec. 1974 |
Associate
Professor, Department of Psychology, University of Waterloo. |
| July
1970 - June 1974 |
Assistant
Professor, Department of Psychology, University of Waterloo. |
COURSES TAUGHT
Undergraduate |
Graduate |
Short Course |
| Intro
to Psychology |
Perception |
Computer
Speech Recognition: The
State
of the Art |
| Perception
and Cognition |
Psycholinguistics |
|
| Psycholinguistics |
Time
Perception |
|
| Psychology
of Consciousness |
Speech
Perception |
|
| Research
Methods in Perception |
Acoustic
Phonetic |
|
| Biofeedback |
Spectrogram
Reading The Structure of Spoken Language |
|
MASTERS
THESES SUPERVISED
| Scott,
B.L. |
The
verbal transformation effect as a function of embedded sounds,
University of Waterloo, September, 1971. |
| Freilich,
I. |
Cognitive
rigidity and the alpha rhythm of the human electro-encephalogram;
University of Waterloo, 1972. |
| Singer,
J. |
The
effects of repetition on perception and response, University of
Waterloo, September 1974. |
| Raz,
I. |
The
extent of invariance for Hebrew consonants, Tel-Aviv University,
December 1975. |
| Gopalakrishnan,
M |
Segmenting
speech into broad phonetic categories using neural networks; Oregon
Graduate Institute, August 1990. |
| Rooker,
T. |
Formant
estimation from a spectral slice using neural networks. Oregon Graduate Institute, August 1990. |
| Zhou,
L. |
Speaker-independent
neural network pitch tracker with telephone bandwidth, speech
for computer speech recognition.
Oregon Graduate Institute, April 1991. |
| Roginski,
K. |
A
Neural Network Phonetic Classifier for Telephone Speech, Oregon
Graduate Institute, November 1991. |
|
Jain,
N. |
A
New Approach to Voice Dialing, Oregon Graduate Institute, July
1995. |
DOCTORAL
DISSERTATIONS SUPERVISED
| Scott,
B.L. |
Speech
perception: a theory and application, University of Waterloo,
August 1974. |
| Jakimik,
J. |
The
interaction of sound and knowledge during word recognition from
fluent speech, Carnegie Mellon University, June 1979. |
| Rudnicky,
A. |
The
role of language experience in language perception: An ecogical
theory. Carnegie Mellon University, June 1979. |
| Muthusamy,
Y. |
A
segmental approach to automatic language identification, Oregon
Graduate Institute, July 1993. |
RESEARCH GRANTS
| 2000
– 2005 |
Cole, R., Massaro, D., van Santen, J., Movellan,
J., “ITR: Creating the
Next Generation of Intelligent Animated Conversational Agents,”
$4,000,000, NSF. |
| 1999
- 2002 |
Cole, R., "CRCD: An Interactive Curriculum
in Human Language Technology for Undergraduate and Graduate Education
and Research," $400,000, NSF. |
| 1999
- 2001 |
Cole, R., "Advancing Human Language Technology
in Brazil and the United States Through Collaborative Research
on Portuguese Spoken Language Systems," $199,481, NSF. |
| 1998
- 2002 |
Cole, R., "CARE: Accessible Language Resources
for Research and Education," $1,200,000, NSF. |
| 1998
– 2000 |
Cole,
R., "Accelerating Research Advances in Human Language Technology
through Portable, Accessible Natural Dialogue Systems," $600,000,
ONR/DARPA. |
| 1998 |
Cole, R., “Making Spoken Language Systems Ubiquitous:
Long-Term Research Agenda,” $150,000, Intel Corporation. |
| 1997-1999 |
Cole,
R., Y. Yan, “A Phonetic Knowledge-Guided Approach to Speaker Adaption
for Large Vocabulary Continuous Speech Recognition,” $94,375,
Office of Naval Research. |
| 1997-1998 |
Cole,
R., “Large Vocabulary Continuous Speech Recognition,” $100,000,
fonix Corporation. |
| 1997-1998 |
Cole,
R., “Speech Recognition Evaluation,”$72,675, Nortel Corporation.
|
| 1997-1998 |
Fanty,
M., R. Cole, Y. Yan, “CISE Research Instrumentation: File Server
and Storage Server for Spoken Language Technologies,” $50,000,
National Science Foundation. |
| 1997-2000 |
Cole,
R., M. Macon, D. Massaro, A. Waibel, “Creating Conversational
Agents for Language Training,” $1,800,000, National Science Foundation. |
| 1997-1998 |
Cole,
R., J. de Villiers, D. Massaro, “Bringing Spoken Language Systems
to the Classroom for Learning Language Training with Hearing Impaired
People,” $50,000, National Science Foundation. |
| 1997 |
Cole
R., French, Italian, Mexican, and Caribbean Spanish Data Collection,
$199,906, AT & T. |
| 1997 |
Cole,
R., “Understanding the role of International Collaboration in
Computer Science and Engineering,” (NSF Workshop), $49,946, National
Science Foundation. |
| 1997 |
Cole,
R., Intel Equipment Grant, $200,000, (40 Pentium computers). |
| 1997 |
Cole,
R., "Proposal for the Second NSF Grantees Workshop in Interactive
Systems," $120,925, National Science Foundation. |
| 1996-1998 |
Cole,
R., G. Whitney, D. Jonassen, B. Moeller, S. Carver, "Conceptualization
of Flagship Center for Collaborative Research in Learning and
Human Language Technologies,” $50,000, National Science Foundation. |
| 1996-1997 |
Cole
R., Basic Research/Education Support, $45,913, Intel Corp. |
| 1996-1997 |
Cole,
R., S. Sutton, "Advancing Human Language Technology in Mexico
and the U.S. Through Collaborative Research on Spoken Language
Systems," $100,000, National Science Foundation. |
| 1996-1998 |
Cole,
R., D. Novick, M. Fanty, “Rapid Prototyping of Spoken Language
Systems,” $1,300,000, Office of Naval Research. |
| 1996-1998 |
Cole,
R., M. Fanty, B. Oshika, "Human Language Resources for Research
in Multilanguage Systems, Robust Recognition, and Speaker Identification,"
$746,890, National Science Foundation. |
| 1996 |
Cole,
R., "Toward Robust Speech Recognition: Improved Measures
of Confidence," $20,000, Texas Instruments. |
| 1994-1995 |
Cole,
R., "Instrumentation for Research in Spoken Language Systems,"
$40,000, National Science Foundation. |
| 1994-1996 |
Cole, R., M. Fanty, D. Novick, E. Barnard, H.
Hermansky, "Spoken Dialogue Technology for Appointment Scheduling,"
$1,400,000, U S West |
| 1994-1995 |
Cole,
R., D. Novick, M. Fanty, "Rapid Prototyping and Deployment
of Spoken Language Systems," $450,000, U.S. Office of Naval
Research/U.S. Census Bureau. |
| 1994-1995 |
Cole,
R., "Joint E.C.-U.S. Survey of the State of the Art in Human
Language Technology,” $58,000, National Science Foundation. |
| 1994-1998 |
Cole,
R., "Toward Rapid Development and Deployment of Spoken Language
Systems,” $150,646, U.S. Office of Naval Research/AASERT. |
| 1994-1995 |
Cole,
R., D. Novick, M. Fanty, E. Barnard, H. Hermansky, "Toward
Robust Spoken-Language Systems,” $829,886, NSF/ARPA. |
| 1993-1996 |
Cole,
R., B. Oshika, E. Barnard, “Linguistic Units for Language Identification,”
$697,162, Department of Defense. |
| 1993-1994 |
Cole,
R., "Digital Data Collection Platform and Software,” $41,000,
Linguistic Data Consortium. |
| 1993-1995 |
Cole, R., E. Barnard,
T. Leen, “Task-based Analysis and Stochastic Search
in Neural Networks, $325,000,
Office of Naval Research. |
| 1993-1994 |
Cole,
R., “Telephone Data Collection,” $65,325, Apple Computer Inc. |
| 1993-1996 |
Cole,
R., "Automatic Language Identification: A Distinctive Feature
Approach," $110,323, Office of Naval Research, AASERT Award. |
| 1993-1994 |
Cole,
R., D. Novick, M. Fanty, “Voice questionnaire for the Year 2000
Census,” $125,000, U.S. Bureau of the Census, (grant administered
through the U.S. Office of Naval Research). |
| 1993-1999 |
Cole,
R., D. Novick, “Graduate Traineeships for Under-Represented Minorities
for Research in Spoken-Language Interfaces,” $557,500, National
Science Foundation. |
| 1993-1994 |
Cole,
R., D. Novick, "Real Time Voice Response Questionnaire for
the Year 2000 Census,” $61,912, Digital Equipment Corporation.
|
| 1992 |
Cole,
R., S. Zahorian, L. Hirschman, “Workshop on Spoken Language Understanding,”
$35,700, National Science Foundation. |
| 1992-1995 |
Cole,
R., M. Fanty, "Spoken Letter Recognition,” $360,000/3-year,
National Science Foundation. |
| 1991-1993 |
Cole,
R., M. Fanty, T. Leen, "Neural Network Approaches to Spoken
Letter Recognition,” $230,000, Office of Naval Research. |
| 1992-1993 |
Cole,
R., M. Fanty, T. Leen, "Instrumentation for the Center for
Spoken Language Understanding,” $43,000, National Science Foundation. |
| 1991 |
Fanty,
M., R. Cole, "A Portable Interactive Environment for Computer
Speech Recognition,” $83,000, National Science Foundation. |
| 1991 |
Cole
R., Equipment donation, $108,000, Digital Equipment Corporation. |
| 1991-1992 |
Cole,
R., M. Fanty, "English Alphabet Recognition Over Telephone
Lines," $230,000, US West Advanced Technologies. |
| 1991-1992 |
Cole,
R., "Speaker-Independent Recognition of Arbitrary Sets of
Words," $100,000, US West Technologies. |
| 1990-1992 |
Cole,
R., M. Fanty, Cash donation, $40,000, and equipment donation,
$43,000, Apple Computer. |
| 1990-1992 |
Cole,
R., "Phonetic Classification of Continuous Speech,” $50,000/year,
National Science Foundation. |
| 1990 |
Cole,
R., M. Fanty, "Feasibility of Speech Recognition on a VLSI
Neurocomputer,” Matching Funds Grant, $20,000, OASIS. |
| 1985-1987 |
Cole,
R., "Phonetic Classification of Continuous Speech,"
$580,000, National Science Foundation. |
| 1982-1985 |
Cole,
R., "Knowledge Engineering and Knowledge Acquisition in Speech
Understanding Research," $480,000, National Science Foundation. |
| 1972-1975 |
Cole,
R., "What we hear during speech," $30,000, National
Research Council of Canada. |
| 1972 |
Cole,
R., "Hearing through the skin," Operating grant, $2000,
University of Waterloo Research Grant. |
| 1970-1971 |
Cole,
R., Operating grant, $4000, National Research Council of Canada. |
| 1970-1971 |
Cole,
R., Computing grant, $1300, National Research Council of Canada. |
PUBLICATIONS
|
1.
Jiyong
Ma, Jie Yan and Ron Cole, in CU Animate:Tools for Enabling Conversations
with Animated Characters, in Submitted to: ICSLP-2002, Denver,
Colorado, USA,Sept 2002., pp. 4, USA, Sep, 2002. |
|
2.
Hosom,
J.P., Cole, R.A., “Burst Detection Based on Measurements of Intensity
Discrimination.” Proceedings
of ICSLP 2000. (pp. IV-564 --
IV-567). Bejing, China
2000.
|
|
3.
Shobaki,
K., Hosom, J.P., Cole, R., “The OGI Kids’ Speech Corpus and Recognizers.”
Proceedings of ICSLP 2000. (pp.
IV-564 -- IV-567). Bejing
China 2000.
|
|
4.
Cole,
R.A., Serridge, B., Hosom, J.P., Cronk, A., and Kaiser, E., "A
Platform for Multilingual Reserach in Spoken Dialogue Systems."
Workshop on Multi-lingual Interoperability in Speech Technology
(MIST), The Netherlands, September 1999. |
|
5.
Massaro,
D. W., Cohen, M. M., Daniel, S., & Cole, R. A. (1999). Developing
and evaluating conversational agents. In P. A. Hancock (Ed.) Human
Factors and Ergonomics: Perceptual and Cognitive Principles. (Handbook
of Perception & Cognition, 2nd Edition). (pp. 173-194). San
Diego, CA: Academic Press. |
|
6.
Cole,
R. A., "Tools for research and education in speech science,"
In Proceedings of the International Conference of Phonetic Sciences,
San Francisco, CA, Aug 1999. |
|
7.
T.
Carmell, J.P. Hosom, and R. Cole. A computer-based course in spectrogram
reading. In Proceedings of ESCA/SOCRATES Workshop on Method and
Tool Innovations for Speech Science Education, London, UK, Apr
1999. |
|
8.
Ron
Cole, Dominic W. Massaro, Jacques de Villiers, Brian Rundle, Khaldoun
Shobaki, Johan Wouters, Michael Cohen, Jonas Beskow, Patrick Stone,
Pamela Connors, Alice Tarachow, and Daniel Solcher. New tools
for interactive speech and language training: Using animated conversational
agents in the classrooms of profoundly deaf children. In Proceedings
of ESCA/SOCRATES Workshop on Method and Tool Innovations for Speech
Science Education, London, UK, Apr 1999. |
|
9.
D.
Massaro, Cohen, M. M., Beskow, J., Daniel, S., Cole, R., "Developing
and Evaluating Conversational Agents," Workshop on Embodied
Conversation Characters (WECC), Lake Tohoe,
1998, http://mambo.ucsc.edu:80/psl/wecc3.rtf. |
|
10.
Cosi,
P., J.P. Hosom, J. Schalkwyk, S. Sutton, and R. A. Cole, "Connected
Digit Recognition Experiments with the OGI Toolkit's Neural Network
and HMM-Based recognizers," In Proceedings,
4th IEEE Workshop on Interactive Voice Tehcnology for
Telecommunications Applications (IVTTA-ETWR98), Turin, Italy,
(September 1998). |
|
11.
Cole,
R.A., M. Noel, and V. Noel. The CSLU Speaker Recognition Corpus.
In Proceedings of ICSLP, Sydney, Australia, 1998. |
|
12.
Hosom,
J. P., R. A. Cole, P. Cosi, "Evaluation and Integration of
Neural-Network Training Techniques for Continuous Digit Recognition,"
In Proceedings of the International
Conference on Spoken Language Processing (ICSLP), Sydney,
Australia, (November 1998). |
|
13.
Serridge,
B., R. A. Cole, A. Barbosa, N. Munive, A. Vargas, "Creating
a Mexican Spanish Version of the CSLU Toolkit," In Proceedings of the International Conference on Spoken Language Processing
(ICSLP), Sydney, Australia, (November 1998). |
|
14.
Sutton,
S., R. A. Cole, J. deVilliers, J. Schalkwyk, P. Vermeulen, M.
Macon, Y. Yan, E. Kaiser, B. Rundle, K. Shobaki, P. Hosom, A.
Kain, J. Wouters, D. Massaro, M. Cohen, "Universal Speech
Tools: The CSLU Toolkit,"
In Proceedings of the International
Conference on Spoken Language Processing (ICSLP), Sydney,
Australia, (November 1998). |
|
15.
Cole,
R., T. Carmell, P. Connors, M. Macon, J. Wouters, J. de Villiers,
A. Tarachow, D. Massaro, M.Cohen, J. Beskow, J. Yang, U. Meier,
A. Waibel, P. Stone, G. Fortier,
A. Davis, C. Soland, “Intelligent Animate Agents for Interactive
Language Training,” presented at STILL ’98, Stockholm, Sweden,
May 1998. |
|
16.
Yan,
Y., X. Wu, J. Schalkwyk, R. A. Cole, "Development of CLSU
LVCSR: The 1997 DARPA HUB4 Evaluation System," In DARPA Broadcast News Transcription and Understanding Workshop, (1998). |
|
17.
Cole,
R., S. Sutton, Y. Yan, P. Vermeulen, M. Fanty, “Accessible technology
for interactive systems: A new approach to spoken language research,”
In Proceedings of the International Conference
on Acoustics, Speech and Signal Processing, Seattle, WA.,
1998. |
|
18.
Cole,
R., D. G. Novick, M. Fanty, P. Vermeulen and S. Sutton, "Experiments
with a Spoken Dialogue System for Taking the U.S. Census,"
Special Edition: Speech Communications, (1998, in Press). |
|
19.
Cole,
R., & V. Zue, 1997. Spoken
Language Input. In Cole,
R.A., J. Mariani, H. Uszkoriet, A. Zaenen, & V. Zue (eds.),
Survey of the State of the Art in Human Language Technology, (1, pp. 1-49), Cambridge University Press. |
|
20.
Cole,
R., 1997. Spoken Output
Technologies. In Cole,
R.A., J. Mariani, H. Uszkoriet, A. Zaenen, & V. Zue (eds.),
Survey of the State of the Art in Human Language Technology, (5, pp. 165-214), Cambridge University Press. |
|
21.
Cole,
R., 1997. Mathematical
Methods. In Cole, R.A., J. Mariani, H. Uszkoriet,
A. Zaenen, & V. Zue (eds.), Survey
of the State of the Art in Human Language
Technology, (11, pp. 337-369), Cambridge University Press. |
|
22.
Cole,
R., 1997. Language Resources.
In Cole, R.A., J. Mariani, H. Uszkoriet, A. Zaenen, & V. Zue
(eds.), Survey of the State of the Art in Human
Language Technology,
(12, pp. 381-403), Cambridge University Press. |
|
23.
Computer
Science and Telecommunications Board, National Research Council,
“More Than Screen Deep: Toward
Every-Citizen Interfaces to the Nation’s Information Infrastructure,”
National Academy Press, Washington, D.C., 1997. |
|
24.
Sutton, S., E. Kaiser,
A. Cronk, and R. Cole, "Bringing spoken language systems
to the classroom", EUROSPEECH'97,
Rhodes, Greece, (1997). |
|
25.
Tu X., Y. Yan, R. Cole,
"Matching training and testing criteria in hybrid speech
recognition systems", EUROSPEECH'97,
Rhodes, Greece (1997). |
|
26.
Cole R., S. Sutton, M.
Fanty, E. Kaiser, J. Schalkwyk, J. de Villiers, A. Cronk, Colton,
"Cyberspeech: Password to Cyberspace," DAIC Workshop, Seattle, WA, (1997). |
|
27.
Yan, Y., M. Fanty, R.
Cole, “Speech recognition using neural networks with forward-backward
probability generated targets,” Proceedings
of the International Conference on Acoustics
Speech and Signal Processing, Munich, (1997). |
|
28.
Hosom, J.P., R. Cole,
“A diphone-based digit recognition system using neural networks,”
Proceedings of the International Conference
on Acoustics Speech and Signal Processing, Munich, (1997). |
|
29.
Cole, R. A., D. G. Novick,
M. Fanty, P. Vermeulen, S. Sutton, "Experiments with a spoken
dialogue system for taking the U.S. census," Free Speech Journal, (1996). |
|
30.
Yan Y., E. Barnard, R.
Cole, “Development of an approach to automatic language identification
based on phone recognition,” Computer,
Speech & Language, Vol. 10(1), pp. 37-54, January, (1996). |
|
31.
Sutton, S., D. Novick,
R. Cole, M. Fanty, “Building 10,000 spoken-dialogue systems,”
Proceedings of the International Conference
on Spoken Language Processing, Philadelphia, PA, (1996). |
|
32.
Hu, Z., J. Schalkwyk,
E. Barnard, R. Cole, “Speech recognition using syllable-like units,”
Proceedings of the International Conference
on Spoken Language Processing, Philadelphia, PA, (1996). |
|
33.
Cole, R., Y. Yan, T.
Bailey, “The influence of bigram constraints on word recognition
by humans: Implications for computer speech recognition,” Proceedings of International Conference on Spoken Language Processing, Philadelphia, PA, (1996). |
|
34.
Cole, R., Y. Yan, B.
Mak, M. Fanty, T. Bailey, “The contribution of consonants versus
vowels in word recognition of fluent speech,” Proceedings
of the International Conference on Acoustics,
Speech and Signal Processing, Atlanta, Georgia, (1996). |
|
35.
Jain, N., R. Cole, E.
Barnard, “Creating speaker-specific phonetic templates with a
speaker-independent phonetic recognizer: Implications for voice
dialing,” Proceedings of the International Conference on Acoustics, Speech
and Signal Processing, Atlanta, Georgia, (1996). |
|
36.
Colton, L. D., R. Cole,
D. G. Novick, S. Sutton, “A laboratory course for designing and
testing spoken dialogue systems,” Proceedings
of the International Conference on Acoustics, Speech and Signal Processing, Atlanta, Georgia, (1996). |
|
37.
Hu, Z., E. Barnard, R.
Cole, "Transition-based feature extraction within frame-based
recognition," Proceedings
of the Fourth European Conference on Speech Communication and
Technology, Madrid, Spain, (1995). |
|
38.
Colton, L. D., M. Fanty,
R. Cole, "Second pass verification improves N-Way forced
choice recognition and out-of-vocabulary rejection," Proceedings of the Fourth European Conference on Speech Communication
and Technology, Madrid, Spain, (1995). |
|
39.
Noel, M., R. Cole, T.
Durham, T. L. Lander, "New telephone speech corpora at CSLU,"
Proceedings of the Fourth European Conference
on Speech Communication and Technology, Madrid, Spain, (1995). |
|
40.
Lander, T., R. Cole,
B. Oshika, M. Noel, "The OGI 22 language telephone speech
corpus," Proceedings of the Fourth European Conference
on Speech Communication and Technology, Madrid, Spain, (1995). |
|
41.
Lander, T., B. T. Oshika,
R. Cole, M. Fanty, "Multi-language speech database: creation
and phonetic labeling agreement," Proceedings
of the International Congress of Phonetic Science, Stockholm, Sweden, (1995). |
|
42.
Fanty, M., E. Barnard,
R. Cole, "Alphabet recognition," by invitation to the
Handbook of Neural Computation, (1995). |
|
43.
Barnard, E., R. Cole,
M. Fanty, P. Vermeulen, "Real-world speech recognition with
neural networks," (invited paper),
Proceedings of the International Symposium on Aerospace/Defense
Sensing & Control and Dual-Use Photonics,
International Society for Optical Engineering, Technical Conference
no. 2492, Orlando, FL, (1995). |
|
44.
Sutton, S., B. Hansen,
T. Lander, D. G. Novick, R. Cole, "Evaluating the effectiveness
of dialogue for an automated spoken questionnaire," AAAI 1995 Spring Symposium Series, Stanford University, (1995). |
|
45.
Cole, R., L. Hirschman
et al., "The challenge of spoken language systems: Research
directions for the nineties," IEEE
Transactions on Speech and Audio Processing, 1, pp. 1-21,
(1995). |
|
46.
Cole, R., B. T. Oshika,
M. Noel, T. Lander, M. Fanty, "Labeler agreement in phonetic
labeling of continuous speech," Proceedings of the 1994 International
Conference on Spoken Language Processing, Yokohama, Japan, (1994). |
|
47.
Cole, R., M. Fanty, M.
Noel, T. Lander, "Telephone speech corpus development at
CSLU," Proceedings of the 1994 International Conference
on Spoken Language Processing, Yokohama, (1994). |
|
48.
Cole, R., D. G. Novick,
M. Fanty, P. Vermeulen, S. Sutton, D. Burnett and J. Schalkwyk,
"A prototype voice-response questionnaire for the U.S. census,"
Proceedings of the 1994 International Conference on Spoken Language
Processing, Yokohama, Japan, (1994). |
|
49.
Schalkwyk, J., E. Barnard,
R. Cole, J. R. Sachs, "Detecting an imposter in telephone
speech," Workshop on
Automatic Speaker Recognition, Identification and Verification,
Martigny, Switzerland, (1994). |
|
50.
Cole, R., M. Noel, D.
C. Burnett, M. Fanty, T. Lander, B. Oshika, S. Sutton, "Corpus
development activities at the Center for Spoken Language Understanding,"
Proceedings of the ARPA Workshop on Human Language Technology,
April 7-11, (1994). |
|
51.
Muthusamy, Y.K., N. Jain,
R. Cole, "Perceptual benchmarks for automatic language identification,"
Proceedings of the 1994
International Conference on Acoustics, Speech and Signal Processing, (1994). |
|
52.
Berkling, K.M., T. Arai,
E. Barnard, R. Cole, "Analysis of phoneme-based features
for language identification," Proceedings
of the 1994 International Conference on Acoustics, Speech
and Signal Processing, (1994). |
|
53.
Cole, R., D. G. Novick,
D. Burnett, B. Hansen. S. Sutton, M. Fanty, “Towards automatic
collection of the U.S. census,” Proceedings
of the 1994 International Conference on Acoustics, Speech
and Signal Processing, (1994). |
|
54.
Cole, R., D. G. Novick,
M. Fanty, S. Sutton, B. Hansen, D. Burnett, “Rapid prototyping
of spoken language systems: The year 2000 census project,” Proceedings of the International Symposium on Spoken Dialogue, Tokyo, Japan, (1993). |
|
55.
Muthusamy Y., K. Berkling,
T. Arai, R. Cole, E. Barnard, “A comparison of approaches to automatic
language identification using telephone speech, “EUROSPEECH '93, Berlin, Germany, (1993). |
|
56.
Schmid, P., R. Cole,
M. Fanty, H. Bourlard, M. Haessen, “Real-time, neural network-based,
French alphabet recognition with telephone speech,” EUROSPEECH '93, Berlin, Germany, (1993). |
|
57.
Cole, R., Y. K. Muthusamy,
“Perceptual studies on vowels excised from continuous speech,”
Proceedings of the International Conference
on Spoken Language Processing, Banff, Alberta, (1992). |
|
58.
Muthusamy, Y. K., R.
Cole, “Automatic segmentation and identification of ten languages
using telephone speech,” Proceedings
of the International Conference on Spoken Language Processing, Banff, Alberta, (1992). |
|
59.
Fanty, M., J. Pochmara,
R. Cole, “An interactive environment for speech recognition research,”
Proceedings of the International
Conference on Spoken Language Processing, Banff, Alberta,
(1992). |
|
60.
Muthusamy, Y. K., R.
Cole, B. T. Oshika, “The OGI multi-language telephone speech corpus,”
Proceedings of the International Conference
on Spoken Language Processing, Banff, Alberta, (1992). |
|
61.
Cole, R., M. Fanty, K.
Roginski, “A telephone speech database of spelled and spoken names,”
Proceedings of the International Conference
on Spoken Language Processing, Banff, Alberta, (1992). |
|
62.
Muthusamy, Y. K., R.
Cole, B. T. Oshika, “Automatic language identification,” Voice Systems Worldwide Speech Tech '92 Conference, New York, NY,
(1992). |
|
63.
Cole, R., M. Fanty, K.
Roginski, “Recognizing spelled names with telephone speech,” Voice Systems Worldwide Speech Tech '92 Conference,
New York, NY, (1992). |
|
64.
Creekmore, J., M. Fanty,
R. Cole, “A comparative study of five spectral representations
for speaker-independent phonetic recognition,” presented at 25th Annual Asilomar Conference
on Signals, Systems, and Computers, Pacific Grove, CA, (1991). |
|
65.
Fanty, M., R. Cole, M.
Slaney, “A comparison of DFT, PLP and Cochleagram for alphabet
recognition,” 25th Annual Asilomar Conference on Signals,
Systems, and Computers,
Pacific Grove, CA, (1991). |
|
66.
Muthusamy, Y.K., R. Cole,
“A segment-based automatic language identification system,” in
J.E. Moody, S. J. Hanson, R.P. Lippmann, editors, Advances in Neural Information Processing Systems 4, San Mateo, CA, (1992). Morgan Kaufmann Publish. |
|
67.
Cole, R., K. Roginski,
M. Fanty, “English alphabet recognition with telephone speech,”
in J.E. Moody, S. J. Hanson, R.P. Lippmann, editors, Advances in Neural Information Processing Systems 4, San Mateo, CA, (1992). Morgan Kaufmann Publishers. |
|
68.
Cole, R., M. Fanty, K.
Roginski, “Speaker-independent name retrieval from spellings using
a database of 50,000 names: Application to telephone speech,”
'91, Genova, Italy, Sep. (1991). |
|
69.
Janssen, R.D.T., M. Fanty,
R. Cole, “Speaker-independent phonetic classification in continuous
english letters,” Proceedings
of the International Joint Conference on Neural Networks,
Seattle, WA, (1991). |
|
70.
Muthusamy, Y. K., R.
Cole, M. Gopalakrishnan, “A segment-based approach to automatic
language identification,” Proceedings
of the 1991 International Conference on Acoustics, Speech
and Signal Processing, May 14-17, Toronto, (1991). |
|
71.
Cole, R., M. Fanty, M.
Gopalakrishnan, R. D. T. Janssen, “Speaker-independent name retrieval
from spellings using a database of 50,000 names,” Proceedings of the 1991 International Conference on Acoustics, Speech and Signal Processing, May 14-17,
Toronto, (1991). |
|
72.
Fanty, M., R. Cole, “Spoken
letter recognition,” in R. Lippmann, J. Moody, D. Touretzky (Ed.),
Advances in Neural Information Processing
Systems, San Mateo, CA: Morgan Kaufmann Publishers, (1991). |
|
73.
Fanty, M., R. Cole, “Speaker-independent
english alphabet recognition: Experiments with the E-Set,” Proceedings
of the 1990 International Conference on Spoken Language Processing,
Kobe, Japan, (1990). |
|
74.
Atlas, L., R. Cole, Y.
Muthusamy, A. Lippman, G. Connor, D. Park, M. El-Sharkawi, R.
Marks II, “A performance comparison of trained multi-layer perceptrons
and trained classification trees,” Proceedings
of the IEEE (Special Issue on Neural Networks) (1990). |
|
75.
Cole, R., M. Fanty, "Spoken
letter recognition,” Proceedings
of the DARPA Workshop on Speech
and Natural Language Processing, Hidden Valley, PA, (1990). |
|
76.
Cole, R., M. Fanty, Y.
K. Muthusamy, M. Gopalakrishnan, “Speaker-independent recognition
of spoken english letters,” Proceedings
of the International Joint Conference on Neural Networks '90, San Diego, CA, (1990). |
|
77.
Muthusamy, Y. K., R.
Cole, M. Slaney, “speaker-independent vowel recognition: spectrograms
versus cochleagrams,” Proceedings
of the IEEE 1990 International Conference on Acoustics,
Speech and Signal Processing, Albuquerque, New Mexico, (1990). |
|
78.
Atlas, L., W. Kooiman,
P. Loughlin, R. Cole, “New nonstationary techniques for the analysis
and display of speech transients,” Proceedings
of the IEEE 1990 International Conference on Acoustics, Speech and Signal Processing, Albuquerque, New Mexico,
(1990). |
|
79.
Cole, R., Y. K. Muthusamy,
L. Atlas, “Speaker-independent vowel recognition: Comparison of
backpropagation and trained classification trees,” Proceedings of the IEEE Hawaii International Conference on System Sciences No. 23, Kona-Kailua,
Hawaii, (1990). |
|
80.
Barnard, E., R. Cole,
M. P. Vea, F. Alleva, “Pitch detection with a neural-net classifier,”
IEEE Transactions on Acoustics, Speech &
Signal Processing, (1991). |
|
81.
Atlas, L., R. Cole, Y.
K. Muthusamy, J. Taylor, E. Barnard, “Performance comparisons
between backpropagation networks and classification trees on three
real-world applications,” Proceedings
of the Conference on Neural Information Processing Systems,
Denver, CO, (1989). |
|
82.
Atlas, L. E., J. Connor,
D. Park, M. El-Sharkawi, R. Marks II, A. Lippman, R. Cole, Y.
K. Muthusamy, “A performance comparison of trained multi-layer
perceptrons and trained classification trees,” Proceedings
of the IEEE Systems, Man and Cybernetics Society Conference,
Cambridge, MA, (1989). |
|
83.
Cole, R., J. W. T. Inouye,
Y. K. Muthusamy, M. Gopalakrishnan, “Language identification with
neural networks: a feasibility study,” Proceedings
of the IEEE Pacific Rim Conference on Communications, Computers
and Signal Processing, Victoria B.C., (1989). |
|
84.
Cole, R., L. Hou, “Segmentation
and broad classification of continuous speech,” Proceedings of the IEEE International Conference
on Acoustics, Speech, and Signal Processing, New York, (1988). |
|
85.
Cole, R., “Phonetic classification
in new generation speech recognition systems,” Speech Tech 86, pp. 43-46, New York, (1986). |
|
86.
Cole, R., M. P. Phillips,
R. A. Brennan, B. Chigier, “The CMU phonetic classification system,”
Proceedings of the IEEE
International Conference on Acoustics,
Speech, and Signal Processing,
Tokyo, (1986). |
|
87.
Jakimik, J. A., R. Cole,
A. I. Rudnicky, “Sound and spelling in spoken word recognition,”
Journal of Verbal Learning and Verbal Behavior,
pp. 165-178, (1985). |
|
88.
Cole, R., R. M. Stern,
M. J. Lasry, “Performing fine phonetic distinctions: templates
vs. features,” in Invariance
and Variability of Speech Processes, ed. J. Perkell and D.
Klatt, Lawrence Erlbaum, New York, (1984). |
|
89.
Cole, R., R. M. Stern,
M. S. Phillips, S. M. Brill, A. P. Pilant, P. Specker, “Feature-based
speaker-independent recognition of isolated english letters,”
Proceedings of the IEEE International
Conference on Acoustics, Speech, and Signal
Processing, pp. 731-734, (1983). |
|
90.
Cole, R., A. I. Rudnicky,
“What's new in speech perception: The research and ideas of William
Chandler Bagley 1874-1946,” Psychological
Review, (1983). |
|
91.
Bradshaw, G. L., R. Cole,
Z. D. Li, “Comparison of learning techniques in speech recognition,”
Proceedings of the IEEE International Conference
on Acoustics, Speech, and Signal Processing, pp. 554-557, (1982). |
|
92.
Haber, R. N., R. Cole,
“Evidence for direct visual access to letter identities,” Acta Psychologica, 46, pp. 181-192, (1980). |
|
93.
Cole, R., “Perception
of fluent speech by children and adults,” in Annals of the New York Academy
of Sciences, 379, pp. 92-102, (1981). |
|
94.
Cole, R., C. A. Perfetti,
“Listening for mispronunciations in a children's story: The use of context by children and adults,”
Journal of Verbal Learning
and Verbal Behavior, 19, pp. 297-315, (1980). |
|
95.
Cole, R., R. N. Haber,
“Reaction time to letter name or letter case,” Acta Psychologica, 44, pp. 281-285, (1980). |
|
96.
Cole, R., V. W. Zue,
“Speech as eyes see it,” in Attention
and Performance, ed. S. Nickerson, Lawrence Erlbaum Associates,
Hillsdale, N.J., (1980). |
|
97.
Cole, R., J. Jakimik,
W. E. Cooper, “Segmenting speech into words,” Journal of the Acoustical
Society of America, 67, pp. 1323-1332, (1980). |
|
98.
Cole, R., J. Jakimik,
“How are syllables used to recognize words?,” Journal of the Acoustical
Society of America, 67, pp. 965-970, (1980). |
|
99.
Winitz, H., D. Ingram,
R. Cole, J. Folkins, “Articulation,” in Annual
Abstracts of Speech, Voice,
Language and Hearing, ed. I. Goldstein, Little, Brown &
Co., Boston, (1979). |
|
100.
Zue, V., R. Cole, “Experiments
on spectrogram reading,” Proceedings
of the IEEE Conference
on Acoustics, Speech and Signal Processing, pp. 116-119, Washington,
D.C., (1979). |
|
101.
Cole, R., A. I. Rudnicky,
V. Zue, D. R. Reddy, “Speech as patterns on paper,” in Perception and Production of Fluent Speech,
ed. R. A. Cole, Lawrence Erlbaum Associates, Hillsdale, NJ, (1980). Cole, R. A., “Navigating the slippery stream
of speech,” Psychology Today,
(1979). |
|
102.
Cole, R., J. Jakimik,
“A model of speech perception,” in Perception
and Production of Fluent
Speech, ed. R. A. Cole, Lawrence Erlbaum Associates, Hillsdale,
NJ, (1980). |
|
103.
Rudnicky, A. I., R. Cole,
“The effect of subsequent context on syllable perception,” Journal of Experimental Psychology: Human
Perception and Performance, 4, pp. 638-647, (1978). |
|
104.
Cole, R., J. Jakimik,
W. E. Cooper, “Perceptibility of phonetic features in fluent speech,”
Journal of the Acoustical Society of America,
64, pp. 44.56, (1978). |
|
105.
Rudnicky, A. I., R. Cole,
“Adaptation produced by connected speech,” Journal of Experimental Psychology:
Human Perception and Performance, 3, pp. 51-61, (1977). |
|
106.
Cole, R., J. Jakimik,
“Understanding speech: How words are heard,” in Information Processing Strategies, ed. G. Underwood, Academic Press,
London, (1978). |
|
107.
Cooper, W. E., D. Billings,
R. Cole, “Articulatory effects on speech perception: A second
report,” Journal of Phonetics,
4, pp. 219-232, (1976). |
|
108.
Cole, R., W. E. Cooper,
“Properties of frication analyzers for /j/,” Journal of the Acoustical Society
of America, 62, pp. 177-182, (1977). |
|
109.
Cole, R., N. Cummings,
“Bilateral alpha rhythm in children during listening and looking,”
in Language development and neurological theory,
eds. S. Segolowitz & F. Gruber, Academic Press, New York,
(1977). |
|
110.
Cole, R., “Invariant
features and feature detectors,” in Language
development and neurological theory,
ed. S. Segolowitz & F. Gruber, Academic Press, New York, (1977). |
|
111.
Cooper, W. E., R. R.
Ebert, R. Cole, “Perceptual analysis of stop consonants and glides,”
Journal of Experimental Psychology: Human
Perception and Performance, 2, pp. 92-104, (1976). |
|
112.
Cooper, W. E., R. R.
Ebert, R. Cole, “Speech perception and production of the consonant
cluster /st/,” Journal of
Experimental Psychology: Human Perception and Performance,
2, pp. 105-1154, (1976). |
|
113.
Cole, R., W. E. Cooper,
“Perception of voicing in english affricates and fricatives,”
Journal of the Acoustical Society of America,
58, pp. 1280-1287, (1975). |
|
114.
Cole, R., W. E. Cooper,
J. Singer, F. Allard, “Selective adaptation of english consonants
using real speech,” Perception
and Psychophysics, 18, pp. 227-244, (1975). |
|
115.
Cole, R., M. Young, “Effect
of subvocalization on memory for speech sounds,” Journal of Experimental Psychology:
Human Learning and Memory, 1, pp. 772-779, (1975). |
|
116.
Cole, R., B. Scott, “Toward
a theory of speech perception,” Psychological
Review, 81, pp. 348-374, (1974). |
|
117.
Cole, R., B. Scott, “The
phantom in the phoneme: Invariant cues for stop consonants,” Perception and Psychophysics, 15, pp.
101-107, (1974). |
|
118.
Cole, R., M. Coltheart,
F. Allard, “Memory of a speaker's voice: Reaction time to same
and different-voiced letters,” Quarterly
Journal of Experimental Psychology, 24, pp. 1-7, (1974). |
|
119.
Sales, B. D., R. Cole,
R. N. Haber, “Mechanisms of aural encoding: VIII. Phonetic interference and contest-sensitive coding in short-term
memory,” Memory and Cognition,
2, pp. 596-600, (1974). |
|
120.
Cole, R., B. D. Sales,
R. N. Haber, “Mechanisms of aural encoding: VII. Differential decay of consonants and vowels in a Petersen and
Petersen STM task,” Memory
and Cognition, 2, pp. 211-214, (1974). |
|
121.
Cole, R., B. Scott, “Perception
of temporal order in speech.
The role of vowel transitions,” Canadian
Journal of Psychology, 27, pp. 441-449, (1973). |
|
122.
Cole, R., “Different
memory functions for consonants and vowels,” Cognitive Psychology, 4, pp. 39-54, (1973). |
|
123.
Cole, R., “Listening
for mispronunciations: A measure of what we hear during speech,”
Perception and Psychophysics, 13, pp.
153-156, (1973). |
|
124.
Cole, R., “Perceiving
syllables and remembering phonemes,” Journal
of Speech and Hearing Research,
16, pp. 37-47, (1973). |
|
125.
Cole, R., R. Haber, B.
Sales, “Mechanisms of aural encoding: VI.
Consonants and vowels are remembered as subsets of distinctive
features,” Perception and
Psychophysics, 13, pp. 87-92, (1973). |
|
126.
Cole, R., B. Scott, “Distinctive
feature control of decision time: Same-different judgments of
simultaneously heard phonemes,” Perception
and Psychophysics, 12, pp. 91-94, (1972). |
|
127.
Sales, B. D., R. N. Haber,
R. Cole, “Mechanisms of aural encoding: V. Environmental effects of consonants on vowel encoding,” Perception and Psychophysics, 6, pp.
361-365, (1969). |
|
128.
Harley, W. F., Jr., C.
C. Wilson, R. Cole, “The influence of perceptual organizing responses
on recall,” Psychonomic
Science, 11, pp. 135-136, (1968). |
|
129.
Cole, R., B. D. Sales,
R. N. Haber, “Mechanisms of encoding the speech sound,” Proceedings of the 76th Annual APA Convention, (1968). |
|
130.
Cole, R., B. D. Sales,
R. N. Haber, “Mechanisms of aural encoding: II. The role of distinctive features in articulation and rehearsal,”
Perception and Psychophysics,
6, pp. 343-348, (1969). |
|
131.
Sales, B. D., R. N. Haber,
R. Cole, “Mechanisms of aural encoding: III. Distinctive features for vowels,” Perception and Psychophysics, 4, pp. 321-327, (1968). |
|
132.
Cole, R., R. N. Haber,
B. D. Sales, “Mechanisms of aural encoding: I. Distinctive features for consonants,” Perception and Psychophysics, 3, pp. 281-327, (1968). |
BOOKS
| Cole,
R.A. (Ed.), "Perception
and Production of Fluent Speech,” Lawrence Erlbaum Associates,
Hillsdale, NJ, (1980). |
| Cole,
R. A., J. Mariani, H. Uszkoriet, A. Zaenen and V. Zue (Ed.), "Survey of the State of the Art
in Human Language Technology," Cambridge
University Press, Cambridge, MA. |
FILMS
I
have produced, along with Alex Rudnicky and Raj Reddy, a professional
16mm sound film entitled "Speech as Eyes See It." The movie shows the performance of an expert
spectrogram reader, Dr. Victor Zue, who is able to determine the phonetic
content of an unknown utterance from a speech spectrogram. The movie shows the problems involved in spectrogram
reading and the strategies that are used to overcome these problems.
Dr. Zue's achievement has important implications for theories of speech
perception, for the use of spectrographic feedback during speech therapy,
and for machine recognition of speech.
PROFESSIONAL
SERVICES
Invited
by the National Science Foundation to edit the international survey
"State of the Art in Speech and Natural
Language Processing” sponsored
jointly by the Directorate General XIII of the European Commission and
the National Science Foundation, January 1994.
Invited
to participate in Forum of Federal Information and Communications R&R
to provide critical feedback on Strategic Implementation Plan, "America in the Age of Information",
July 6-7, 1995.
Founder
and Editor-in-Chief, Free Speech
Journal. (http://www.cse.ogi.edu/CSLU/fsj/html/home.html)
Workshop
Organizer, NSF Workshop on Spoken
Language Understanding, Washington DC, 1992.
Workshop
Organizer, NSF Interactive Systems
Grantees Workshop, Stevenson,
WA, August 1997.
Workshop
Organizer, NSF Language Resources
Workshop, Stevenson, WA, August 1997.
Workshop
Organizer, NSF Workshop on International
Collaboration in Computer Science, Stevenson, WA, October 1997.
Workshop
Organizer, NSF Workshop on Western
Hemisphere Collaboration in Computer Science, Orlando, FL, February,
1999.
Workshop
Organizer, NSF Workshop on Western
Hemisphere Collaboration in Computer Science, Manzanillo, Colima, Mexico, August 1999.
Workshop
Organizer, Workshop on US-Argentina and US-Chile Collaborative Research
on Computer Science and Engineering, Buenos Aires, Argentina and
Santiago, Chile. Sponsored by
the National Science Foundation. (40
attendees per workshop), (4/00).
INVITED TALKS:
Between
5 and 10 annually at various universities, industry forums, and professional
societies.
EDITORIAL BOARDS:
Associate
Editor, Journal of Experimental
Psychology: Human Perception and Performance, 1976-1978.
Assistant
Editor, “International Journal
of Speech Technology, (Current).
Associate
Editor, Speech Communication,
(Current).
Associate
Editor, Computer Speech & Language, (Current).
|