Jean-Marc Valin

Jean-Marc.Valin@USherbrooke.ca

CV

My Blog

Publications

Thesis and Dissertation

  1. J.-M. Valin, Auditory System For a Mobile Robot. PhD Thesis, 102 pp., 2005.
  2. J.-M. Valin, Extension spectrale d'un signal de parole de la bande telephonique a la bande AM. Masters dissertation, 65 pp., 2001.
This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.

Peer-reviewed journals

  1. J.-M. Valin, D. V. Smith, C. Montgomery, T. B. Terriberry, An Iterative Linearised Solution to the Sinusoidal Parameter Estimation Problem, To appear in Computers and Electrical Engineering (Elsevier), 2008
  2. J.-M. Valin, S. Yamamoto, J. Rouat, F. Michaud, K. Nakadai, H. G. Okuno, Robust Recognition of Simultaneous Speech By a Mobile Robot, IEEE Transactions on Robotics, Vol. 23, No. 4, pp. 742-752, 2007.
  3. J.-M. Valin, I. B. Collings, Interference-Normalised Least Mean Square Algorithm, IEEE Signal Processing Letters, Vol. 14, No 12, pp. 988-991, 2007.
  4. J.-M. Valin, On Adjusting the Learning Rate in Frequency Domain Echo Cancellation With Double-Talk. IEEE Transactions on Audio, Speech and Language Processing, Vol. 15, No. 3, pp. 1030-1034, 2007.
  5. J.-M. Valin, F. Michaud, J. Rouat, Robust Localization and Tracking of Simultaneous Moving Sound Sources Using Beamforming and Particle Filtering. Robotics and Autonomous Systems Journal (Elsevier), Vol. 55, No. 3, pp. 216-228, 2007.
  6. F. Michaud, C. Cote, D. Letourneau, Y. Brosseau, J.-M. Valin, E. Beaudry, C. Raievsky, A. Ponchon, P. Moisan, P. Lepage, Y. Morin, F. Gagnon, P. Giguere, M.-A. Roux, S. Caron, P. Frenette, F. Kabanza, Spartacus attending the 2005 AAAI conference. Autonomous Robots (Springer), Vol. 22, No. 4, pp. 369-383, 2007.
  7. S. Yamamoto, K. Nakadai, M. Nakano, H. Tsujino, J.-M. Valin, K. Komatani, T. Ogata, H. G. Okuno, Simultaneous Speech Recognition based on Automatic Missing-Feature Mask Generation integrated with Sound Source Separation (in Japanese). Journal of Robotic Society of Japan, Vol. 25, No. 1, 2007.
  8. S. Yamamoto, K. Nakadai, M. Nakano, H. Tsujino, J.-M. Valin, R. Takeda, K. Komatani, T. Ogata, H. G. Okuno, Improving Location-Based Speech Recognition of Simultaneous Speech Signals by Parameter Optimization with Genetic Algorithm (in Japanese). Human Interface, Vol.8, No.2, pp. 203-212, 2006.
  9. D. Létourneau, F. Michaud, J.-M. Valin, Autonomous Mobile Robot That Can Read. EURASIP Journal on Applied Signal Processing, Special Issue on Advances in Intelligent Vision Systems: Methods and Applications, pp. 2650-2662, 2004.

Peer-reviewed conferences and workshops

    2008

  1. F. Sabrina, J.-M. Valin, Adaptive Rate Control for Aggregated VoIP Traffic, Accepted for Globecom 2008.
  2. J.-M. Valin, Perceptually-Motivated Nonlinear Channel Decorrelation For Stereo Acoustic Echo Cancellation, Proc. Joint Workshop on Hands­free Speech Communication and Microphone Arrays (HSCMA), 2008.
  3. S. Brière, J.-M. Valin, F. Michaud, Dominic Létourneau, Embedded Auditory System for Small Mobile Robots, Proc. International Conference on Robotics and Automation (ICRA), 2008.
  4. H. G. Okuno, S. Yamamoto, K. Nakadai, J.-M. Valin, K. Komatani, T. Ogata, "A Portable Robot Audition Software System for Multiple Simultaneous Speech Signals", Proc. Acoustics'08.
  5. 2007

  6. J.-M. Valin, D. V. Smith, C. Montgomery, T. B. Terriberry, Low-Complexity Iterative Sinusoidal Parameter Estimation, Proc. International Conference on Signal Processing and Communication Systems (ICSPCS), pp. 276-283, 2007.
  7. J.-M. Valin, I.B. Collings, A New Robust Frequency Domain Echo Canceller With Closed-Loop Learning Rate Adaptation, Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2007.
  8. S. Yamamoto, K. Nakadai, M. Nakano, H. Tsujino, J.-M. Valin, K. Komatani, T. Ogata, H. G. Okuno, Design and Implementation of a Robot Audition System for Automatic Speech Recognition of Simultaneous Speech, Proc. ASRU, 2007.
  9. 2006

  10. J.-M. Valin, F. Michaud, J. Rouat, Robust 3D Localization and Tracking of Sound Sources Using Beamforming and Particle Filtering. Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 841-844, 2006.
  11. J.-M. Valin, C. Montgomery, Improved Noise Weighting in CELP Coding of Speech - Applying the Vorbis Psychoacoustic Model To Speex. Proc. of the 120th AES Convention, 2006.
  12. J.-M. Valin, Channel Decorrelation For Stereo Acoustic Echo Cancellation In High-Quality Audio Communication. Proc. Workshop on the Internet, Telecommunications and Signal Processing (WITSP), 2006.
  13. S. Yamamoto, K. Nakadai, M. Nakano, H. Tsujino, J.-M. Valin, K. Komatani, T. Ogata, H. G. Okuno, Real-Time Robot Audition System That Recognizes Simultaneous Speech in the Real World. Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2006.
  14. S. Yamamoto, R. Takeda, K. Nakadai, M. Nakano, H. Tsujino, J.-M. Valin, K. Komatani, T. Ogata, H. G. Okuno, Recognition of Simultaneous Speech by Estimating Reliability of Separated Signals for Robot Audition. Proc. 9th Biennial Pacific Rim International Conference on Artificial Intelligence (PRICAI), pp. 484-494, 2006.
  15. S. Yamamoto, K. Nakadai, M. Nakano, H. Tsujino, J.-M. Valin, R. Takeda, K. Komatani, T. Ogata, H. G. Okuno, Genetic Algorithm based Improvement of Robot's Hearing Capabilities in Separating and Recognizing Simultaneous Speech Signals. Proc. Nineteenth International Conference on Industrial, Engineering and Other Applications of Applied Intelligence Systems (IEA/AIE), pp.207-217, 2006.
  16. S. Briere, D. Letourneau, M. Frechette, J.-M. Valin, F. Michaud, Embedded and integration audition for a mobile robot. Proceedings AAAI Fall Symposium Workshop Aurally Informed Performance: Integrating Machine Listening and Auditory Presentation in Robotic Systems, FS-06-01, 6-10, 2006
  17. S. Yamamoto, R. Takeda, K. Nakadai, M. Nakano, H. Tsujino, J.-M. Valin, K. Komatani, T. Ogata, H. G. Okuno, Leak Energy based Missing Feature Mask Generation for ICA and GSS and Its Evaluation with Simultaneous Speech Recognition, Proceedings of ISCA Tutorial and Research Workshop on Statistical and Perceptual Audition (SAPA), pp.42-46, 2006.
  18. 2005

  19. S. Yamamoto, K. Nakadai, J.-M. Valin, J. Rouat, F. Michaud, K. Komatani, T. Ogata, H. G. Okuno, Making a robot recognize three simultaneous sentences in real-time. Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2005.
  20. M. Murase, S. Yamamoto, J.-M. Valin, K. Nakadai, K. Yamada, K. Komatani, T. Ogata, H. G. Okuno, Multiple Moving Speaker Tracking by Microphone Array on Mobile Robot. Proc. European Conference on Speech Communication and Technology (Interspeech), 2005.
  21. F. Michaud, Y. Brosseau, C. Côté, D. Létourneau, P. Moisan, A. Ponchon, C. Raïevsky, J.-M. Valin, E. Beaudry, F. Kabanza, Modularity and Integration in the Design of a Socially Interactive Robot. Proc. International Workshop on Robot and Human Interactive Communication, pp. 172-177, 2005.
  22. S. Yamamoto, J.-M. Valin, K. Nakadai, J. Rouat, F. Michaud, T. Ogata, H. G. Okuno, Enhanced Robot Speech Recognition Based on Microphone Array Source Separation and Missing Feature Theory. Proc. International Conference on Robotics and Automation (ICRA), 2005.
  23. F. Michaud, D. Létourneau, P. Lepage, Y. Morin, F. Gagnon, P. Giguère, É. Beaudry, Y. Brosseau, C. Côté, A. Duquette, F.-F. Laplante, M.-A. Legault, P. Moisan, A. Ponchon, C. Raïevsky, M.-A. Roux, T. Salter, J.-M. Valin, S. Caron, P. Frenette, P. Masson, F. Kabanza, M. Lauria, Socially interactive robots for real life use, Proceedings Workshop on Mobile Robot Competition, American Association for Artificial Intelligence Conference (AAAI), 2005.
  24. F. Michaud, D. Letourneau, P. Lepage, Y. Morin, F. Gagnon, P. Gigere, E. Beaudry, Y. Brosseau, C. Côté, A. Duquette, J.-F. Laplante, M.-A. Legault, P. Moisan, A. Ponchon, C. Raïevsky, M.-A. Roux, T. Salter, J.-M. Valin, S. Caron, P. Masson, F. Kabanza, M. Lauria, A brochette of socially interactive robots. Proc. American Association for Artificial Intelligence Conference, pp. 1733-1734, 2005.
  25. 2004

  26. J.-M. Valin, J. Rouat, F. Michaud, Enhanced Robot Audition Based on Microphone Array Source Separation with Post-Filter. Proc. IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 2123-2128, 2004.
  27. C. Côté, D. Létourneau, F. Michaud, J.-M. Valin, Y. Brosseau, C. Raievsky, M. Lemay, V. Tran, Code Reusability Tools for Programming Mobile Robots, Proc. IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 1820-1825, 2004.
  28. J.-M. Valin, J. Rouat, F. Michaud, Microphone Array Post-Filter for Separation of Simultaneous Non-Stationary Sources. Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 221-224, 2004.
  29. J.-M. Valin, F. Michaud, B. Hadjou, J. Rouat, Localization of Simultaneous Moving Sound Sources for Mobile Robot Using a Frequency-Domain Steered Beamformer Approach. Proc. IEEE International Conference on Robotics and Automation (ICRA), pp. 1033-1038, 2004.
  30. M. Lemay, F. Michaud, D. Letourneau, J.-M. Valin, Autonomous Initialization of Robot Formation. Proc. IEEE International Conference on Robotics and Automation (ICRA), pp. 3018-3023, 2004.
  31. 2003

  32. J.-M. Valin, F. Michaud, J. Rouat, D. Létourneau, Robust Sound Source Localization Using a Microphone Array on a Mobile Robot. Proc. IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 1228-1233, 2003.
  33. D. Létourneau, F. Michaud, J.-M. Valin, C. Proulx, Textual Message Read by a Mobile Robot. Proc. IEEE/RSJ International Conference on Intelligent Robots and Systems, pp. 2724-2729, 2003.
  34. D. Létourneau, F. Michaud, J.-M. Valin, C. Proulx, Making a Mobile Robot Read Textual Messages. Proc. IEEE International Conference on Systems, Man and Cybernetics, pp. 4236-4241, 2003.
  35. 2002

  36. F. Michaud, D. Létourneau, M. Gilbert, J.-M. Valin, Dynamic Robot Formations Using Directional Visual Perception. Proc. IEEE/RSJ International Conference on Intelligent Robots and Systems, pp. 2740-2745, 2002.
  37. 2000

  38. J.-M. Valin, R. Lefebvre, Bandwidth Extension of Narrowband Speech for Low Bit-Rate Wideband Coding. Proc. IEEE Speech Coding Workshop (SCW), 2000, pp. 130-132.
  39. 1999

  40. J.-M. Valin, D. Stork, Open Mind Speech Recognition. Proc. Automatic Speech Recognition and Understanding Workshop (ARSU), 1999.
  41. S.D. Peters, P. Stubley, J.-M. Valin, On the Limits of Speech Recognition in Noise. Proc. International Conference on Acoustics, Speech and Signal Processing (ICASSP), 1999,  pp. 365-368.

Demo

Other presentations

Fields of interest

Software

CELT (Project lead, main author)

Code-Excited Lapped Transform (CELT) is an open source (BSD-licensed) audio designed to transmit high-quality speech and audio with very low delay (<10 ms).

Speex (Project lead, main author)

Speex is an open source (BSD-licensed) audio codec that is optimised for voice. Unlike other codecs like MP3 and Ogg Vorbis, Speex is specially designed for compressing voice at low bit-rates for applications such as voice over IP (VoIP). It uses the Ogg container format and is meant to be complementary to the Vorbis codec.

FlowDesigner (Core developer)

FlowDesigner is a free (LGPL) "data flow oriented" development environment. It can be use to build complex applications by combining small, reusable building blocks. In some way, it has similarities with Simulink and LabView, although it is not designed (and far) to be a "clone" of any of them. So far, it has been used for tasks like signal and speech processing, neural networks, vector quantization, fuzzy logic and robotics.

ManyEars (Main author)

ManyEars provides FlowDesigner building-blocks for microphone array processing. This includes sound source localisation, tracking and separation.

Education

 
Ph.D.
Electrical Engineering
LABORIUS
Auditory System For a Mobile Robot
University of Sherbrooke
2002-2005
M.Sc.A Electrical Engineering
Speech coding group
Bandwidth extension of narrowband speech
University of Sherbrooke
2000-2001
B.Eng. Electrical Engineering
University of Sherbrooke
1995-1999