I am now with Technicolor, Cesson Sévigné, France | |
e-mail: |
myFirstName.myLastName@technicolor.com |
Research Interests | Short Bio | C. V. | Demonstrations | Software | Evaluations | Responsibilities | Projects | Collaborators | Publications
Statistical signal processing, machine learning and information theory, including:
Application areas:
Alexey Ozerov holds a Ph.D. in Signal Processing from the University of Rennes 1 (France). He worked towards this degree from 2003 to 2006 in the labs of France Telecom R&D and in collaboration with the IRISA institute. Earlier, he received an M.Sc. degree in Mathematics from the Saint-Petersburg State University (Russia) in 1999 and an M.Sc. degree in Applied Mathematics from the University of Bordeaux 1 (France) in 2003. From 1999 to 2002, Alexey worked at Terayon Communicational Systems (USA) as a R&D software engineer, first in Saint-Petersburg and then in Prague (Czech Republic). He was for one year (2007) in Sound and Image Processing Lab at KTH (Royal Institute of Technology), Stockholm, Sweden, for one year and half (2008-2009) in TELECOM ParisTech / CNRS LTCI - Signal and Image Processing (TSI) Department, and for two years (2009 - 2011) with METISS team of IRISA / INRIA - Rennes. Now he is with Technicolor R&D departement in Cesson Sévigné, France.
in English: PDF, PostScript       in French: PDF, PostScript
One microphone singing voice separation
One microphone source separation
Multichannel nonnegative matrix factorization for convolutive blind source separation
Factorial scaled hidden Markov model for single channel speech / music separation
SARAH project istrument extraction demos:
Using the FASST source separation toolbox for noise robust speech recognition
Coding-based Informed Source Separation
Multichannel nonnegative matrix factorization toolbox (in Matlab)
BSS Locate - A toolbox for source localization in stereo convolutive audio mixtures (in Matlab)
FASST - Flexible Audio Source Separation Toolbox (in Matlab)
EPFL, Lausanne, Switzerland
| Télécom ParisTech, France
| ESPCI ParisTech, France
| IRISA, Rennes, France
| IRISA, Rennes, France
| Yacast, Paris, France
| Télécom ParisTech, France
| Télécom ParisTech, France
| EPFL, Lausanne, Switzerland
| Télécom ParisTech, France
| Télécom ParisTech, France
| IRISA, Rennes, France
| IRISA, Rennes, France
| Delft University of Technology, The Netherlands
| Victoria University of Wellington, New Zealand
| KTH, Stockholm, Sweden
| IRCAM, Paris, France
| KTH, Stockholm, Sweden
| Télécom ParisTech, France
| Télécom ParisTech, France
| Orange Labs, Rennes, France
| Technological Educational Institute of Crete, Greece
| Télécom ParisTech, France
| IRISA, Rennes, France
| |
IEEE Copyright declimer conserning all IEEE papers reprints posted below: Copyright © 2005-2011 IEEE. This material is posted here with permission of the IEEE. Internal or personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution must be obtained from the IEEE by writing to pubs-permissions@ieee.org. By choosing to view these documents, you agree to all provisions of the copyright laws protecting it.
M. Li, J. Klejsa, A. Ozerov and W. B. Kleijn, "Audio Coding with Power Spectral Density Preserving Quantization," IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'12), Kyoto, Japan, March, 2012. (submitted)
M. Li, A. Ozerov, J. Klejsa and W. B. Kleijn, "Asymptotically optimal distribution preserving quantization for stationary Gaussian processes," IEEE Transactions on Communications (submitted)
S. Arberet, A. Ozerov, F. Bimbot and R. Gribonval, "A tractable framework for estimating and combining spectral source models for audio source separation," Signal Processing, special issue on "Latent Variable Analysis and Signal Separation" (submitted)
Research report: HAL
E. Vincent, S. Araki, F. Theis, G. Nolte, P. Bofill, H Sawada, A Ozerov, V. Gowreesunker, D. Lutter, N.Q.K. Duong, "The Signal Separation Evaluation Campaign (2007-2010): Achievements and remaining challenges," Signal Processing, special issue on "Latent Variable Analysis and Signal Separation" (to appear)
Article: HAL
C. Blandin, A. Ozerov and E. Vincent, "Multi-source TDOA estimation in reverberant audio using angular spectra and clustering," Signal Processing, special issue on "Latent Variable Analysis and Signal Separation" (to appear)
A. Ozerov, E. Vincent and F. Bimbot, "A general flexible framework for the handling of prior information in audio source separation," IEEE Trans. on Audio, Speech and Lang. Proc. (to appear)
Article: HAL,       Code and Audio Examples
A. Ozerov and W. B. Kleijn, "Asymptotically optimal model estimation for quantization," IEEE Transactions on Communications, vol. 59, no. 4, pp. 1031-1042 , April 2011.
Article: PDF
A. Ozerov and C. Févotte, "Multichannel nonnegative matrix factorization in convolutive mixtures for audio source separation," IEEE Trans. on Audio, Speech and Lang. Proc. special issue on Signal Models and Representations of Musical and Environmental Sounds, vol. 18, no. 3, pp. 550-563, March 2010.
Article: PDF,       Audio Examples,       Code
A. Ozerov, P. Philippe, F. Bimbot and R. Gribonval, "Adaptation of Bayesian models for single channel source separation and its application to voice / music separation in popular songs," IEEE Trans. on Audio, Speech and Lang. Proc., special issue on Blind Signal Proc. for Speech and Audio Applications, vol. 15, no. 5, pp. 1564-1578, July 2007.
Article: PDF       Audio Examples,
A. Ozerov, R. Gribonval, P. Philippe and F. Bimbot, "Choix et adaptation de modèles statistiques pour la séparation de voix chantée à partir d'un seul microphone," Traitement du signal, vol. 24, no. 3, pp. 211-224, 2007.
A. Ozerov, A. Liutkus, R. Badeau and G. Richard, "Informed source separation: source coding meets source separation," In IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA'11), Mohonk, NY, Oct. 16-19, 2011.
Article: PDF,       Audio Examples
A. Ozerov, M. Lagrange and E. Vincent, "GMM-based classification from noisy features," International Workshop on Machine Listening in Multisource Environments (CHiME 2011), pages 30-35, Florence, Italy, September, 2011.
A. Ozerov and E. Vincent, "Using the FASST source separation toolbox for noise robust speech recognition," International Workshop on Machine Listening in Multisource Environments (CHiME 2011), pages 86-87, Florence, Italy, September, 2011.
Article: PDF, Poster: PDF,       Audio Examples
A. Ozerov, C. Févotte, R. Blouet and J.-L. Durrieu, "Multichannel nonnegative tensor factorization with structured constraints for user-guided audio source separation," IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'11), pages 257-260, Prague, May, 2011.
Article: PDF, Poster: PDF,       Audio Examples
C. Blandin, E. Vincent and A. Ozerov, "Multi-source TDOA estimation using SNR-based angular spectra," IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'11), pages 2616 - 2619, Prague, May, 2011.
A. Ozerov, E. Vincent and F. Bimbot, "A general modular framework for audio source separation", In 9th International Conference on Latent Variable Analysis and Signal Separation (LVA/ICA'10), pages 33 - 40, Saint-Malo, France, Sep. 27-30, 2010.
S. Araki, A. Ozerov, V. Gowreesunker, H. Sawada, F. Theis, G. Nolte, D. Lutter and N.Q.K. Duong, "The 2010 Signal Separation Evaluation Campaign (SiSEC2010): - Audio source separation -", In 9th International Conference on Latent Variable Analysis and Signal Separation (LVA/ICA'10), pages 114 - 122, Saint-Malo, France, Sep. 27-30, 2010.
Article: PDF
S. Araki, F. Theis, G. Nolte, D. Lutter, A. Ozerov, V. Gowreesunker, H. Sawada and N.Q.K. Duong, "The 2010 Signal Separation Evaluation Campaign (SiSEC2010): - Biomedical source separation -", In 9th International Conference on Latent Variable Analysis and Signal Separation (LVA/ICA'10), pages 123 - 130, Saint-Malo, France, Sep. 27-30, 2010.
Article: PDF
C. Févotte and A. Ozerov, "Notes on nonnegative tensor factorization of the spectrogram for audio source separation : statistical insights and towards self-clustering of the spatial cues", In 7th International Symposium on Computer Music Modeling and Retrieval (CMMR 2010), 2010.
Article: PDF,       Audio Examples,       Code
S. Arberet, A. Ozerov, N.Q.K. Duong, E. Vincent, R. Gribonval, F. Bimbot and P. Vandergheynst, "Nonnegative matrix factorization and spatial covariance model for under-determined reverberant audio source separation", In 10th International Conference on Information Sciences, Signal Processing and their applications (ISSPA 2010), 2010.
Article: PDF
A. Ozerov, C. Févotte and M. Charbit, "Factorial scaled hidden Markov model for polyphonic audio representation and source separation", In IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA'09), Mohonk, NY, Oct. 18-21, 2009.
Article: PDF, Slides: PDF,       Audio Examples
J.-L. Durrieu, A. Ozerov, C. Févotte, G. Richard and B. David, "Main instrument separation from stereophonic audio signals using a source/filter model", In EUSIPCO, 17th European Signal Processing Conference, Glasgow, Scotland, August 24-28, 2009.
Article: PDF,       Audio Examples
A. Ozerov and C. Févotte, "Multichannel nonnegative matrix factorization in convolutive mixtures. With application to blind audio source separation", In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'09), pages 3137-3140, Taipei, Taiwan, April 19-24, 2009.
Article: PDF, Poster: PDF,       Audio Examples,       Code
A. Ozerov and W. B. Kleijn, "Optimal parameter estimation for model-based quantization," In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'09), pages 2497-2500, Taipei, Taiwan, April 19-24, 2009.
S. Arberet, A. Ozerov, R. Gribonval and F. Bimbot, "Blind spectral-GMM estimation for underdetermined instantaneous audio source separation", In Proc. Int. Conf. on Independent Component Analysis and Blind Source Separation (ICA'09), pages 751-758, Paraty, Brazil, March 15-18, 2009.
Article: PDF
I. Potamitis and A. Ozerov, "Single channel source separation using static and dynamic features in the power domain", In EUSIPCO, 16th European Signal Processing Conference, Laussane, Switzerland, August 25-29, 2008.
Article: PDF,       Audio Examples
A. Ozerov and W. B. Kleijn, "Flexible quantization of audio and speech based on the autoregressive model," In IEEE Asilomar Conference on Signals, Systems, and Computers (Asilomar CSSC'07), pages 535-539, Pacific Grove, CA, Nov. 4-7, 2007.
R. Heusdens, W. B. Kleijn and A. Ozerov, "Entropy-constrained high-resolution lattice vector quantization using a perceptually relevant distortion measure," In IEEE Asilomar Conference on Signals, Systems, and Computers (Asilomar CSSC'07), pages 2075-2079, Pacific Grove, CA, Nov. 4-7, 2007.
Article: PDF
W. B. Kleijn and A. Ozerov, "Rate distribution between model and signal," In IEEE Worksh. on Apps. of Signal Processing to Audio and Acoustics (WASPAA'07), pages 243-246, Mohonk, NY, Oct. 2007.
Article: PDF
A. Ozerov, P. Philippe, R. Gribonval and F. Bimbot, "One microphone singing voice separation using source-adapted models," In IEEE Worksh. on Apps. of Signal Processing to Audio and Acoustics (WASPAA'05), pages 90-93, Mohonk, NY, Oct. 2005.
Article: PDF, Slides: PDF,       Audio Examples
A. Ozerov, R. Gribonval, P. Philippe and F. Bimbot, "Séparation voix / musique à partir d'enregistrements mono : quelques remarques sur le choix et l'adaptation des modèles," In GRETSI'05 Symposium on Signal and Image Processing, Louvain-la-Neuve, Belgique, Sept. 2005.
abstract in English: HTML, full text in French: PDF, PostScript,       Audio Examples
G. Gravier, L. Benaroya, A. Ozerov, R. Gribonval and F. Bimbot, "Séparation de sources à partir d'un seul capteur pour la reconnaissance robuste de la parole," In Journées d'Etude sur la Parole (JEP'04), April 2004.
A. Ozerov, C. Févotte and R. Blouet, "Automatic source separation via joint use of segmental information and spatial diversity" US patent 13021692, 2011 (filled).
S. Arberet, A. Ozerov, R. Gribonval and F. Bimbot, "Procédé et un dispositif d'estimation de signaux de source issus d'un signal de mélange" French patent 2939933, 2010 (published) and international extension WO2010/076412, 2010 (published).
A. Ozerov, S. Essid and M. Charbit, "Reconnaissance des instruments dans la musique polyphonique par décomposition NMF et classification SVM," Technical Report TELECOM ParisTech 2009D014, July 2009.
A. Ozerov. "Adaptation de modèles statistiques pour la séparation de sources mono-capteur. Application à la séparation voix / musique dans les chansons." PhD thesis, University of Rennes 1, 2006.
abstract in English: HTML, full text in French: PDF, PostScript
A. Ozerov. "Représentations robustes pour la reconnaissance automatique de la parole". MSc thesis, DESS "Scientific Calculation and Applications", University of Bordeaux 1, 2003.
abstract in English: HTML, full text in French: PDF, PostScript
A. Ozerov. "A criterion of nondisappearance of invariant sets satisfying Krasovsky property under C0 perturbations of right part of the system". MSc thesis, department of Ordinary Differential Equations, Mathematics and Mechanics faculty, St. Petersburg State University, 1999.
A. Ozerov, C. Févotte and R. Blouet, "The SARAH project: Standardization of High-Definition Audio Remastering", Demo presented at IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA'09), Mohonk, NY, Oct. 18-21, 2009.
Poster: PDF