Publications

Filter by:

2011

conference
J. Gemmeke, T. Virtanen and A. Hurmalainen. "Exemplar-Based Speech Enhancement and its Application to Noise-Robust Automatic Speech Recognition". Proc. International Workshop on Machine Listening in Multisource Environments (CHiME). 2011. pp. 53-57.
conference
H. Silén, E. Helander and M. Gabbouj. "Prediction of voice aperiodicity based on spectral representations in HMM speech synthesis". Interspeech. 2011. pp. 105 - 108.
conference
J. Gemmeke, A. Hurmalainen, T. Virtanen and S. Yang. "Toward a Practical Implementation of Exemplar-Based Noise Robust ASR". European Signal Processing Conference (EUSIPCO). 2011. pp. 1490-1494.
conference
A. Hurmalainen, J. Gemmeke and T. Virtanen. "Non-negative matrix deconvolution in noise robust speech recognition". Proceedings of International Conference on Audio, Speech and Signal Processing. 2011.
conference
J. Gemmeke, A. Hurmalainen, T. Virtanen and Y. Sun. "Toward A Practical Implementation Of Exemplar-Based Noise Robust ASR". EUSIPCO 2011: 19th European Signal Processing Conference, August 29 - September 2, 2011, Barcelona, Spain. 2011. pp. 1490-1494.
conference
K. Mahkonen, A. Hurmalainen, T. Virtanen and J. Gemmeke. "Mapping Sparse Representation to State Likelihoods in Noise-Robust Automatic Speech Recognition". Speech Science and Technology for Real Life, Conference Proceedings of Interspeech 2011, 27 - 31 August, 2011, Florence, Italy. 2011. pp. 465-468.
conference
A. Hurmalainen, T. Virtanen, J. Gemmeke and K. Mahkonen. "Esimerkkipohjainen meluisan puheen automaattinen tunnistus". Akustiikkapäivät 2011, Tampere, 11.-12.5.2011, Akustinen Seura ry. 2011. pp. 1-5.
conference
T. Heittola, A. Mesaros, T. Virtanen and A. Eronen. "Sound event detection and context recognition". Akustiikkapäivät 2011. 2011. pp. 51-56.
article
V. Popa, J. Nurminen and M. Gabbouj. "A Study of Bilinear Models in Voice Conversion", Journal of Signal and Information Processing, Vol. 2. 2011, pp. 125-139.
conference
T. Mäkinen, S. Kiranyaz and M. Gabbouj. "Content-based Audio Classification using Collective Network of Binary Classifiers". IEEE Workshop on Evolving and Adaptive Intelligent Systems. 2011. pp. 116 - 123.
conference
T. Heittola, A. Mesaros, T. Virtanen and A. Eronen. "Sound Event Detection in Multisource Environments Using Source Separation". CHiME 2011 - Workshop on Machine Listening in Multisource Environments. 2011. pp. 36-40.
conference
A. Mesaros, T. Heittola and A. Klapuri. "Latent Semantic Analysis in Sound Event Detection". European Signal Processing Conference (EUSIPCO-2011). 2011. pp. 1307-1311.
article
J. J. Orti, T. Virtanen, P. Vera-Candeas, N. Ruiz-Reyes and F. J. Canadas-Quesada. "Musical Instrument Sound Multi-Excitation Model for Non-Negative Spectrogram Factorization", . IEEE Journal of Selected Topics in Signal Processing, Vol. 5. 2011.
conference
P. Pertilä, M. Mieskolainen and M. S. Hämäläinen. "Closed-Form Self-Localization of Asynchronous Microphone Arrays". In Proc. The Third Joint Workshop on Hands-free Speech Communication and Microphone Arrays (HSCMA'11). 2011.
conference
H. Kallasjoki, U. Remes, J. Gemmeke, T. Virtanen and K. Palomäki. "Uncertainty measures for improving exemplar-based source separation". 12th Annual Conference of the International Speech Communication Association. 2011.
conference
B. Raj, R. Singh and T. Virtanen. "Phoneme-dependent NMF for speech enhancement in monaural mixtures". In proc. 12th Annual Conference of the International Speech Communication Association. 2011.
conference
J. Nikunen, T. Virtanen and M. Vilermo. "Multichannel audio upmixing based on non-negative tensor factorization representation". IEEE Workshop on Applications of Signal Processing to Audio and Acoustics. 2011.

2010

article
T. Mäkinen and P. Pertilä. "Shooter localization and bullet trajectory, caliber, and speed estimation based on detected firing sounds", Applied Acoustics, Vol. 10, October, 2010, pp. 902–913.
conference
E. Helander, H. Silén, J. Miguez and M. Gabbouj. "Maximum a posteriori voice conversion using sequential Monte Carlo methods". Interspeech. 2010.
conference
H. Silén, E. Helander, J. Nurminen and M. Gabbouj. "Analysis of Duration Prediction Accuracy in HMM-Based Speech Synthesis". The Fifth International Conference on Speech Prosody. 2010.
article
A. Eronen and A. Klapuri. "Music Tempo Estimation with k-NN regression", IEEE Trans. Audio, Speech and Language Processing, Vol. 18, January, 2010, pp. 50-57.
conference
J. Gemmeke and T. Virtanen. "Artificial and online acquired noise dictionaries for noise robust ASR". Proceedings of the 11th Annual Conference of the International Speech Communication Association, Makuhari, Chiba, Japan, September 26-30, 2010. 2010. pp. 2082-2085.
conference
S. Tervo and T. Korhonen. "Estimation of reflective surfaces from continuous signals". Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing ICASSP, Dallas, Texas, USA, March 14-19, 2010. 2010. pp. 153-156.
conference
T. Heittola, A. Mesaros, A. Eronen and T. Virtanen. "Audio context recognition using audio event histograms". In Proc. European Signal Processing Conference. 2010.
conference
A. Mesaros, T. Heittola, A. Eronen and T. Virtanen. "Acoustic event detection in real life recordings". In Proc. European Signal Processing Conference. 2010. pp. 1267-1271.
conference
H. Silén, E. Helander, J. Nurminen, K. Koppinen and M. Gabbouj. "Using Robust Viterbi Algorithm and HMM-Modeling in Unit Selection TTS to Replace Units of Poor Quality". Interspeech 2010. 2010.
conference
T. Virtanen, J. Gemmeke and A. Hurmalainen. "State-based labelling for a sparse representation of speech and its application to robust speech recognition". Interspeech 2010. 2010.
conference
J. Gemmeke and T. Virtanen. "Artificial and online acquired noise dictionaries for noise robust ASR". Interspeech 2010. 2010.
article
E. Helander, T. Virtanen, J. Nurminen and M. Gabbouj. "Voice Conversion Using Partial Least Squares Regression", IEEE Transactions on Audio, Speech, and Language Processing. 2010.
conference
A. Klapuri, T. Virtanen and T. Heittola. "Sound source separation in monaural music signals using excitation-filter model and EM algorithm". IEEE International Conference on Acoustics, Speech, and Signal Processing. 2010.
conference
P. Pertilä and M. S. Hämäläinen. "A Track Before Detect Approach for Sequential Bayesian Tracking of Multiple Speech Sources". In Proc. IEEE 35th International Conference on Acoustics, Speech, and Signal Processing (ICASSP). 2010.
conference
B. Raj, T. Virtanen, S. Chaudhure and R. Singh. "Non-negative matrix factorization based compensation of music for automatic speech recognition". Interspeech 2010. 2010.
conference
S. Keronen, U. Remes, K. Palomäki, T. Virtanen and M. Kurimo. "Comparison of Noise Robust Methods in Large Vocabulary Speech Recognition". n Proc. European Signal Processing Conference. 2010.
conference
J. Nikunen and T. Virtanen. "Object-Based Audio Coding Using Non-Negative Matrix Factorization for the Spectrogram Representation". in proc. 128th Audio Engineering Society Convention. 2010.
conference
B. Raj, T. Virtanen, S. Chaudhure and R. Singh. "Non-negative matrix factorization based compensation of music for automatic speech recognition". Proceedings of Interspeech 2010. 2010.
article
A. Klapuri and T. Virtanen. "Representing Musical Sounds with an Interpolating State Model", IEEE Trans. Audio, Speech and Language Processing, Vol. 18. 2010.
article
M. Helén and T. Virtanen. "Audio query by example using similarity measures between probability density functions of features", EURASIP Journal on Audio, Speech and Music Processing, Vol. 2010. 2010.
article
A. Mesaros and T. Virtanen. "Automatic recognition of lyrics in singing", EURASIP Journal on Audio, Speech and Music Processing, Vol. 2010. 2010.
conference
J. Nikunen and T. Virtanen. "Noise-to-Mask Ratio Minimization by Weighted Non-negative Matrix Factorization". IEEE International Conference on Acoustics, Speech, and Signal Processing. 2010.
conference
A. Mesaros and T. Virtanen. "Recognition of phonemes and words in singing". proc. of the 35th International Conference on Acoustics, Speech, and Signal Processing. 2010.
conference
J. Gemmeke and T. Virtanen. "Noise robust exemplar-based connected digit recognition". proc. of the 35th International Conference on Acoustics, Speech, and Signal Processing (ICASSP). 2010.

2009

conference
T. Mäkinen, P. Pertilä and P. Auranen. "Supersonic bullet state estimation using particle filtering". Proceedings of 2009 IEEE International Conference on Signal and Image Processing Applications, ICSIPA. 2009.
conference
J. Paulus and A. Klapuri. "Music structure analysis with a probabilistic fitness function in MIREX2009". Proc. of the Fifth Annual Music Information Retrieval Evaluation eXchange. 2009.
conference
H. Silén, E. Helander, J. Nurminen and M. Gabbouj. "Parameterization of vocal fry in HMM-based speech synthesis". Proceedings of the 10th Annual Conference of the International Speech Communication Associationa, Interspeech. 2009. pp. 1775-1778.
conference
A. Löytynoja and P. Pertilä. "A real-time talker localization implementation using multi-PHAT and particle filter". Proceedings of the 17th European Signal Processing Conference, Eusipco. 2009. pp. 1418-1422.
article
J. Paulus and A. Klapuri. "Music Structure Analysis Using a Probabilistic Fitness Measure and a Greedy Search Algorithm", IEEE Transactions on Audio, Speech, and Language Processing, Vol. 17, Aug, 2009, pp. 1159-1170.
conference
D. D. Alves, J. Paulus and J. Fonseca. "Drum transcription from multichannel recordings with non-negative matrix factorization". Proc. of the 17th European Signal Processing Conference. 2009. pp. 894-898.
conference
M. Parviainen. "Robust self-localization solutions for meeting room environments". Proceedings of the 13th IEEE International Symposium on consumer Electronics, ISCE 2009, Kyoto, Japan, 25-28 May 2009. 2009. pp. 237-240.
conference
V. Popa, J. Nurminen and M. Gabbouj. "A novel technique for voice conversion based on style and content decomposition with bilinear models". Proceedings of the 10th Annual Conference of the International Speech Communication Associationa, Interspeech 2009, Brighton, UK, 6-10 September 2009. 2009. pp. 2655-2658.
conference
M. Helén, T. Lahti and A. Klapuri. "Tools for automatic audio management". Open Information Management: applications of interconnectivity and collaboration. S. Niiranen ed. 2009. pp. 244-265.
conference
T. Virtanen and A. T. Cemgil. "Mixtures of Gamma Priors for Non-Negative Matrix Factorization Based Speech Separation". ICA. 2009.
conference
A. Mesaros and T. Virtanen. "Adaptation of a speech recognizer for singing voice". Proceedings of the 17th European Signal Processing Conference. 2009. pp. 1779-1783.
conference
T. Virtanen and T. Heittola. "Interpolating Hidden Markov Model and Its Application to Automatic Instrument Recognition". ICASSP. 2009.
conference
T. Heittola, A. Klapuri and T. Virtanen. "Musical Instrument Recognition in Polyphonic Audio Using Source-Filter Model for Sound Separation". in Proc. 10th Int. Society for Music Information Retrieval Conf. (ISMIR 2009). 2009. pp. 327-332.
conference
A. Klapuri. "A classification approach to multipitch analysis". 6th Sound and Music Computing Conference. 2009.
conference
A. Klapuri. "A method for visualizing the pitch content of polyphonic music signals". Proc. 10th Int. Society for Music Information Retrieval Conf. (ISMIR 2009). 2009.
conference
M. Myllymäki and T. Virtanen. "Non-Stationary Noise Model Compensation in Voice Activity Detection". EUSIPCO. 2009.
conference
T. Virtanen. "Spectral Covariance in Prior Distributions of Non-Negative Matrix Factorization Based Speech Separation". EUSIPCO. 2009.
article
J. Paulus and A. Klapuri. "Drum sound detection in polyphonic music with hidden Markov models", EURASIP Journal on Audio, Speech, and Music Processing, Vol. 2009. 2009.
incollection
J. Paulus and A. Klapuri. "Labelling the Structural Parts of a Music Piece with Markov Models". Ystad, Sølvi, Kronland-Martinet, Richard, Jensen and Kristoffer eds. Springer Berlin / Heidelberg. 2009. pp. 166-176.

2008

conference
P. Pertilä. "Array steered response time-alignment for propagation delay compensation for acoustic localization". Proceedings of the Forty-Second Asilomar Conference on Signals, Systems and Computers. 2008. pp. 298-302.
conference
T. Lahti et al.. "On Enabling Techniques for Personal Audio Content Management". ACM International Conference on Multimedia Information Retrieval (MIR 2008). 2008.
conference
E. Helander, J. Schwarz, J. Nurminen, H. Silén and M. Gabbouj. "On the impact of alignment on voice conversion performance". roceedings of the 9th Annual Conference of the International Speech Communication Associationa, Interspeech. 2008. pp. 1453-1456.
conference
H. Silén, E. Helander, J. Nurminen and M. Gabbouj. "Evaluation of Finnish unit selection and HMM-based speech synthesis". Proceedings of the 9th Annual Conference of the International Speech Communication Associationa, Interspeech. 2008. pp. 1853-1856.
conference
J. Paulus and A. Klapuri. "Music Structure Analysis Using a Probabilistic Fitness Measure And an Integrated Musicological Model". Proc. of the Ninth International Conference on Music Information Retrieval. 2008.
conference
J. Paulus and A. Klapuri. "Acoustic Features for Music Piece Structure Analysis". Proc. of the 11th International Conference on Digital Audio Effects. 2008. pp. 309-312.
conference
M. Ryynänen, T. Virtanen, J. Paulus and A. Klapuri. "Accompaniment separation and karaoke application based on automatic melody transcription". EEE International Conf. on Multimedia and Expo. 2008.
conference
J. Paulus and A. Klapuri. "Labelling the Structural Parts of a Music Piece with Markov Models". Proc. of the 2008 Computers in Music Modeling and Retrieval Conference. Jensen and Kristoffer eds. 2008. pp. 137-147.
conference
M. Ryynänen and A. Klapuri. "Query by Humming of MIDI and Audio Using Locality Sensitive Hashing". Proc. 2008 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'08). 2008.
conference
E. Helander, J. Nurminen and M. Gabbouj. "LSF mapping for voice conversion with very small training sets". Proceedings of 2008 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP. 2008. pp. 4669-4672.
article
A. Klapuri. "Multipitch analysis of polyphonic music and speech signals using an auditory model", IEEE Trans. Audio, Speech and Language Processing, Vol. 16, Feb, 2008, pp. 255-266.
article
E. B. Bilcu and J. Astola. "A hybrid approach to bilingual text-to-phoneme mapping", Facta Universitatis, Series: Electronics and Energetics, Vol. 21. 2008, pp. 91-105.
conference
A. Klapuri and T. Virtanen. "Automatic music transcription". , Handbook of Signal Processing in Acoustics. D. Havelock ed. 2008. pp. 277-303.
conference
T. Pirinen. "An experimental comparison of time delay weights for direction of arrival estimation". Proceedings of the 11th International Conference on Digital Audio Effects (DAFx-08), Espoo, Finland, 1-4 September 2008. J. Pakarinen ed. 2008. pp. 4 p.
article
T. Pirinen. "A confidence statistic and an outlier detector for difference estimates in sensor arrays", IEEE Sensors Journal, Vol. 8. 2008, pp. 2008-2015.
conference
T. Virtanen, A. T. Cemgil and S. Godsill. "Bayesian Extensions to Non-negative Matrix Factorisation for Audio Signal Modelling". ICASSP 2008. 2008.
article
P. Pertilä, T. Korhonen and A. Visa. "Measurement combination for acoustic source localization in a room environment", Eurasip Journal on Audio, Speech, and Music Processing, Vol. 2008. 2008.
conference
T. Korhonen and P. Pertilä. "TUT acoustic source tracking system 2007". Lecture Notes in Computer Science. 2008. pp. 104-112.
article
M. Ryynänen and A. Klapuri. "Automatic Transcription of Melody, Bass Line, and Chords in Polyphonic Music", Computer Music Journal, Vol. 32. 2008.
conference
M. Myllymäki and T. Virtanen. "Voice Activity Detection in the Presence of Breathing Noise Using Neural Network and Hidden Markov Model". EUSIPCO 2008. 2008.
conference
A. Mesaros and T. Virtanen. "Automatic Alignment of Music Audio and Lyrics". DAFx08. 2008.
conference
T. Virtanen, A. Mesaros and M. Ryynänen. "Combining Pitch-Based Inference and Non-Negative Spectrogram Factorization in Separating Vocals from Polyphonic Music". 2008.

2007

conference
E. Helander, H. Silén and M. Gabbouj. "The use of diphone variants in optimal text selection for Finnish unit selection speech synthesis". Proceedings of the 12th International conference Speech and Computer, SPECOM. 2007. pp. 293-298.
conference
E. Helander, J. Nurminen and M. Gabbouj. "Analysis of LSF frame selection in voice conversion". Proceedings of the 12th International conference Speech and Computer, SPECOM. 2007. pp. 651-656.
conference
M. Helén and T. Virtanen. "A Similarity Measure for Audio Query by Example Based on Perceptual Coding and Compression". proc. 10th International Conference on Digital Audio Effects (DAFx-07). 2007.
conference
J. Paulus and A. Klapuri. "Combining temporal and spectral features in HMM-based drum transcription". Proc. of the 8th International Conference on Music Information Retrieval. 2007. pp. 225-228.
conference
E. Helander and J. Nurminen. "On the importance of pure prosody in the perception of speaker identity". Proceedings of the 8th Annual Conference of the International Speech Communication Association, Interspeech. 2007. pp. 2665-2668.
conference
H. Silén, E. Helander, K. Koppinen and M. Gabbouj. "Building a Finnish unit selection TTS system". Proceedings of the 6th ISCA Workshop on Speech Synthesis. 2007. pp. 310-315.
conference
T. Korhonen and P. Pertilä. "TUT acoustic source tracking system 2007". Multimodal Technologies for Perception of Humans, International Evaluation Workshops CLEAR 2007 and RT 2007. 2007. pp. 104-112.
conference
P. Pertilä and M. Parviainen. "Robust speaker localization in meeting room domain". Proceedings of 2007 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP. 2007. pp. 497-500.
conference
E. Helander and J. Nurminen. "A novel method for prosody prediction in voice conversion". IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP. 2007. pp. 509-512.
article
T. Virtanen. "Monaural Sound Source Separation by Non-Negative Matrix Factorization with Temporal Continuity and Sparseness Criteria", IEEE Transactions on Audio, Speech, and Language Processing, Vol. 15, March, 2007.
conference
E. B. Bilcu and J. Astola. "Improved hybrid approach for bilingual language recognition from text". Proceedings of the 5th International Symposium on Image and Signal Processing and Analysis, ISPA 2007, Istanbul, Turkey, 27-29 September 2007. M. Petrou ed. 2007. pp. 190-195.
article
M. McKinney, D. Moelants, M. E. Davies and A. Klapuri. "Evaluation of audio beat tracking and music tempo extraction algorithms", Journal of New Music Research, Vol. 36. 2007, pp. 1-16.
conference
T. Heittola and A. Klapuri. "TUT Acoustic Event Detection System 2007". Multimodal Technologies for Perception of Humans,Joint Proceedings of the CLEAR 2007 and RT 2007 Evaluation Workshops. R. Stiefelhagen, R. Bowers and J. Fiscus eds. 2007. pp. 364-370.
article
T. Pirinen, P. Pertilä and T. Korhonen. "Seinille kasvaa korvat", Prosessori. 2007, pp. 46-47.
conference
P. Pertilä, T. Korhonen, T. Pirinen and M. Parviainen. "TUT acoustic source tracking system 2006". Lecture Notes in Computer Science. 2007. pp. 127-136.
conference
A. Mesaros, T. Virtanen and A. Klapuri. "Singer Identification in Polyphonic Music Using Vocal Separation and Pattern Recognition Methods". International Conference on Music Information Retrieval. 2007.
conference
M. Ryynänen and A. Klapuri. "Automatic bass line transcription from streaming polyphonic audio". IEEE International Conference on Audio, Speech and Signal Processing (ICASSP). 2007.
conference
A. Klapuri. "Analysis of musical instrument sounds by source-filter-decay model". IEEE International Conference on Audio, Speech and Signal Processing (ICASSP). 2007.
Results 201 - 300 of 421