Publications

Filter by:

2010

conference
P. Pertilä and M. S. Hämäläinen. "A Track Before Detect Approach for Sequential Bayesian Tracking of Multiple Speech Sources". In Proc. IEEE 35th International Conference on Acoustics, Speech, and Signal Processing (ICASSP). 2010.
conference
B. Raj, T. Virtanen, S. Chaudhure and R. Singh. "Non-negative matrix factorization based compensation of music for automatic speech recognition". Interspeech 2010. 2010.
conference
S. Keronen, U. Remes, K. Palomäki, T. Virtanen and M. Kurimo. "Comparison of Noise Robust Methods in Large Vocabulary Speech Recognition". n Proc. European Signal Processing Conference. 2010.
conference
J. Nikunen and T. Virtanen. "Object-Based Audio Coding Using Non-Negative Matrix Factorization for the Spectrogram Representation". in proc. 128th Audio Engineering Society Convention. 2010.
conference
B. Raj, T. Virtanen, S. Chaudhure and R. Singh. "Non-negative matrix factorization based compensation of music for automatic speech recognition". Proceedings of Interspeech 2010. 2010.
article
A. Klapuri and T. Virtanen. "Representing Musical Sounds with an Interpolating State Model", IEEE Trans. Audio, Speech and Language Processing, Vol. 18. 2010.
article
M. Helén and T. Virtanen. "Audio query by example using similarity measures between probability density functions of features", EURASIP Journal on Audio, Speech and Music Processing, Vol. 2010. 2010.
article
A. Mesaros and T. Virtanen. "Automatic recognition of lyrics in singing", EURASIP Journal on Audio, Speech and Music Processing, Vol. 2010. 2010.
conference
J. Nikunen and T. Virtanen. "Noise-to-Mask Ratio Minimization by Weighted Non-negative Matrix Factorization". IEEE International Conference on Acoustics, Speech, and Signal Processing. 2010.
conference
A. Mesaros and T. Virtanen. "Recognition of phonemes and words in singing". proc. of the 35th International Conference on Acoustics, Speech, and Signal Processing. 2010.
conference
J. Gemmeke and T. Virtanen. "Noise robust exemplar-based connected digit recognition". proc. of the 35th International Conference on Acoustics, Speech, and Signal Processing (ICASSP). 2010.

2009

conference
T. Mäkinen, P. Pertilä and P. Auranen. "Supersonic bullet state estimation using particle filtering". Proceedings of 2009 IEEE International Conference on Signal and Image Processing Applications, ICSIPA. 2009.
conference
J. Paulus and A. Klapuri. "Music structure analysis with a probabilistic fitness function in MIREX2009". Proc. of the Fifth Annual Music Information Retrieval Evaluation eXchange. 2009.
conference
H. Silén, E. Helander, J. Nurminen and M. Gabbouj. "Parameterization of vocal fry in HMM-based speech synthesis". Proceedings of the 10th Annual Conference of the International Speech Communication Associationa, Interspeech. 2009. pp. 1775-1778.
conference
A. Löytynoja and P. Pertilä. "A real-time talker localization implementation using multi-PHAT and particle filter". Proceedings of the 17th European Signal Processing Conference, Eusipco. 2009. pp. 1418-1422.
article
J. Paulus and A. Klapuri. "Music Structure Analysis Using a Probabilistic Fitness Measure and a Greedy Search Algorithm", IEEE Transactions on Audio, Speech, and Language Processing, Vol. 17, Aug, 2009, pp. 1159-1170.
conference
D. D. Alves, J. Paulus and J. Fonseca. "Drum transcription from multichannel recordings with non-negative matrix factorization". Proc. of the 17th European Signal Processing Conference. 2009. pp. 894-898.
conference
M. Parviainen. "Robust self-localization solutions for meeting room environments". Proceedings of the 13th IEEE International Symposium on consumer Electronics, ISCE 2009, Kyoto, Japan, 25-28 May 2009. 2009. pp. 237-240.
conference
V. Popa, J. Nurminen and M. Gabbouj. "A novel technique for voice conversion based on style and content decomposition with bilinear models". Proceedings of the 10th Annual Conference of the International Speech Communication Associationa, Interspeech 2009, Brighton, UK, 6-10 September 2009. 2009. pp. 2655-2658.
conference
M. Helén, T. Lahti and A. Klapuri. "Tools for automatic audio management". Open Information Management: applications of interconnectivity and collaboration. S. Niiranen ed. 2009. pp. 244-265.
conference
T. Virtanen and A. T. Cemgil. "Mixtures of Gamma Priors for Non-Negative Matrix Factorization Based Speech Separation". ICA. 2009.
conference
A. Mesaros and T. Virtanen. "Adaptation of a speech recognizer for singing voice". Proceedings of the 17th European Signal Processing Conference. 2009. pp. 1779-1783.
conference
T. Virtanen and T. Heittola. "Interpolating Hidden Markov Model and Its Application to Automatic Instrument Recognition". ICASSP. 2009.
conference
T. Heittola, A. Klapuri and T. Virtanen. "Musical Instrument Recognition in Polyphonic Audio Using Source-Filter Model for Sound Separation". in Proc. 10th Int. Society for Music Information Retrieval Conf. (ISMIR 2009). 2009. pp. 327-332.
conference
A. Klapuri. "A classification approach to multipitch analysis". 6th Sound and Music Computing Conference. 2009.
conference
A. Klapuri. "A method for visualizing the pitch content of polyphonic music signals". Proc. 10th Int. Society for Music Information Retrieval Conf. (ISMIR 2009). 2009.
conference
M. Myllymäki and T. Virtanen. "Non-Stationary Noise Model Compensation in Voice Activity Detection". EUSIPCO. 2009.
conference
T. Virtanen. "Spectral Covariance in Prior Distributions of Non-Negative Matrix Factorization Based Speech Separation". EUSIPCO. 2009.
article
J. Paulus and A. Klapuri. "Drum sound detection in polyphonic music with hidden Markov models", EURASIP Journal on Audio, Speech, and Music Processing, Vol. 2009. 2009.
incollection
J. Paulus and A. Klapuri. "Labelling the Structural Parts of a Music Piece with Markov Models". Ystad, Sølvi, Kronland-Martinet, Richard, Jensen and Kristoffer eds. Springer Berlin / Heidelberg. 2009. pp. 166-176.

2008

conference
P. Pertilä. "Array steered response time-alignment for propagation delay compensation for acoustic localization". Proceedings of the Forty-Second Asilomar Conference on Signals, Systems and Computers. 2008. pp. 298-302.
conference
T. Lahti et al.. "On Enabling Techniques for Personal Audio Content Management". ACM International Conference on Multimedia Information Retrieval (MIR 2008). 2008.
conference
E. Helander, J. Schwarz, J. Nurminen, H. Silén and M. Gabbouj. "On the impact of alignment on voice conversion performance". roceedings of the 9th Annual Conference of the International Speech Communication Associationa, Interspeech. 2008. pp. 1453-1456.
conference
H. Silén, E. Helander, J. Nurminen and M. Gabbouj. "Evaluation of Finnish unit selection and HMM-based speech synthesis". Proceedings of the 9th Annual Conference of the International Speech Communication Associationa, Interspeech. 2008. pp. 1853-1856.
conference
J. Paulus and A. Klapuri. "Music Structure Analysis Using a Probabilistic Fitness Measure And an Integrated Musicological Model". Proc. of the Ninth International Conference on Music Information Retrieval. 2008.
conference
J. Paulus and A. Klapuri. "Acoustic Features for Music Piece Structure Analysis". Proc. of the 11th International Conference on Digital Audio Effects. 2008. pp. 309-312.
conference
M. Ryynänen, T. Virtanen, J. Paulus and A. Klapuri. "Accompaniment separation and karaoke application based on automatic melody transcription". EEE International Conf. on Multimedia and Expo. 2008.
conference
J. Paulus and A. Klapuri. "Labelling the Structural Parts of a Music Piece with Markov Models". Proc. of the 2008 Computers in Music Modeling and Retrieval Conference. Jensen and Kristoffer eds. 2008. pp. 137-147.
conference
M. Ryynänen and A. Klapuri. "Query by Humming of MIDI and Audio Using Locality Sensitive Hashing". Proc. 2008 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'08). 2008.
conference
E. Helander, J. Nurminen and M. Gabbouj. "LSF mapping for voice conversion with very small training sets". Proceedings of 2008 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP. 2008. pp. 4669-4672.
article
A. Klapuri. "Multipitch analysis of polyphonic music and speech signals using an auditory model", IEEE Trans. Audio, Speech and Language Processing, Vol. 16, Feb, 2008, pp. 255-266.
article
E. B. Bilcu and J. Astola. "A hybrid approach to bilingual text-to-phoneme mapping", Facta Universitatis, Series: Electronics and Energetics, Vol. 21. 2008, pp. 91-105.
conference
A. Klapuri and T. Virtanen. "Automatic music transcription". , Handbook of Signal Processing in Acoustics. D. Havelock ed. 2008. pp. 277-303.
conference
T. Pirinen. "An experimental comparison of time delay weights for direction of arrival estimation". Proceedings of the 11th International Conference on Digital Audio Effects (DAFx-08), Espoo, Finland, 1-4 September 2008. J. Pakarinen ed. 2008. pp. 4 p.
article
T. Pirinen. "A confidence statistic and an outlier detector for difference estimates in sensor arrays", IEEE Sensors Journal, Vol. 8. 2008, pp. 2008-2015.
conference
T. Virtanen, A. T. Cemgil and S. Godsill. "Bayesian Extensions to Non-negative Matrix Factorisation for Audio Signal Modelling". ICASSP 2008. 2008.
article
P. Pertilä, T. Korhonen and A. Visa. "Measurement combination for acoustic source localization in a room environment", Eurasip Journal on Audio, Speech, and Music Processing, Vol. 2008. 2008.
conference
T. Korhonen and P. Pertilä. "TUT acoustic source tracking system 2007". Lecture Notes in Computer Science. 2008. pp. 104-112.
article
M. Ryynänen and A. Klapuri. "Automatic Transcription of Melody, Bass Line, and Chords in Polyphonic Music", Computer Music Journal, Vol. 32. 2008.
conference
M. Myllymäki and T. Virtanen. "Voice Activity Detection in the Presence of Breathing Noise Using Neural Network and Hidden Markov Model". EUSIPCO 2008. 2008.
conference
A. Mesaros and T. Virtanen. "Automatic Alignment of Music Audio and Lyrics". DAFx08. 2008.
conference
T. Virtanen, A. Mesaros and M. Ryynänen. "Combining Pitch-Based Inference and Non-Negative Spectrogram Factorization in Separating Vocals from Polyphonic Music". 2008.

2007

conference
E. Helander, H. Silén and M. Gabbouj. "The use of diphone variants in optimal text selection for Finnish unit selection speech synthesis". Proceedings of the 12th International conference Speech and Computer, SPECOM. 2007. pp. 293-298.
conference
E. Helander, J. Nurminen and M. Gabbouj. "Analysis of LSF frame selection in voice conversion". Proceedings of the 12th International conference Speech and Computer, SPECOM. 2007. pp. 651-656.
conference
M. Helén and T. Virtanen. "A Similarity Measure for Audio Query by Example Based on Perceptual Coding and Compression". proc. 10th International Conference on Digital Audio Effects (DAFx-07). 2007.
conference
J. Paulus and A. Klapuri. "Combining temporal and spectral features in HMM-based drum transcription". Proc. of the 8th International Conference on Music Information Retrieval. 2007. pp. 225-228.
conference
E. Helander and J. Nurminen. "On the importance of pure prosody in the perception of speaker identity". Proceedings of the 8th Annual Conference of the International Speech Communication Association, Interspeech. 2007. pp. 2665-2668.
conference
H. Silén, E. Helander, K. Koppinen and M. Gabbouj. "Building a Finnish unit selection TTS system". Proceedings of the 6th ISCA Workshop on Speech Synthesis. 2007. pp. 310-315.
conference
T. Korhonen and P. Pertilä. "TUT acoustic source tracking system 2007". Multimodal Technologies for Perception of Humans, International Evaluation Workshops CLEAR 2007 and RT 2007. 2007. pp. 104-112.
conference
P. Pertilä and M. Parviainen. "Robust speaker localization in meeting room domain". Proceedings of 2007 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP. 2007. pp. 497-500.
conference
E. Helander and J. Nurminen. "A novel method for prosody prediction in voice conversion". IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP. 2007. pp. 509-512.
article
T. Virtanen. "Monaural Sound Source Separation by Non-Negative Matrix Factorization with Temporal Continuity and Sparseness Criteria", IEEE Transactions on Audio, Speech, and Language Processing, Vol. 15, March, 2007.
conference
E. B. Bilcu and J. Astola. "Improved hybrid approach for bilingual language recognition from text". Proceedings of the 5th International Symposium on Image and Signal Processing and Analysis, ISPA 2007, Istanbul, Turkey, 27-29 September 2007. M. Petrou ed. 2007. pp. 190-195.
article
M. McKinney, D. Moelants, M. E. Davies and A. Klapuri. "Evaluation of audio beat tracking and music tempo extraction algorithms", Journal of New Music Research, Vol. 36. 2007, pp. 1-16.
conference
T. Heittola and A. Klapuri. "TUT Acoustic Event Detection System 2007". Multimodal Technologies for Perception of Humans,Joint Proceedings of the CLEAR 2007 and RT 2007 Evaluation Workshops. R. Stiefelhagen, R. Bowers and J. Fiscus eds. 2007. pp. 364-370.
article
T. Pirinen, P. Pertilä and T. Korhonen. "Seinille kasvaa korvat", Prosessori. 2007, pp. 46-47.
conference
P. Pertilä, T. Korhonen, T. Pirinen and M. Parviainen. "TUT acoustic source tracking system 2006". Lecture Notes in Computer Science. 2007. pp. 127-136.
conference
A. Mesaros, T. Virtanen and A. Klapuri. "Singer Identification in Polyphonic Music Using Vocal Separation and Pattern Recognition Methods". International Conference on Music Information Retrieval. 2007.
conference
M. Ryynänen and A. Klapuri. "Automatic bass line transcription from streaming polyphonic audio". IEEE International Conference on Audio, Speech and Signal Processing (ICASSP). 2007.
conference
A. Klapuri. "Analysis of musical instrument sounds by source-filter-decay model". IEEE International Conference on Audio, Speech and Signal Processing (ICASSP). 2007.
conference
T. Virtanen and M. Helén. "Probabilistic Model Based Similarity Measures for Audio Query-by-Example". proc. WASPAA 2007. 2007.

2006

conference
M. Ryynänen and A. Klapuri. "Transcription of the Singing Melody in Polyphonic Music". Proc. 7th International Conference on Music Information Retrieval (ISMIR 2006). 2006.
conference
A. Klapuri. "Multiple fundamental frequency estimation by summing harmonic amplitudes". 7th International Conference on Music Information Retrieval (ISMIR-06). 2006.
conference
J. Paulus and A. Klapuri. "Music Structure Analysis by Finding Repeated Parts". Proc. of the 1st ACM Audio and Music Computing Multimedia Workshop. 2006. pp. 59-68.
conference
E. B. Bilcu and J. Astola. "Neural networks with random letter codes for text-to-phoneme mapping and small training dictionary". Proceedings of the 14th European Signal Processing Conference, EUSIPCO. 2006.
conference
E. B. Bilcu and J. Astola. "A Hybrid neural network for language identification from text". Proceedings of the 2006 IEEE International Workshop on Machine Learning for Signal Processing. 2006. pp. 253-258.
article
F. Gouyon et al. "An experimental comparison of audio tempo induction algorithms", IEEE Trans. Audio, Speech, and Language Processing, Vol. 14, Sept, 2006, pp. 1832-1844.
conference
M. Helén and T. Lahti. "Query by Example Methods for Audio Signals". 7th Nordic Signal Processing Symposium (NORSIG 2006). 2006.
conference
M. Parviainen, T. Pirinen and P. Pertilä. "A speaker localization system for lecture room environment". Machine Learning for Multimodal Interaction, the Third International Workshop, MLMI. 2006. pp. 225-235.
conference
J. Paulus. "Acoustic Modelling of Drum Sounds With Hidden Markov Models for Music Transcription". Proc. of the IEEE International Conference on Acoustics, Speech, and Signal Processing. 2006. pp. 241-244.
conference
M. Ryynänen. "Singing transcription". Signal Processing Methods for Music Transcription. A. Klapuri and M. Davy eds. 2006. pp. 361-391.
conference
P. Herrera-Boyer, A. Klapuri and M. Davy. "Automatic classification of pitched musical instrument sounds". Signal Processing Methods for Music Transcription. A. Klapuri and M. Davy eds. 2006. pp. 163-200.
conference
A. Klapuri. "Auditory model-based methods for multiple fundamental frequency estimation". Signal Processing Methods for Music Transcription. A. Klapuri and M. Davy eds. 2006. pp. 229-265.
conference
T. Pirinen. "A Lattice viewpoint for direction of arrival estimation using quantized time differences of arrival". Proceedings of the Fourth IEEE Workshop on Sensor Array and Multichannel Processing, SAM, Waltham, Massachusetts, USA, 12-14 July 2006. 2006. pp. 50-54.
conference
T. Pirinen and A. Visa. "Signal independent wideband activity detection features for microphone arrays". Proceedings of the 2006 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2006, Toulouse, France, 14-19 May 2006. 2006. pp. 1109-1112.
article
A. Eronen et al. "Audio-Based Context Recognition", IEEE Trans. Audio, Speech and Language Processing, Vol. 14. 2006.
conference
A. Klapuri. "Signal Processing Methods for Music Transcription". A. Klapuri and M. Davy eds. 2006.
conference
M. Parviainen, T. Pirinen and P. Pertilä. "A speaker localization system for lecture room environment". Lecture Notes in Computer Science. 2006. pp. 225-235.
article
A. Klapuri, A. Eronen and J. Astola. "Analysis of the meter of acoustic musical signals", IEEE Trans. Audio, Speech, and Language Processing, Vol. 14. 2006.
conference
T. Virtanen and A. Klapuri. "Analysis of polyphonic audio using source-filter model and non-negative matrix factorization". Advances in Models for Acoustic Processing, Neural Information Processing Systems Workshop. 2006.
conference
T. Virtanen. "Speech Recognition Using Factorial Hidden Markov Models for Separation in the Feature Space". proc. Interspeech. 2006.
incollection
D. FitzGerald and J. Paulus. "Unpitched Percussion Transcription". Klapuri, Anssi, Davy and Manuel eds. Springer-Verlag. 2006. pp. 131-162.

2005

conference
P. Pertilä, M. Parviainen, T. Korhonen and A. Visa. "Moving sound source localization in large areas". Proceedings of 2005 International Symposium on Intelligent Signal Processing and Communication Systems, ISPACS. 2005. pp. 745-748.
conference
M. Ryynänen and A. Klapuri. "Polyphonic music transcription using note event modeling". Proc. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics. 2005.
conference
A. Klapuri. "A perceptually motivated multiple-F0 estimation method". Proc. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics. 2005.
conference
E. B. Bilcu, J. Astola and J. Saarinen. "Comparative study of letter encoding for text-to-phoneme mapping". Proceedings of 13. European Signal Processing Conference, EUSIPCO. 2005.
conference
J. Paulus. "Drum Transcription from Polyphonic Music with Instrument-wise Hidden Markov Models". Proc. of the First Annual Music Information Retrieval Evaluation eXchange. 2005.
conference
J. Paulus and T. Virtanen. "Drum Transcription with Non-negative Spectrogram Factorisation". Proc. of the 13th European Signal Processing Conference. 2005.
conference
T. Korhonen, P. Pertilä and A. Visa. "Particle filtering in high clutter environment". Proceedings of the 2005 Finnish Signal Processing Symposium - FINSIG'05. 2005. pp. 12-15.
conference
T. Pirinen, P. Pertilä and M. Parviainen. "The TUT 2005 source localization system". Proceedings of the Rich Transcription 2005 Spring Meeting Recognition Evaluation. 2005. pp. 93-99.
Results 201 - 300 of 391