Publications

2009

conference
T. Virtanen and A. T. Cemgil. "Mixtures of Gamma Priors for Non-Negative Matrix Factorization Based Speech Separation". ICA. 2009.
conference
A. Mesaros and T. Virtanen. "Adaptation of a speech recognizer for singing voice". Proceedings of the 17th European Signal Processing Conference. 2009. pp. 1779-1783.
conference
T. Virtanen and T. Heittola. "Interpolating Hidden Markov Model and Its Application to Automatic Instrument Recognition". ICASSP. 2009.
conference
T. Heittola, A. Klapuri and T. Virtanen. "Musical Instrument Recognition in Polyphonic Audio Using Source-Filter Model for Sound Separation". Proc. 10th Int. Society for Music Information Retrieval Conf. (ISMIR 2009). 2009. pp. 327-332.
conference
A. Klapuri. "A classification approach to multipitch analysis". 6th Sound and Music Computing Conference. 2009.
conference
A. Klapuri. "A method for visualizing the pitch content of polyphonic music signals". Proc. 10th Int. Society for Music Information Retrieval Conf. (ISMIR 2009). 2009.
conference
M. Myllymäki and T. Virtanen. "Non-Stationary Noise Model Compensation in Voice Activity Detection". EUSIPCO. 2009.
conference
T. Virtanen. "Spectral Covariance in Prior Distributions of Non-Negative Matrix Factorization Based Speech Separation". EUSIPCO. 2009.
article
J. Paulus and A. Klapuri. "Drum sound detection in polyphonic music with hidden Markov models", EURASIP Journal on Audio, Speech, and Music Processing, Vol. 2009. 2009.
incollection
J. Paulus and A. Klapuri. "Labelling the Structural Parts of a Music Piece with Markov Models". S. Ystad, R. Kronland-Martinet and K. Jensen eds. Springer Berlin / Heidelberg. 2009. pp. 166-176.

2008

conference
P. Pertilä. "Array steered response time-alignment for propagation delay compensation for acoustic localization". Proceedings of the Forty-Second Asilomar Conference on Signals, Systems and Computers. 2008. pp. 298-302.
conference
T. Lahti et al. "On Enabling Techniques for Personal Audio Content Management". ACM International Conference on Multimedia Information Retrieval (MIR 2008). 2008.
conference
E. Helander, J. Schwarz, J. Nurminen, H. Silén and M. Gabbouj. "On the impact of alignment on voice conversion performance". Proceedings of the 9th Annual Conference of the International Speech Communication Association, Interspeech. 2008. pp. 1453-1456.
conference
H. Silén, E. Helander, J. Nurminen and M. Gabbouj. "Evaluation of Finnish unit selection and HMM-based speech synthesis". Proceedings of the 9th Annual Conference of the International Speech Communication Association, Interspeech. 2008. pp. 1853-1856.
conference
J. Paulus and A. Klapuri. "Music Structure Analysis Using a Probabilistic Fitness Measure And an Integrated Musicological Model". Proc. of the Ninth International Conference on Music Information Retrieval. 2008.
conference
J. Paulus and A. Klapuri. "Acoustic Features for Music Piece Structure Analysis". Proc. of the 11th International Conference on Digital Audio Effects. 2008. pp. 309-312.
conference
M. Ryynänen, T. Virtanen, J. Paulus and A. Klapuri. "Accompaniment separation and karaoke application based on automatic melody transcription". IEEE International Conf. on Multimedia and Expo. 2008.
conference
J. Paulus and A. Klapuri. "Labelling the Structural Parts of a Music Piece with Markov Models". Proc. of the 2008 Computers in Music Modeling and Retrieval Conference. K. Jensen ed. 2008. pp. 137-147.
conference
M. Ryynänen and A. Klapuri. "Query by Humming of MIDI and Audio Using Locality Sensitive Hashing". Proc. 2008 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'08). 2008.
conference
E. Helander, J. Nurminen and M. Gabbouj. "LSF mapping for voice conversion with very small training sets". Proceedings of 2008 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP. 2008. pp. 4669-4672.
article
A. Klapuri. "Multipitch analysis of polyphonic music and speech signals using an auditory model", IEEE Trans. Audio, Speech and Language Processing, Vol. 16, Feb, 2008, pp. 255-266.
article
E. B. Bilcu and J. Astola. "A hybrid approach to bilingual text-to-phoneme mapping", Facta Universitatis, Series: Electronics and Energetics, Vol. 21. 2008, pp. 91-105.
conference
A. Klapuri and T. Virtanen. "Automatic music transcription". Handbook of Signal Processing in Acoustics. D. Havelock ed. 2008. pp. 277-303.
conference
T. Pirinen. "An experimental comparison of time delay weights for direction of arrival estimation". Proceedings of the 11th International Conference on Digital Audio Effects (DAFx-08), Espoo, Finland, 1-4 September 2008. J. Pakarinen ed. 2008. 4 p.
article
T. Pirinen. "A confidence statistic and an outlier detector for difference estimates in sensor arrays", IEEE Sensors Journal, Vol. 8. 2008, pp. 2008-2015.
conference
T. Virtanen, A. T. Cemgil and S. Godsill. "Bayesian Extensions to Non-negative Matrix Factorisation for Audio Signal Modelling". ICASSP 2008. 2008.
article
P. Pertilä, T. Korhonen and A. Visa. "Measurement combination for acoustic source localization in a room environment", EURASIP Journal on Audio, Speech, and Music Processing, Vol. 2008. 2008.
conference
T. Korhonen and P. Pertilä. "TUT acoustic source tracking system 2007". Lecture Notes in Computer Science. 2008. pp. 104-112.
article
M. Ryynänen and A. Klapuri. "Automatic Transcription of Melody, Bass Line, and Chords in Polyphonic Music", Computer Music Journal, Vol. 32. 2008.
conference
M. Myllymäki and T. Virtanen. "Voice Activity Detection in the Presence of Breathing Noise Using Neural Network and Hidden Markov Model". EUSIPCO 2008. 2008.
conference
A. Mesaros and T. Virtanen. "Automatic Alignment of Music Audio and Lyrics". DAFx08. 2008.
conference
T. Virtanen, A. Mesaros and M. Ryynänen. "Combining Pitch-Based Inference and Non-Negative Spectrogram Factorization in Separating Vocals from Polyphonic Music". 2008.

2007

conference
E. Helander, H. Silén and M. Gabbouj. "The use of diphone variants in optimal text selection for Finnish unit selection speech synthesis". Proceedings of the 12th International Conference on Speech and Computer, SPECOM. 2007. pp. 293-298.
conference
E. Helander, J. Nurminen and M. Gabbouj. "Analysis of LSF frame selection in voice conversion". Proceedings of the 12th International Conference on Speech and Computer, SPECOM. 2007. pp. 651-656.
conference
M. Helén and T. Virtanen. "A Similarity Measure for Audio Query by Example Based on Perceptual Coding and Compression". Proc. 10th International Conference on Digital Audio Effects (DAFx-07). 2007.
conference
J. Paulus and A. Klapuri. "Combining temporal and spectral features in HMM-based drum transcription". Proc. of the 8th International Conference on Music Information Retrieval. 2007. pp. 225-228.
conference
E. Helander and J. Nurminen. "On the importance of pure prosody in the perception of speaker identity". Proceedings of the 8th Annual Conference of the International Speech Communication Association, Interspeech. 2007. pp. 2665-2668.
conference
H. Silén, E. Helander, K. Koppinen and M. Gabbouj. "Building a Finnish unit selection TTS system". Proceedings of the 6th ISCA Workshop on Speech Synthesis. 2007. pp. 310-315.
conference
T. Korhonen and P. Pertilä. "TUT acoustic source tracking system 2007". Multimodal Technologies for Perception of Humans, International Evaluation Workshops CLEAR 2007 and RT 2007. 2007. pp. 104-112.
conference
P. Pertilä and M. Parviainen. "Robust speaker localization in meeting room domain". Proceedings of 2007 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP. 2007. pp. 497-500.
conference
E. Helander and J. Nurminen. "A novel method for prosody prediction in voice conversion". IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP. 2007. pp. 509-512.
article
T. Virtanen. "Monaural Sound Source Separation by Non-Negative Matrix Factorization with Temporal Continuity and Sparseness Criteria", IEEE Transactions on Audio, Speech, and Language Processing, Vol. 15, March, 2007.
conference
E. B. Bilcu and J. Astola. "Improved hybrid approach for bilingual language recognition from text". Proceedings of the 5th International Symposium on Image and Signal Processing and Analysis, ISPA 2007, Istanbul, Turkey, 27-29 September 2007. M. Petrou ed. 2007. pp. 190-195.
article
M. McKinney, D. Moelants, M. E. Davies and A. Klapuri. "Evaluation of audio beat tracking and music tempo extraction algorithms", Journal of New Music Research, Vol. 36. 2007, pp. 1-16.
conference
T. Heittola and A. Klapuri. "TUT Acoustic Event Detection System 2007". Multimodal Technologies for Perception of Humans, Joint Proceedings of the CLEAR 2007 and RT 2007 Evaluation Workshops. R. Stiefelhagen, R. Bowers and J. Fiscus eds. 2007. pp. 364-370.
article
T. Pirinen, P. Pertilä and T. Korhonen. "Seinille kasvaa korvat", Prosessori. 2007, pp. 46-47.
conference
P. Pertilä, T. Korhonen, T. Pirinen and M. Parviainen. "TUT acoustic source tracking system 2006". Lecture Notes in Computer Science. 2007. pp. 127-136.
conference
A. Mesaros, T. Virtanen and A. Klapuri. "Singer Identification in Polyphonic Music Using Vocal Separation and Pattern Recognition Methods". International Conference on Music Information Retrieval. 2007.
conference
M. Ryynänen and A. Klapuri. "Automatic bass line transcription from streaming polyphonic audio". IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2007.
conference
A. Klapuri. "Analysis of musical instrument sounds by source-filter-decay model". IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2007.
conference
T. Virtanen and M. Helén. "Probabilistic Model Based Similarity Measures for Audio Query-by-Example". Proc. WASPAA 2007. 2007.

2006

conference
M. Ryynänen and A. Klapuri. "Transcription of the Singing Melody in Polyphonic Music". Proc. 7th International Conference on Music Information Retrieval (ISMIR 2006). 2006.
conference
A. Klapuri. "Multiple fundamental frequency estimation by summing harmonic amplitudes". 7th International Conference on Music Information Retrieval (ISMIR-06). 2006.
conference
J. Paulus and A. Klapuri. "Music Structure Analysis by Finding Repeated Parts". Proc. of the 1st ACM Audio and Music Computing Multimedia Workshop. 2006. pp. 59-68.
conference
E. B. Bilcu and J. Astola. "Neural networks with random letter codes for text-to-phoneme mapping and small training dictionary". Proceedings of the 14th European Signal Processing Conference, EUSIPCO. 2006.
conference
E. B. Bilcu and J. Astola. "A Hybrid neural network for language identification from text". Proceedings of the 2006 IEEE International Workshop on Machine Learning for Signal Processing. 2006. pp. 253-258.
article
F. Gouyon et al. "An experimental comparison of audio tempo induction algorithms", IEEE Trans. Audio, Speech, and Language Processing, Vol. 14, Sept, 2006, pp. 1832-1844.
conference
M. Helén and T. Lahti. "Query by Example Methods for Audio Signals". 7th Nordic Signal Processing Symposium (NORSIG 2006). 2006.
conference
M. Parviainen, T. Pirinen and P. Pertilä. "A speaker localization system for lecture room environment". Machine Learning for Multimodal Interaction, the Third International Workshop, MLMI. 2006. pp. 225-235.
conference
J. Paulus. "Acoustic Modelling of Drum Sounds With Hidden Markov Models for Music Transcription". Proc. of the IEEE International Conference on Acoustics, Speech, and Signal Processing. 2006. pp. 241-244.
conference
M. Ryynänen. "Singing transcription". Signal Processing Methods for Music Transcription. A. Klapuri and M. Davy eds. 2006. pp. 361-391.
conference
P. Herrera-Boyer, A. Klapuri and M. Davy. "Automatic classification of pitched musical instrument sounds". Signal Processing Methods for Music Transcription. A. Klapuri and M. Davy eds. 2006. pp. 163-200.
conference
A. Klapuri. "Auditory model-based methods for multiple fundamental frequency estimation". Signal Processing Methods for Music Transcription. A. Klapuri and M. Davy eds. 2006. pp. 229-265.
conference
T. Pirinen. "A Lattice viewpoint for direction of arrival estimation using quantized time differences of arrival". Proceedings of the Fourth IEEE Workshop on Sensor Array and Multichannel Processing, SAM, Waltham, Massachusetts, USA, 12-14 July 2006. 2006. pp. 50-54.
conference
T. Pirinen and A. Visa. "Signal independent wideband activity detection features for microphone arrays". Proceedings of the 2006 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2006, Toulouse, France, 14-19 May 2006. 2006. pp. 1109-1112.
article
A. Eronen et al. "Audio-Based Context Recognition", IEEE Trans. Audio, Speech and Language Processing, Vol. 14. 2006.
conference
A. Klapuri. "Signal Processing Methods for Music Transcription". A. Klapuri and M. Davy eds. 2006.
conference
M. Parviainen, T. Pirinen and P. Pertilä. "A speaker localization system for lecture room environment". Lecture Notes in Computer Science. 2006. pp. 225-235.
article
A. Klapuri, A. Eronen and J. Astola. "Analysis of the meter of acoustic musical signals", IEEE Trans. Audio, Speech, and Language Processing, Vol. 14. 2006.
conference
T. Virtanen and A. Klapuri. "Analysis of polyphonic audio using source-filter model and non-negative matrix factorization". Advances in Models for Acoustic Processing, Neural Information Processing Systems Workshop. 2006.
conference
T. Virtanen. "Speech Recognition Using Factorial Hidden Markov Models for Separation in the Feature Space". Proc. Interspeech. 2006.
incollection
D. FitzGerald and J. Paulus. "Unpitched Percussion Transcription". A. Klapuri and M. Davy eds. Springer-Verlag. 2006. pp. 131-162.

2005

conference
P. Pertilä, M. Parviainen, T. Korhonen and A. Visa. "Moving sound source localization in large areas". Proceedings of 2005 International Symposium on Intelligent Signal Processing and Communication Systems, ISPACS. 2005. pp. 745-748.
conference
M. Ryynänen and A. Klapuri. "Polyphonic music transcription using note event modeling". Proc. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics. 2005.
conference
A. Klapuri. "A perceptually motivated multiple-F0 estimation method". Proc. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics. 2005.
conference
E. B. Bilcu, J. Astola and J. Saarinen. "Comparative study of letter encoding for text-to-phoneme mapping". Proceedings of the 13th European Signal Processing Conference, EUSIPCO. 2005.
conference
J. Paulus. "Drum Transcription from Polyphonic Music with Instrument-wise Hidden Markov Models". Proc. of the First Annual Music Information Retrieval Evaluation eXchange. 2005.
conference
J. Paulus and T. Virtanen. "Drum Transcription with Non-negative Spectrogram Factorisation". Proc. of the 13th European Signal Processing Conference. 2005.
conference
T. Korhonen, P. Pertilä and A. Visa. "Particle filtering in high clutter environment". Proceedings of the 2005 Finnish Signal Processing Symposium - FINSIG'05. 2005. pp. 12-15.
conference
T. Pirinen, P. Pertilä and M. Parviainen. "The TUT 2005 source localization system". Proceedings of the Rich Transcription 2005 Spring Meeting Recognition Evaluation. 2005. pp. 93-99.
conference
A. Mesaros and J. Astola. "Inter-dependence of spectral measures for the singing voice". Proceedings of International Symposium on Signal, Circuits and Systems, ISSCS. 2005. pp. 307-310.
conference
M. Parviainen, P. Pertilä, T. Korhonen and A. Visa. "A spatiotemporal approach for passive sound source localization - real-world experiments". Proceedings of International Workshop on Nonlinear Signal and Image Processing, NSIP. 2005. pp. 468-473.
conference
T. Mikkonen. "Homogeneous graph invariants". International Conference on Discrete Mathematics and its Applications, Tamil Nadu, India, 9-11 December 2005. 2005. 4 p.
article
C. Wooters et al. "The 2004 ICSI-SRI-UW meeting recognition system", Lecture Notes in Computer Science, Vol. 3361. 2005, pp. 196-208.
conference
T. Pirinen. "Normalized confidence factors for robust direction of arrival estimation". Proceedings of 2005 IEEE International Symposium on Circuits and Systems, ISCAS 2005, Kobe, Japan, 23-26 May 2005. 2005. pp. 1429-1432.
conference
A. Klapuri, T. Virtanen and M. Helén. "Modeling musical sounds with an interpolating state model". Proc. European signal processing conference. 2005.
conference
M. Helén and T. Virtanen. "Separation of Drums From Polyphonic Music Using Non-Negative Matrix Factorization and Support Vector Machine". European Signal Processing Conference. 2005.

2004

conference
P. Pertilä, M. Parviainen, T. Korhonen and A. Visa. "A spatiotemporal approach to passive sound source localization". Proceedings of International Symposium on Communications and Information Technologies 2004, ISCIT. 2004. pp. 1150-1154.
article
A. Klapuri. "Automatic music transcription as we know it today", Journal of New Music Research, Vol. 33, September, 2004, pp. 269-282.
conference
T. Pirinen, J. Yli-Hietanen, P. Pertilä and A. Visa. "Detection and compensation of sensor malfunction in time delay based direction of arrival estimation". Proceedings of 2004 IEEE International Symposium on Circuits and Systems, ISCAS. 2004. pp. 872-875.
conference
E. B. Bilcu, J. Astola and J. Saarinen. "Recurrent neural networks with both side input context dependence for text-to-phoneme mapping". Proceedings of the 2004 First International Symposium on Control, Communications and Signal Processing, ISCCSP. 2004. pp. 599-602.
book
A. Klapuri, A. Eronen and J. Astola. Automatic estimation of the meter of acoustic musical signals, TTY-Paino, 2004.
article
K. Koppinen. "Analysis of the asymptotic impulse and frequency responses of polynomial predictors", Signal Processing, Vol. 84. 2004, pp. 549-560.
conference
A. Stolcke et al. "Progress in meeting recognition: The ICSI-SRI-UW spring 2004 evaluation system". Proceedings of NIST ICASSP 2004 Meeting Recognition Workshop, Montreal, Canada, 17 May 2004. 2004. 7 p.
conference
T. Pirinen and J. Yli-Hietanen. "Time delay based failure-robust direction of arrival estimation". Proceedings of 2004 IEEE Sensor Array and Multichannel Signal Processing Workshop, SAM 2004, Barcelona, Spain, 18-21 July 2004. 2004. 5 p.
conference
N. Mirghafori et al. "From switchboard to meetings: development of the 2004 ICSI-SRI-UW meeting recognition system". Proceedings of the 8th International Conference on Spoken Language Processing, Interspeech 2004, ICSLP, Jeju Island, Korea, 4-8 October 2004. S. H. Kim and D. H. Youn eds. 2004. 4 p.
conference
M. Ryynänen and A. Klapuri. "Modelling of note events for singing transcription". Proc. ISCA Tutorial and Research Workshop on Statistical and Perceptual Audio Processing. 2004.
conference
A. Pertusa, A. Klapuri and J. M. Iñesta. "Recognition of note onsets in digital music using semitone bands". Progress in Pattern Recognition, Image Analysis and Applications: 10th Iberoamerican Congress on Pattern Recognition. A. Sanfeliu and M. Lazo eds. 2004. pp. 869-879.
conference
T. Virtanen. "Separation of Sound Sources by Convolutive Sparse Coding". ISCA Tutorial and Research Workshop on Statistical and Perceptual Audio Processing. 2004.