# Publications

Filter by:
Go Reset

## 2006

conference

T. Virtanen. "Speech Recognition Using Factorial Hidden Markov Models for Separation in the Feature Space".

*proc. Interspeech*. 2006.
incollection

D. FitzGerald and J. Paulus. "Unpitched Percussion Transcription". Klapuri, Anssi, Davy and Manuel eds. Springer-Verlag. 2006. pp. 131-162.

## 2005

conference

P. Pertilä, M. Parviainen, T. Korhonen and A. Visa. "Moving sound source localization in large areas".

*Proceedings of 2005 International Symposium on Intelligent Signal Processing and Communication Systems, ISPACS*. 2005. pp. 745-748.
conference

M. Ryynänen and A. Klapuri. "Polyphonic music transcription using note event modeling".

*Proc. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics*. 2005.
conference

A. Klapuri. "A perceptually motivated multiple-F0 estimation method".

*Proc. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics*. 2005.
conference

E. B. Bilcu, J. Astola and J. Saarinen. "Comparative study of letter encoding for text-to-phoneme mapping".

*Proceedings of 13. European Signal Processing Conference, EUSIPCO*. 2005.
conference

J. Paulus. "Drum Transcription from Polyphonic Music with Instrument-wise Hidden Markov Models".

*Proc. of the First Annual Music Information Retrieval Evaluation eXchange*. 2005.
conference

J. Paulus and T. Virtanen. "Drum Transcription with Non-negative Spectrogram Factorisation".

*Proc. of the 13th European Signal Processing Conference*. 2005.
conference

T. Korhonen, P. Pertilä and A. Visa. "Particle filtering in high clutter environment".

*Proceedings of the 2005 Finnish Signal Processing Symposium - FINSIG'05*. 2005. pp. 12-15.
conference

T. Pirinen, P. Pertilä and M. Parviainen. "The TUT 2005 source localization system".

*Proceedings of the Rich Transcription 2005 Spring Meeting Recognition Evaluation*. 2005. pp. 93-99.
conference

A. Mesaros and J. Astola. "Inter-dependence of spectral measures for the singing voice".

*Proceedings of International Symposium on Signal, Circuits and Systems, ISSCS*. 2005. pp. 307-310.
conference

M. Parviainen, P. Pertilä, T. Korhonen and A. Visa. "A spatiotemporal approach for passive sound source localization - real-world experiments".

*Proceedings of International Workshop on Nonlinear Signal and Image Processing, NSIP*. 2005. pp. 468-473.
conference

T. Mikkonen. "Homogeneous graph invariants".

*International conference on Discrete Mathematics and ist applications, Tamil Nadu, India, 9-11 December 2005*. 2005. pp. 4 p.
article

C. Wooters et al. "The 2004 ICSI-SR-UW meeting recognition system",

*Lecture Notes in Computer Science*, Vol. 3361. 2005, pp. 196-208.
conference

T. Pirinen. "Normalized confidence factors for robust direction of arrival estimation".

*Proceedings of 2005 IEEE International Symposium on Circuits and Systems, ISCAS 2005, Kobe, Japan, 23-26 May 2005*. 2005. pp. 1429-1432.
conference

A. Klapuri, T. Virtanen and M. Helén. "Modeling musical sounds with an interpolating state model".

*Proc. European signal processing conference*. 2005.
conference

M. Helén and T. Virtanen. "Separation of Drums From Polyphonic Music Using Non-Negative Matrix Factorization and Support Vector Machine". . European Signal Processing Conference ed. 2005.

## 2004

conference

P. Pertilä, M. Parviainen, T. Korhonen and A. Visa. "A spatiotemporal approach to passive sound source localization".

*Proceedings of International Symposium on Communications and Information Technologies 2004, ISCIT*. 2004. pp. 1150-1154.
article

A. Klapuri. "Automatic music transcription as we know it today",

*Journal of New Music Research*, Vol. 33, September, 2004, pp. 269-282.
conference

T. Pirinen, J. Yli-Hietanen, P. Pertilä and A. Visa. "Detection and compensation of sensor malfunction in time delay based direction of arrival estimation".

*Proceedings of 2004 IEEE International Symposium on Circuits and Systems, ISCAS*. 2004. pp. 872 - 875.
conference

E. B. Bilcu, J. Astola and J. Saarinen. "Recurrent neural networks with both side input context dependence for text-to-phoneme mapping".

*Proceedings of the 2004 First International Symposium on Control, Communications and Signal Processing, ISCCSP*. 2004. pp. 599-602.
book

A. Klapuri, A. Eronen and J. Astola.

*Automatic estimation of the meter of acoustic musical signals*, TTY-Paino, 2004.
article

K. Koppinen. "Analysis of the asymptotic impulse and frequency responses of polynomial predictors",

*Signal Processing*, Vol. 84. 2004, pp. 549-560.
conference

A. Stolcke et al.. "Progress in meeting recognition: The ICSI-SRI-UW spring 2004 evaluation system".

*Proceedings of NIST ICASSP 2004 Meeting Recognition Workshop, Montreal, Canada, 17 May 2004*. 2004. pp. 7 p.
conference

T. Pirinen and J. Yli-Hietanen. "Time delay based failure-robust direction of arrival estimation".

*Proceedings of 2004 IEEE Sensor Array and Multichannel Signal Processing Workshop, SAM 2004, Barcelona, Spain, 18-21 July 2004*. 2004. pp. 5 p.
conference

N. Mirfhafori et al.. "From switchboard to meetings: development of the 2004 ICSI-SRI-UW meeting recognition system".

*Proceedings of the 8th International Conference on Spoken Language Processing, Interspeech 20004, ICSLP, Jeju Island, Korea, 4-8 October 2004*. Kim, S. H., Youn and D. H. eds. 2004. pp. 4 p.
conference

M. Ryynänen and A. Klapuri. "Modelling of note events for singing transcription".

*Proc. ISCA Tutorial and Research Workshop on Statistical and Perceptual Audio Processing*. 2004.
conference

A. Pertusa, A. Klapuri and J. M. N}esta. "Recognition of note onsets in digital music using semitone bands".

*Progress in Pattern Recognition, Image Analysis and Applications: 10th Iberoamerican Congress on Pattern Recognition*. A. Sanfeliu and M. Lazo eds. 2004. pp. 869-879.
conference

T. Virtanen. "Separation of Sound Sources by Convolutive Sparse Coding". I. Tutorial, R. W. on Statistical and P. A. Processing eds. 2004.

## 2003

conference

M. Parviainen and T. Virtanen. "Two-channel separation of speech using direction-of-arrival estimation and sinusoids plus transients modeling".

*Proceedings of 2003 IEEE International Symposium on Intelligent Signal Processing and Communication Systems, ISPACS 2003*. 2003. pp. 127-132.
conference

M. Helén and T. Virtanen. "Perceptually motivated parametric representation for harmonic sounds for data compression purposes".

*Proceedings of the 6th International Conference on Digital Audio Effects DAFx-03*. 2003. pp. 249-253.
conference

T. Virtanen. "Algorithm for the separation of harmonic sounds with time-frequency smoothness constraint".

*Proceedings of the 6th International Conference on Digital Audio Effects DAFx-03*. 2003. pp. 35-40.
conference

J. Paulus and A. Klapuri. "Model-based Event Labeling in the Transcription of Percussive Audio Signals".

*Proc. of the 6th International Conference on Digital Audio Effects*. Davies and Mike eds. 2003. pp. 73-77.
conference

A. Eronen. "Musical instrument recognition using ICA-based transform of features and discriminatively trained HMMs".

*Proceedings of the Seventh International Symposium on Signal Processing and its Applications*. 2003. pp. 133-136.
conference

J. Paulus and A. Klapuri. "Conventional and Periodic N-grams in the Transcription of Drum Sequences".

*Proc. of the IEEE International Conference on Multimedia and Expo*. 2003. pp. 737-740.
conference

K. Koppinen. "Design of narrowband fir filters with minimal noise gain using complex interpolation".

*IEEE Proceedings of 2003 International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2003*. 2003. pp. 265-268.
conference

P. Pertilä, T. Pirinen, A. Visa and T. Korhonen. "Comparison of three post-processing methods for acoustic localization".

*Proceedings of SPIE, Unattended Ground Sensor Technologies and Applications V*. 2003. pp. 9-17.
conference

T. Pirinen, P. Pertilä and A. Visa. "A new method for outlier removal in time delay based direction of arrival estimates".

*Proceedings of SPIE, Unattended Ground Sensor Technologies and Applications V*. 2003. pp. 18-29.
conference

R. Niemistö and T. Mäkelä. "On performance of linear adaptive filtering algorithms in acoustic echo control in presence of distorting loudspeakers".

*Proceedings of the Eight International Workshop on Acoustic Echo and Noise Control, IWAENC 2003, Kyoto, Japan, 8-11 September 2003*. S. Makino and M. Miyoshi eds. 2003. pp. 79-82.
conference

T. Mäkelä and R. Niemistö. "Effects of harmonic components generated by polynomial preprocessors in acoustic echo control".

*Proceedings of the Eight International Workshop on Acoustic Echo and Noise Control, IWAENC 2003, Kyoto, Japan, 8-11 September 2003*. S. Makino and M. Miyoshi eds. 2003. pp. 139-142.
conference

S. Kuja-Halkola and A. Eronen. "Simultaneous training and order selection of gaussian mixture models for speaker recognition".

*Proceedings of the 2003 Finnish Signal Processing Symposium, FINSIG'03, Tampere, Finland, 19 May 2003*. H. Huttunen, A. Gotchev and A. Vasilache eds. 2003. pp. 259-263.
conference

T. Viitaniemi, A. Klapuri and A. Eronen. "A probabilistic model for the transcription of single-voice melodies".

*Proceedings of the 2003 Finnish Signal Processing Symposium, FINSIG'03, Tampere, Finland, 19 May 2003*. H. Huttunen, A. Gotchev and A. Vasilache eds. 2003. pp. 59-63.
article

E. Gómez, A. Klapuri and B. Meudic. "Melody Description and Extraction in the Context of Music Content Processing",

*Journal of New Music Research*, Vol. 32. 2003.
conference

A. Eronen and T. Heittola. "Discriminative training of unsupervised acoustic models for non-speech audio".

*Proceedings of the 2003 Finnish Signal Processing Symposium, FINSIG'03*. 2003. pp. 54-58.
article

A. Klapuri. "Multiple fundamental frequency estimation by harmonicity and spectral smoothness",

*IEEE Trans. Speech and Audio Processing*, Vol. 11. 2003, pp. 804-816.
conference

T. Pirinen, P. Pertilä and A. Visa. "Toward intelligent sensors - reliability for time delay based direction of arrival estimates".

*IEEE Proceedings of 2003 International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2003*. 2003.
conference

T. Virtanen. "Sound Source Separation Using Sparse Coding with Temporal Continuity Objective".

*International Computer Music Conference*. 2003.## 2002

conference

J. Paulus and A. Klapuri. "Measuring the Similarity of Rhythmic Patterns".

*Proc. of the Third International Conference on Music Information Retrieval*. Fingerhut and Michael eds. 2002. pp. 150-156.
conference

T. Heittola and A. Klapuri. "Locating segments with drums in music signals".

*International Conference on Music Information Retrieval (ISMIR)*. 2002. pp. 271-272.
conference

A. Eronen et al.. "Audio-based context awareness - Acoustic modeling and perceptual evaluation".

*Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing*. 2002. pp. 1941-1944.
conference

E. B. Bilcu, P. Salmela, J. Suontausta and J. Saarinen. "Application of the Neural Networks for Text-to-Phoneme Mapping".

*Proceedings of EUSIPCO 2002 the XI European Signal Processing Conference, September 3-6, 2002, Tolouse, France*. 2002. pp. 97-100.
conference

E. B. Bilcu, J. Suontausta and J. Saarinen. "A New Transform Domain Neural Network for Text-To-Phoneme Mapping".

*Proceedings of the 6th WSEAS International Multiconference on Circuits, Systems, Communications and Computers, CSCC 2002, July 7-14, 2002, Grete, Greece*. 2002. pp. 4591-4596.
conference

J. Yli-Hietanen and T. Saarelainen. "Analysis of robust time-delay based angle-of-arrival estimation methods".

*DSP2002, 14th International Conference on Digital Signal Processing Proceedings, July 1-3, 2002, Santorini, Greece*. A. N. Skodras and A. G. Constantinides eds. 2002. pp. 239-242.
conference

R. Niemistö and T. Mäkelä. "Robust adaptive polynomial filters for acoustic echo cancellation".

*Proceedings of the 5th Nordic Signal Processing Symposium, NORSIG 2002, October 4-7, 2002, on board Hurtigruten, Norway*. 2002. pp. 5 s.
conference

R. Niemistö, T. Mäkelä and V. Myllylä. "Robust fast affine projection algorithm for nonlinear acoustic echo cancellation".

*Proceedings of EUSIPCO 2002, XI European Signal Processing Conference, September 3-6, 2002, Tolouse, France*. 2002. pp. 523-526.
conference

V. Peltonen, J. Tuomi, A. Klapuri, J. Huopaniemi and T. Sorsa. "Computational auditory scene recognition".

*IEEE International Conference on Audio, Speech and Signal Processing*. 2002.
conference

T. Virtanen and A. Klapuri. "Separation of Harmonic Sounds Using Linear Models for the Overtone Series".

*IEEE International Conference on Audio, Speech and Signal Processing*. 2002.## 2001

conference

A. Klapuri, A. Eronen, J. Seppänen and T. Virtanen. "Automatic transcription of music".

*Symposium on Stochastic Modeling of Music, 14th Meeting of the FWO Research Society on Foundations of Music Research*. 2001.
conference

A. Eronen. "Comparison of features for musical instrument recognition".

*Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA*. 2001. pp. 19-22.
conference

A. Klapuri, T. Virtanen, A. Eronen and J. Seppänen. "Automatic transcription of musical recordings".

*Consistent & Reliable Acoustic Cues Workshop, CRAC-01*. 2001.
conference

T. Virtanen. "Accure Sinusoidal Model Analysis and Parameter Redustion by Fusion of Componets".

*Audio Engineering Society, Convention Paper, Presented at the 110th Convention*. 2001.
conference

V. Peltonen, A. Eronen, M. Parviainen and A. Klapuri. "Recognition of Everyday Auditory Scenes: Potentials, Latencies and Clues".

*Audio Engineering Society, Convention Paper, Presented at the 110th Convention, 2001 May 12-15, Amsterdam, The Netherlands*. 2001. pp. 4 s.
conference

V. Peltonen, A. Eronen, M. Parviainen and A. Klapuri. "Recognition of everyday auditory scenes: potentials, latencies and cues".

*110th Audio Engineering Society Convention*. 2001.
conference

T. Virtanen and A. Klapuri. "Separation of Harmonic Sounds Using Multipitch Analysis and Iterative Parameter Estimation".

*in Proc. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics*. 2001.
conference

A. Klapuri. "Multipitch estimation and sound separation by the spectral smoothness principle".

*IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)*. 2001.
conference

A. Klapuri. "Means of integrating audio content analysis algorithms".

*10th Audio Engineering Society Convention*. 2001.## 2000

conference

T. Mikkonen and K. Koppinen. "Soft-Decision Decoding of Binary Block Codes in Celp Speech Coding".

*Eusipco 2000, X European Signal Processing Conference*. 2000. pp. 825-828.
conference

K. Koppinen and J. Astola. "Generalized IIR polynomial predictive filters".

*Signal Processing X Theories and Applications, Proceedings of EUSIPCO 2000, 10th European Signal Processing Conference*. 2000. pp. 2457-2460.
conference

J. Kivimäki, T. Lahti and K. Koppinen. "A Phonetic Vocoder for Finnish".

*Proceedings of the X European Signal Processing Conference (EUSIPCO)*. 2000. pp. 1301-1304.
conference

J. Yli-Hietanen, T. Saarelainen and J. Routakangas. "Robust Angle-of-Arrival Estimation of Transient Signals".

*In Proceedings of the IEEE Nordic Signal Processing Symposium (NORSIG)*. 2000. pp. 65-68.
conference

A.-V. Rosti and V. Koivunen. "Classification of mfsk modulated signals using the mean of complex envelope".

*Signal Processing X Theories and Applications, Proceedings of EUSIPCO 2000, tenth European Signal processing Conference, 4-8 September 2000, Tampere, Finland*. M. Gabbouj and P. Kuosmanen eds. 2000. pp. 581-584.
conference

T. Saarelainen and J. Yli-Hietanen. "A design method for small sensor arrays in angle of arrival estimation".

*2000 10th European Signal Processing Conference*. 2000. pp. 1-4.
conference

J. Yli-Hietanen, T. Saarelainen and J. Routakangas. "Robust Angle of Arrival Estimation of Transient Signals".

*Norsig 2000, Vildmarkshotellet, Kolmården, Sweden, June 13 - June 15, 2000*. 2000. pp. 65-68.
conference

T. Saarelainen and J. Yli-Hietanen. "Design Method for Small Sensor Arrays in Angle of Arrival Estimation".

*Signal Processing X, Theories and Applications, EUSIPCO 2000, 4-8 September 2000, Tampere, Finland*. 2000. pp. 1589-1592.
conference

A. Klapuri, T. Virtanen and J.-M. Holm. "Robust multipitch estimation for the analysis and manipulation of polyphonic musical signals".

*In Proc. COST-G6 Conference on Digital Audio Effects, DAFx-00*. 2000.
conference

T. Virtanen and A. Klapuri. "Separation of Harmonic Sound Sources Using Sinusoidal Modeling".

*IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)*. 2000.
conference

A. Klapuri. "Qualitative and quantitative aspects in the design of periodicity estimation algorithms".

*Proceedings of the European Signal Processing Conference EUSIPCO*. 2000.
conference

A. Eronen and A. Klapuri. "Musical Instrument Recognition Using Cepstral Coefficients and Temporal Features".

*IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2000*. 2000.
conference

J. Sillanpää, A. Klapuri, J. Seppänen and T. Virtanen. "Recognition of acoustic noise mixtures by combined bottom-up and top-down processing".

*Proceedings of the European Signal Processing Conference EUSIPCO*. 2000.## 1999

conference

A. Klapuri. "Pitch Estimation Using Multiple Independent Time-Frequency Windows".

*Proc. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics*. 1999.
conference

J. Yli-Hietanen, K. Koppinen and J. Astola. "Time-delay Selection for Robust Angle of Arrival Estimation".

*Proceedings of the IASTED Internatioanl Conference Signal and Image Processing (SIP99)*. 1999.
conference

J. Yli-Hietanen, K. Koppinen and K. Halonen. "Cluster Filter".

*In proceedings of the European Signal Processing Conference*. 1999. pp. 1905-1907.
conference

A. Klapuri. "Wide-band Pitch Estimation for Natural Sound Sources with Inharmonicities".

*106th Audio Engineering Society Convention*. 1999.
conference

J. Seppänen, S. Kananoja, J. Yli-Hietanen, K. Koppinen and J. Sjöberg. "Maximization of the subjective loudness of speech with constrained amplitude".

*Applications of Signal Processing to Audio and Acoustics, 1999 IEEE Workshop on*. 1999. pp. 139-142.
conference

A. Klapuri. "Sound Onset Detection by Applying Psychoacoustic Knowledge".

*IEEE International Conference on Acoustics, Speech and Signal Processing*. 1999.## 1998

conference

J. Yli-Hietanen, K. Koppinen and E. Paajanen. "Siren Sound Suppression for Speech Enhancement in Mobile Communications".

*ICSPAT98*. 1998. pp. 1277-1280.
conference

K. Koppinen, J. Yli-Hietanen and P. Händel. "Design of Multi-Delay Predictive Filters Using Dynamic Programming".

*Proceedings of EUSIPCO'98, 9th European Signal Processing Conference*. Theodoridis and S. et Al. eds. 1998. pp. 161-164.
conference

A. Klapuri. "Number Theoretical Means of Resolving a Mixture of Several Harmonic Sounds".

*Proceedings of the European Signal Processing Conference*. 1998.## 1997

conference

K. Koppinen, J. Yli-Hietanen and J. Astola. "Optimization of generalized predictors".

*IMTC Proceedings*. 1997. pp. 54-59.## 1996

conference

K. Koppinen, O. Vainio and J. Astola. "Analysis and Design of Polynomial Predictors".

*Proc. IEEE Nordic Signal Processing Symposium*. 1996. pp. 45-48.
conference

J. Yli-Hietanen, K. Kalliojärvi and J. Astola. "Low-complexity angle of arrival estimation of wideband signals using small arrays".

*Proceedings of 8th Workshop on Statistical Signal and Array Processing*. 1996. pp. 109-112.
conference

J. Yli-Hietanen, K. Kalliojärvi and J. Astola. "Robust Time-Delay Based Angle of Arrival Estimation".

*Proceedings of Norsig'96*. 1996.## No year

article

"Improving competing voices segregation for hearing impaired listeners using a low-latency deep neural network algorithm".

conference

"Multichannel Sound Event Detection Using 3D Convolutional Neural Networks for Learning Inter-channel Features".

conference

"An active learning method using clustering and committee-based sample selection for sound event classification".