Themos Stafylakis
2020 – today
- 2024
- [j10] Federico Landini, Mireia Díez, Themos Stafylakis, Lukás Burget:
DiaPer: End-to-End Neural Diarization With Perceiver-Based Attractors. IEEE ACM Trans. Audio Speech Lang. Process. 32: 3450-3465 (2024)
- [c69] Christos Vlachos, Themos Stafylakis, Ion Androutsopoulos:
Comparing Data Augmentation Methods for End-to-End Task-Oriented Dialog Systems. ACL (Findings) 2024: 7216-7240
- [c68] Lin Zhang, Themos Stafylakis, Federico Landini, Mireia Díez, Anna Silnova, Lukás Burget:
Do End-to-End Neural Diarization Attractors Need to Encode Speaker Characteristic Information? Odyssey 2024: 123-130
- [i27] Lin Zhang, Themos Stafylakis, Federico Landini, Mireia Díez, Anna Silnova, Lukás Burget:
Do End-to-End Neural Diarization Attractors Need to Encode Speaker Characteristic Information? CoRR abs/2402.19325 (2024)
- [i26] Christos Vlachos, Themos Stafylakis, Ion Androutsopoulos:
Comparing Data Augmentation Methods for End-to-End Task-Oriented Dialog Systems. CoRR abs/2406.06127 (2024)
- [i25] Johan Rohdin, Lin Zhang, Oldrich Plchot, Vojtech Stanek, David Mihola, Junyi Peng, Themos Stafylakis, Dmitriy Beveraki, Anna Silnova, Jan Brukner, Lukás Burget:
BUT Systems and Analyses for the ASVspoof 5 Challenge. CoRR abs/2408.11152 (2024)
- [i24] Junyi Peng, Ladislav Mosner, Lin Zhang, Oldrich Plchot, Themos Stafylakis, Lukás Burget, Jan Cernocký:
CA-MHFA: A Context-Aware Multi-Head Factorized Attentive Pooling for SSL-Based Speaker Verification. CoRR abs/2409.15234 (2024)
- 2023
- [j9] Triantafyllos Kefalas, Eftychia Fotiadou, Markos Georgopoulos, Yannis Panagakis, Pingchuan Ma, Stavros Petridis, Themos Stafylakis, Maja Pantic:
KAN-AV dataset for audio-visual face and speech analysis in the wild. Image Vis. Comput. 140: 104839 (2023)
- [c67] Alexandros Xenos, Themos Stafylakis, Ioannis Patras, Georgios Tzimiropoulos:
A Simple Baseline for Knowledge-Based Visual Question Answering. EMNLP 2023: 14871-14877
- [c66] Sofoklis Kakouros, Themos Stafylakis, Ladislav Mosner, Lukás Burget:
Speech-Based Emotion Recognition with Self-Supervised Models Using Attentive Channel-Wise Correlations and Label Smoothing. ICASSP 2023: 1-5
- [c65] Junyi Peng, Themos Stafylakis, Rongzhi Gu, Oldrich Plchot, Ladislav Mosner, Lukás Burget, Jan Cernocký:
Parameter-Efficient Transfer Learning of Pre-Trained Transformer Models for Speaker Verification Using Adapters. ICASSP 2023: 1-5
- [c64] Pavel Matejka, Anna Silnova, Josef Slavícek, Ladislav Mosner, Oldrich Plchot, Michal Klco, Junyi Peng, Themos Stafylakis, Lukás Burget:
Description and Analysis of ABC Submission to NIST LRE 2022. INTERSPEECH 2023: 511-515
- [c63] Junyi Peng, Oldrich Plchot, Themos Stafylakis, Ladislav Mosner, Lukás Burget, Jan Cernocký:
Improving Speaker Verification with Self-Pretrained Transformer Models. INTERSPEECH 2023: 5361-5365
- [i23] Alexandros Xenos, Themos Stafylakis, Ioannis Patras, Georgios Tzimiropoulos:
A Simple Baseline for Knowledge-Based Visual Question Answering. CoRR abs/2310.13570 (2023)
- [i22] Federico Landini, Mireia Díez, Themos Stafylakis, Lukás Burget:
DiaPer: End-to-End Neural Diarization with Perceiver-Based Attractors. CoRR abs/2312.04324 (2023)
- 2022
- [c62] Themos Stafylakis, Ladislav Mosner, Oldrich Plchot, Johan Rohdin, Anna Silnova, Lukás Burget, Jan Cernocký:
Training speaker embedding extractors using multi-speaker audio with unknown speaker boundaries. INTERSPEECH 2022: 605-609
- [c61] Niko Brummer, Albert Swart, Ladislav Mosner, Anna Silnova, Oldrich Plchot, Themos Stafylakis, Lukás Burget:
Probabilistic Spherical Discriminant Analysis: An Alternative to PLDA for length-normalized embeddings. INTERSPEECH 2022: 1446-1450
- [c60] Anna Silnova, Themos Stafylakis, Ladislav Mosner, Oldrich Plchot, Johan Rohdin, Pavel Matejka, Lukás Burget, Ondrej Glembek, Niko Brummer:
Analyzing Speaker Verification Embedding Extractors and Back-Ends Under Language and Channel Mismatch. Odyssey 2022: 9-16
- [c59] Jahangir Alam, Radek Benes, Marian Beszédes, Lukás Burget, Mohamed Dahmane, Abderrahim Fathan, Hamed Ghodrati, Ondrej Glembek, Woo Hyun Kang, Pavel Matejka, Ladislav Mosner, Oldrich Plchot, Johan Rohdin, Anna Silnova, Themos Stafylakis:
Development of ABC Systems for the 2021 Edition of NIST Speaker Recognition Evaluation. Odyssey 2022: 346-353
- [c58] Gaëlle Laperrière, Valentin Pelloin, Mickaël Rouvier, Themos Stafylakis, Yannick Estève:
On the Use of Semantically-Aligned Speech Representations for Spoken Language Understanding. SLT 2022: 361-368
- [c57] Junyi Peng, Oldrich Plchot, Themos Stafylakis, Ladislav Mosner, Lukás Burget, Jan Cernocký:
An Attention-Based Backend Allowing Efficient Fine-Tuning of Transformer Models for Speaker Verification. SLT 2022: 555-562
- [c56] Themos Stafylakis, Ladislav Mosner, Sofoklis Kakouros, Oldrich Plchot, Lukás Burget, Jan Cernocký:
Extracting Speaker and Emotion Information from Self-Supervised Speech Models via Channel-Wise Correlations. SLT 2022: 1136-1143
- [i21] Niko Brümmer, Albert Swart, Ladislav Mosner, Anna Silnova, Oldrich Plchot, Themos Stafylakis, Lukás Burget:
Probabilistic Spherical Discriminant Analysis: An Alternative to PLDA for length-normalized embeddings. CoRR abs/2203.14893 (2022)
- [i20] Gaëlle Laperrière, Valentin Pelloin, Mickaël Rouvier, Themos Stafylakis, Yannick Estève:
On the Use of Semantically-Aligned Speech Representations for Spoken Language Understanding. CoRR abs/2210.05291 (2022)
- [i19] Themos Stafylakis, Ladislav Mosner, Sofoklis Kakouros, Oldrich Plchot, Lukás Burget, Jan Cernocký:
Extracting speaker and emotion information from self-supervised speech models via channel-wise correlations. CoRR abs/2210.09513 (2022)
- [i18] Junyi Peng, Themos Stafylakis, Rongzhi Gu, Oldrich Plchot, Ladislav Mosner, Lukás Burget, Jan Cernocký:
Parameter-efficient transfer learning of pre-trained Transformer models for speaker verification using adapters. CoRR abs/2210.16032 (2022)
- [i17] Sofoklis Kakouros, Themos Stafylakis, Ladislav Mosner, Lukás Burget:
Speech-based emotion recognition with self-supervised models using attentive channel-wise correlations and label smoothing. CoRR abs/2211.01756 (2022)
- 2021
- [c55] Themos Stafylakis, Johan Rohdin, Lukás Burget:
Speaker Embeddings by Modeling Channel-Wise Correlations. Interspeech 2021: 501-505
- [i16] Themos Stafylakis, Johan Rohdin, Lukás Burget:
Speaker embeddings by modeling channel-wise correlations. CoRR abs/2104.02571 (2021)
- 2020
- [c54] Liliane Momeni, Triantafyllos Afouras, Themos Stafylakis, Samuel Albanie, Andrew Zisserman:
Seeing wake words: Audio-visual Keyword Spotting. BMVC 2020
- [c53] Elisavet Palogiannidi, Ioannis Gkinis, George Mastrapas, Petr Mizera, Themos Stafylakis:
End-to-End Architectures for ASR-Free Spoken Language Understanding. ICASSP 2020: 7974-7978
- [c52] Anna Silnova, Niko Brummer, Johan Rohdin, Themos Stafylakis, Lukás Burget:
Probabilistic Embeddings for Speaker Diarization. Odyssey 2020: 24-31
- [c51] Jahangir Alam, Gilles Boulianne, Lukás Burget, Mohamed Dahmane, Mireia Díez Sánchez, Alicia Lozano-Diez, Ondrej Glembek, Pierre-Luc St-Charles, Marc Lalonde, Pavel Matejka, Petr Mizera, João Monteiro, Ladislav Mosner, Cedric Noiseux, Ondrej Novotný, Oldrich Plchot, Johan Rohdin, Anna Silnova, Josef Slavícek, Themos Stafylakis, Shuai Wang, Hossein Zeinali:
Analysis of ABC Submission to NIST SRE 2019 CMN and VAST Challenge. Odyssey 2020: 289-295
- [i15] Anna Silnova, Niko Brümmer, Johan Rohdin, Themos Stafylakis, Lukás Burget:
Probabilistic embeddings for speaker diarization. CoRR abs/2004.04096 (2020)
- [i14] Liliane Momeni, Triantafyllos Afouras, Themos Stafylakis, Samuel Albanie, Andrew Zisserman:
Seeing wake words: Audio-visual Keyword Spotting. CoRR abs/2009.01225 (2020)
2010 – 2019
- 2019
- [j8] Nooshin Maghsoodi, Hossein Sameti, Hossein Zeinali, Themos Stafylakis:
Speaker Recognition With Random Digit Strings Using Uncertainty Normalized HMM-Based i-Vectors. IEEE ACM Trans. Audio Speech Lang. Process. 27(11): 1815-1825 (2019)
- [c50] Johan Rohdin, Themos Stafylakis, Anna Silnova, Hossein Zeinali, Lukás Burget, Oldrich Plchot:
Speaker Verification Using End-to-end Adversarial Language Adaptation. ICASSP 2019: 6006-6010
- [c49] Hossein Zeinali, Lukás Burget, Johan Rohdin, Themos Stafylakis, Jan Honza Cernocký:
How to Improve Your Speaker Embeddings Extractor in Generic Toolkits. ICASSP 2019: 6141-6145
- [c48] Hossein Zeinali, Themos Stafylakis, Georgia Athanasopoulou, Johan Rohdin, Ioannis Gkinis, Lukás Burget, Jan Cernocký:
Detecting Spoofing Attacks Using VGG and SincNet: BUT-Omilia Submission to ASVspoof 2019 Challenge. INTERSPEECH 2019: 1073-1077
- [c47] Themos Stafylakis, Johan Rohdin, Oldrich Plchot, Petr Mizera, Lukás Burget:
Self-Supervised Speaker Embeddings. INTERSPEECH 2019: 2863-2867
- [c46] Andreas Nautsch, Jose Patino, Amos Treiber, Themos Stafylakis, Petr Mizera, Massimiliano Todisco, Thomas Schneider, Nicholas W. D. Evans:
Privacy-Preserving Speaker Recognition with Cohort Score Normalisation. INTERSPEECH 2019: 2868-2872
- [i13] Themos Stafylakis, Johan Rohdin, Oldrich Plchot, Petr Mizera, Lukás Burget:
Self-supervised speaker embeddings. CoRR abs/1904.03486 (2019)
- [i12] Andreas Nautsch, Jose Patino, Amos Treiber, Themos Stafylakis, Petr Mizera, Massimiliano Todisco, Thomas Schneider, Nicholas W. D. Evans:
Privacy-Preserving Speaker Recognition with Cohort Score Normalisation. CoRR abs/1907.03454 (2019)
- [i11] Nooshin Maghsoodi, Hossein Sameti, Hossein Zeinali, Themos Stafylakis:
Speaker Recognition with Random Digit Strings Using Uncertainty Normalized HMM-based i-vectors. CoRR abs/1907.06111 (2019)
- [i10] Hossein Zeinali, Themos Stafylakis, Georgia Athanasopoulou, Johan Rohdin, Ioannis Gkinis, Lukás Burget, Jan Honza Cernocký:
Detecting Spoofing Attacks Using VGG and SincNet: BUT-Omilia Submission to ASVspoof 2019 Challenge. CoRR abs/1907.12908 (2019)
- 2018
- [j7] Themos Stafylakis, Muhammad Haris Khan, Georgios Tzimiropoulos:
Pushing the boundaries of audiovisual word recognition using Residual Networks and LSTMs. Comput. Vis. Image Underst. 176-177: 22-32 (2018)
- [c45] Themos Stafylakis, Georgios Tzimiropoulos:
Zero-Shot Keyword Spotting for Visual Speech Recognition In-the-wild. ECCV (4) 2018: 536-552
- [c44] Themos Stafylakis, Georgios Tzimiropoulos:
Deep Word Embeddings for Visual Speech Recognition. ICASSP 2018: 4974-4978
- [c43] Stavros Petridis, Themos Stafylakis, Pingchuan Ma, Feipeng Cai, Georgios Tzimiropoulos, Maja Pantic:
End-to-End Audiovisual Speech Recognition. ICASSP 2018: 6548-6552
- [c42] Niko Brummer, Anna Silnova, Lukás Burget, Themos Stafylakis:
Gaussian meta-embeddings for efficient scoring of a heavy-tailed PLDA model. Odyssey 2018: 349-356
- [c41] Hossein Zeinali, Hossein Sameti, Themos Stafylakis:
DeepMine Speech Processing Database: Text-Dependent and Independent Speaker Verification and Speech Recognition in Persian and English. Odyssey 2018: 386-392
- [c40] Stavros Petridis, Themos Stafylakis, Pingchuan Ma, Georgios Tzimiropoulos, Maja Pantic:
Audio-Visual Speech Recognition with a Hybrid CTC/Attention Architecture. SLT 2018: 513-520
- [i9] Stavros Petridis, Themos Stafylakis, Pingchuan Ma, Feipeng Cai, Georgios Tzimiropoulos, Maja Pantic:
End-to-end Audiovisual Speech Recognition. CoRR abs/1802.06424 (2018)
- [i8] Niko Brummer, Anna Silnova, Lukás Burget, Themos Stafylakis:
Gaussian meta-embeddings for efficient scoring of a heavy-tailed PLDA model. CoRR abs/1802.09777 (2018)
- [i7] Themos Stafylakis, Georgios Tzimiropoulos:
Zero-shot keyword spotting for visual speech recognition in-the-wild. CoRR abs/1807.08469 (2018)
- [i6] Stavros Petridis, Themos Stafylakis, Pingchuan Ma, Georgios Tzimiropoulos, Maja Pantic:
Audio-Visual Speech Recognition With A Hybrid CTC/Attention Architecture. CoRR abs/1810.00108 (2018)
- [i5] Themos Stafylakis, Muhammad Haris Khan, Georgios Tzimiropoulos:
Pushing the boundaries of audiovisual word recognition using Residual Networks and LSTMs. CoRR abs/1811.01194 (2018)
- [i4] Hossein Zeinali, Lukás Burget, Johan Rohdin, Themos Stafylakis, Jan Cernocký:
How to Improve Your Speaker Embeddings Extractor in Generic Toolkits. CoRR abs/1811.02066 (2018)
- [i3] Johan Rohdin, Themos Stafylakis, Anna Silnova, Hossein Zeinali, Lukás Burget, Oldrich Plchot:
Speaker verification using end-to-end adversarial language adaptation. CoRR abs/1811.02331 (2018)
- 2017
- [c39] Kong-Aik Lee, Ville Hautamäki, Tomi Kinnunen, Anthony Larcher, Chunlei Zhang, Andreas Nautsch, Themos Stafylakis, Gang Liu, Mickaël Rouvier, Wei Rao, Federico Alegre, J. Ma, Man-Wai Mak, Achintya Kumar Sarkar, Héctor Delgado, Rahim Saeidi, Hagai Aronowitz, Aleksandr Sizov, Hanwu Sun, Trung Hieu Nguyen, Guangsen Wang, Bin Ma, Ville Vestman, Md. Sahidullah, M. Halonen, Anssi Kanervisto, Gaël Le Lan, Fahimeh Bahmaninezhad, Sergey Isadskiy, Christian Rathgeb, Christoph Busch, Georgios Tzimiropoulos, Q. Qian, Z. Wang, Q. Zhao, T. Wang, H. Li, J. Xue, S. Zhu, R. Jin, T. Zhao, Pierre-Michel Bousquet, Moez Ajili, Waad Ben Kheder, Driss Matrouf, Zhi Hao Lim, Chenglin Xu, Haihua Xu, Xiong Xiao, Eng Siong Chng, Benoit G. B. Fauve, Kaavya Sriskandaraja, Vidhyasaharan Sethu, W. W. Lin, Dennis Alexander Lehmann Thomsen, Zheng-Hua Tan, Massimiliano Todisco, Nicholas W. D. Evans, Haizhou Li, John H. L. Hansen, Jean-François Bonastre, Eliathamby Ambikairajah:
The I4U Mega Fusion and Collaboration for NIST Speaker Recognition Evaluation 2016. INTERSPEECH 2017: 1328-1332
- [c38] Themos Stafylakis, Georgios Tzimiropoulos:
Combining Residual Networks with LSTMs for Lipreading. INTERSPEECH 2017: 3652-3656
- [i2] Themos Stafylakis, Georgios Tzimiropoulos:
Combining Residual Networks with LSTMs for Lipreading. CoRR abs/1703.04105 (2017)
- [i1] Themos Stafylakis, Georgios Tzimiropoulos:
Deep word embeddings for visual speech recognition. CoRR abs/1710.11201 (2017)
- 2016
- [j6] Themos Stafylakis, Patrick Kenny, Md. Jahangir Alam, Marcel Kockmann:
Speaker and Channel Factors in Text-Dependent Speaker Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 24(1): 65-78 (2016)
- [j5] Themos Stafylakis, Md. Jahangir Alam, Patrick Kenny:
Text-Dependent Speaker Recognition With Random Digit Strings. IEEE ACM Trans. Audio Speech Lang. Process. 24(7): 1194-1203 (2016)
- [c37] Andreas Nautsch, Hong Hao, Themos Stafylakis, Christian Rathgeb, Christoph Busch:
Towards PLDA-RBM based speaker recognition in mobile environment: Designing stacked/deep PLDA-RBM systems. ICASSP 2016: 5055-5059
- [c36] Gautam Bhattacharya, Patrick Kenny, Jahangir Alam, Themos Stafylakis:
Deep Neural Network based Text-Dependent Speaker Verification: Preliminary Results. Odyssey 2016: 9-15
- [c35] Patrick Kenny, Themos Stafylakis, Jahangir Alam, Vishwa Gupta, Marcel Kockmann:
Uncertainty Modeling Without Subspace Methods For Text-Dependent Speaker Recognition. Odyssey 2016: 16-23
- [c34] Md. Jahangir Alam, Patrick Kenny, Vishwa Gupta, Themos Stafylakis:
Spoofing Detection on the ASVspoof2015 Challenge Corpus Employing Deep Neural Networks. Odyssey 2016: 270-276
- [c33] Themos Stafylakis, Patrick Kenny, Vishwa Gupta, Jahangir Alam, Marcel Kockmann:
Compensation for phonetic nuisance variability in speaker recognition using DNNs. Odyssey 2016: 340-345
- 2015
- [c32] Patrick Kenny, Themos Stafylakis, Jahangir Alam, Marcel Kockmann:
JFA modeling with left-to-right structure and a new backend for text-dependent speaker recognition. ICASSP 2015: 4689-4693
- [c31] Themos Stafylakis, Patrick Kenny, Md. Jahangir Alam, Marcel Kockmann:
JFA for speaker recognition with random digit strings. INTERSPEECH 2015: 190-194
- [c30] Md. Jahangir Alam, Patrick Kenny, Themos Stafylakis:
Combining amplitude and phase-based features for speaker verification with short duration utterances. INTERSPEECH 2015: 249-253
- [c29] Md. Jahangir Alam, Patrick Kenny, Gautam Bhattacharya, Themos Stafylakis:
Development of CRIM system for the automatic speaker verification spoofing and countermeasures challenge 2015. INTERSPEECH 2015: 2072-2076
- [c28] Patrick Kenny, Themos Stafylakis, Md. Jahangir Alam, Marcel Kockmann:
An i-vector backend for speaker verification. INTERSPEECH 2015: 2307-2311
- [c27] Kong-Aik Lee, Anthony Larcher, Guangsen Wang, Patrick Kenny, Niko Brümmer, David A. van Leeuwen, Hagai Aronowitz, Marcel Kockmann, Carlos Vaquero, Bin Ma, Haizhou Li, Themos Stafylakis, Md. Jahangir Alam, Albert Swart, Javier Perez:
The reddots data collection for speaker recognition. INTERSPEECH 2015: 2996-3000
- 2014
- [j4] Mohammed Senoussaoui, Patrick Kenny, Themos Stafylakis, Pierre Dumouchel:
A Study of the Cosine Distance-Based Mean Shift for Telephone Speech Diarization. IEEE ACM Trans. Audio Speech Lang. Process. 22(1): 217-227 (2014)
- [c26] Patrick Kenny, Themos Stafylakis, Pierre Ouellet, Md. Jahangir Alam:
JFA-based front ends for speaker recognition. ICASSP 2014: 1705-1709
- [c25] David Martínez González, Lukás Burget, Themos Stafylakis, Yun Lei, Patrick Kenny, Eduardo Lleida:
Unscented transform for ivector-based noisy speaker recognition. ICASSP 2014: 4042-4046
- [c24] Vishwa Gupta, Patrick Kenny, Pierre Ouellet, Themos Stafylakis:
I-vector-based speaker adaptation of deep neural networks for French broadcast audio transcription. ICASSP 2014: 6334-6338
- [c23] Patrick Kenny, Themos Stafylakis, Md. Jahangir Alam, Pierre Ouellet, Marcel Kockmann:
In-domain versus out-of-domain training for text-dependent JFA. INTERSPEECH 2014: 1332-1336
- [c22] Patrick Kenny, Themos Stafylakis, Pierre Ouellet, Md. Jahangir Alam, Pierre Dumouchel:
Supervised/Unsupervised Voice Activity Detectors for Text-dependent Speaker Recognition on the RSR2015 Corpus. Odyssey 2014: 123-130
- [c21] Patrick Kenny, Themos Stafylakis, Md. Jahangir Alam, Pierre Ouellet, Marcel Kockmann:
Joint Factor Analysis for Text-Dependent Speaker Verification. Odyssey 2014: 200-207
- [c20] Patrick Kenny, Themos Stafylakis, Pierre Ouellet, Vishwa Gupta, Md. Jahangir Alam:
Deep Neural Networks for extracting Baum-Welch statistics for Speaker Recognition. Odyssey 2014: 293-298
- 2013
- [c19] Patrick Kenny, Themos Stafylakis, Pierre Ouellet, Md. Jahangir Alam, Pierre Dumouchel:
PLDA for speaker verification with utterances of arbitrary duration. ICASSP 2013: 7649-7653
- [c18] Mohammed Senoussaoui, Patrick Kenny, Pierre Dumouchel, Themos Stafylakis:
Efficient iterative mean shift based cosine dissimilarity for multi-recording speaker clustering. ICASSP 2013: 7712-7715
- [c17] Themos Stafylakis, Patrick Kenny, Vishwa Gupta, Pierre Dumouchel:
Compensation for inter-frame correlations in speaker diarization and recognition. ICASSP 2013: 7731-7735
- [c16] Themos Stafylakis, Patrick Kenny, Pierre Ouellet, Javier Perez, Marcel Kockmann, Pierre Dumouchel:
Text-dependent speaker recognition using PLDA with uncertainty propagation. INTERSPEECH 2013: 3684-3688
- 2012
- [c15] Aggelos Gkiokas, Vassilis Katsouros, George Carayannis, Themos Stafylakis:
Music tempo estimation and beat tracking by applying source separation and metrical relations. ICASSP 2012: 421-424
- [c14] Themos Stafylakis, Patrick Kenny, Mohammed Senoussaoui, Pierre Dumouchel:
PLDA using Gaussian Restricted Boltzmann Machines with application to Speaker Verification. INTERSPEECH 2012: 1692-1695
- [c13] Themos Stafylakis, Vassilis Katsouros, Patrick Kenny, Pierre Dumouchel:
A mean shift algorithm for manifolds of exponential families. ISSPA 2012: 511-516
- [c12] Themos Stafylakis, Patrick Kenny, Mohammed Senoussaoui, Pierre Dumouchel:
Preliminary investigation of Boltzmann machine classifiers for speaker recognition. Odyssey 2012: 109-116
- [c11] Themos Stafylakis, Vassilis Katsouros, Patrick Kenny, Pierre Dumouchel:
Mean shift algorithm for exponential families with applications to speaker clustering. Odyssey 2012: 324-329
- 2011
- [j3] Frieda Charalabopoulou, Themos Stafylakis, George K. Mikros:
Developing a Scoring Algorithm for Automatic Pronunciation Assessment of Modern Greek. J. Quant. Linguistics 18(1): 1-22 (2011)
- [c10] Themos Stafylakis, Xavier Anguera Miró, Vassilis Katsouros, George Carayannis:
Closed-form expressions vs. BIC: A comparison for speaker clustering. ICASSP 2011: 2228-2231
- [c9] Fotini Simistira, Vassilis Papavassiliou, Themos Stafylakis, Vassilis Katsouros:
Enhancing Handwritten Word Segmentation by Employing Local Spatial Features. ICDAR 2011: 1314-1318
- 2010
- [j2] Themos Stafylakis, Vassilis Katsouros, George Carayannis:
The Segmental Bayesian Information Criterion and Its Applications to Speaker Diarization. IEEE J. Sel. Top. Signal Process. 4(5): 857-866 (2010)
- [j1] Vassilis Papavassiliou, Themos Stafylakis, Vassilis Katsouros, George Carayannis:
Handwritten document image segmentation into text lines and words. Pattern Recognit. 43(1): 369-377 (2010)
- [c8] Themos Stafylakis, Georgios Tzimiropoulos, Vassilis Katsouros, George Carayannis:
A new penalty term for the BIC with respect to speaker diarization. ICASSP 2010: 4978-4981
- [c7] Themos Stafylakis, Xavier Anguera:
Improvements to the equal-parameter BIC for speaker diarization. INTERSPEECH 2010: 314-317
- [c6] Themos Stafylakis, Vassilis Katsouros, George Carayannis:
Speaker clustering via the mean shift algorithm. Odyssey 2010: 33
2000 – 2009
- 2009
- [c5] Themos Stafylakis, Vassilis Katsouros, George Carayannis:
Redefining the Bayesian information criterion for speaker diarisation. INTERSPEECH 2009: 1051-1054
- 2008
- [c4] Iason Demiros, George Carayannis, Vassilios Antonopoulos, Georgios Kambourakis, Vassilios Katsouros, Panayotis Kolevris, Marios Nottas, Harris Papageorgiou, Vassilis Papavassiliou, Spyros Raptis, Fotini Simistira, Themos Stafylakis:
PANOPTIS: A System for Intelligent Monitoring of the Hellenic Broadcast Sector. DEXA Workshops 2008: 605-609
- [c3] Themos Stafylakis, Vassilis Papavassiliou, Vassilis Katsouros, George Carayannis:
Robust text-line and word segmentation for handwritten documents images. ICASSP 2008: 3393-3396
- 2007
- [c2] Themos Stafylakis, Vassilis Katsouros, George Carayannis:
Efficient combination of parametric spaces, models and metrics for speaker diarization. ASRU 2007: 256-261
- [c1] Vassilis Papavassiliou, Themos Stafylakis, Vassilis Katsouros, George Carayannis:
A Parametric Spectral-Based Method for Verification of Text in Videos. ICDAR 2007: 879-883
last updated on 2024-10-22 20:13 CEST by the dblp team
all metadata released as open data under CC0 1.0 license