Naoyuki Kanda
Title
Cited by
Cited by
Year
Elastic spectral distortion for low resource speech recognition with deep neural networks
N Kanda, R Takeda, Y Obuchi
Automatic Speech Recognition and Understanding (ASRU), 2013 IEEE Workshop on …, 2013
852013
A two-layer model for behavior and dialogue planning in conversational service robots
M Nakano, Y Hasegawa, K Nakadai, T Nakamura, J Takeuchi, T Torii, ...
2005 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2005
652005
CHiME-6 Challenge: Tackling multispeaker speech recognition for unsegmented recordings
S Watanabe, M Mandel, J Barker, E Vincent, A Arora, X Chang, ...
arXiv preprint arXiv:2004.09249, 2020
602020
Multi-domain spoken dialogue system with extensibility and robustness against speech recognition errors
K Komatani, N Kanda, M Nakano, K Nakadai, H Tsujino, T Ogata, ...
Proceedings of the 7th SIGdial Workshop on Discourse and Dialogue, 9-17, 2006
502006
End-to-end neural speaker diarization with permutation-free objectives
Y Fujita, N Kanda, S Horiguchi, K Nagamatsu, S Watanabe
arXiv preprint arXiv:1909.05952, 2019
482019
End-to-end neural speaker diarization with self-attention
Y Fujita, N Kanda, S Horiguchi, Y Xue, K Nagamatsu, S Watanabe
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019
472019
Maximum a posteriori Based Decoding for CTC Acoustic Models
N Kanda, X Lu, H Kawai
Interspeech 2016, 1868-1872, 2016
422016
The Hitachi/JHU CHiME-5 system: Advances in speech recognition for everyday home environments using multiple microphone arrays
N Kanda, R Ikeshita, S Horiguchi, Y Fujita, K Nagamatsu, X Wang, ...
Proc. CHiME-5, 6-10, 2018
402018
A multi-expert model for dialogue and behavior control of conversational robots and agents
M Nakano, Y Hasegawa, K Funakoshi, J Takeuchi, T Torii, K Nakadai, ...
Knowledge-Based Systems 24 (2), 248-256, 2011
402011
Open-vocabulary keyword detection from super-large scale speech database
N Kanda, H Sagawa, T Sumiyoshi, Y Obuchi
2008 IEEE 10th Workshop on Multimedia Signal Processing, 939-944, 2008
382008
Investigation of lattice-free maximum mutual information-based acoustic models with sequence-level Kullback-Leibler divergence
N Kanda, Y Fujita, K Nagamatsu
2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 69-76, 2017
262017
Guided source separation meets a strong ASR backend: Hitachi/Paderborn University joint investigation for dinner party ASR
N Kanda, C Boeddeker, J Heitkaemper, Y Fujita, S Horiguchi, ...
arXiv preprint arXiv:1905.12230, 2019
252019
Acoustic modeling for distant multi-talker speech recognition with single-and multi-channel branches
N Kanda, Y Fujita, S Horiguchi, R Ikeshita, K Nagamatsu, S Watanabe
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
242019
Lattice-free State-level Minimum Bayes Risk Training of Acoustic Models
N Kanda, Y Fujita, K Nagamatsu
Interspeech 2018, 2923-2927, 2018
242018
Contextual constraints based on dialogue models in database search task for spoken dialogue systems
K Komatani, N Kanda, T Ogata, HG Okuno
Proc. European Conf. Speech Commun. & Tech.(EUROSPEECH), 877-880, 2005
202005
Face-voice matching using cross-modal embeddings
S Horiguchi, N Kanda, K Nagamatsu
Proceedings of the 26th ACM international conference on Multimedia, 1011-1019, 2018
172018
Serialized output training for end-to-end overlapped speech recognition
N Kanda, Y Gaur, X Wang, Z Meng, T Yoshioka
arXiv preprint arXiv:2003.12687, 2020
132020
多段リスコアリングに基づく大規模音声中の任意検索語検出
神田直之, 住吉貴志, 小窪浩明, 佐川浩彦, 大淵康成
電子情報通信学会論文誌 D 95 (4), 969-981, 2012
132012
Joint speaker counting, speech recognition, and speaker identification for overlapped speech of any number of speakers
N Kanda, Y Gaur, X Wang, Z Meng, Z Chen, T Zhou, T Yoshioka
arXiv preprint arXiv:2006.10930, 2020
122020
Simultaneous speech recognition and speaker diarization for monaural dialogue recordings with target-speaker acoustic models
N Kanda, S Horiguchi, Y Fujita, Y Xue, K Nagamatsu, S Watanabe
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 31-38, 2019
122019
The system can't perform the operation now. Try again later.
Articles 1–20