Home
#Audio, Speech & NLP

#Audio, Speech & NLP

At EMNLP 2023, the Top Conference in the Field of Natural Language Processing, Sony’s R&D activities were presented.
Dialogue Generation Conditional on Predefined Stories: Preliminary Results
- #Audio, Speech & NLP
Self-supervised audio encoder with contrastive pretraining for Respiratory Anomaly Detection
- #Audio, Speech & NLP
Distortion Audio Effects: Learning How to Recover the Clean Signal
- #Audio, Speech & NLP
Automatic Music Mixing with Deep Learning and Out-of-Domain Data
- #Audio, Speech & NLP
Secondary channel estimation in spatial active noise control systems using a single moving higher order microphone
- #Audio, Speech & NLP
360 Virtual Mixing Environment
- #Audio, Speech & NLP
Fast Convergent Method for Active Noise Control Over Spatial Region with Causal Constraint
- #Audio, Speech & NLP
Hierarchical disentangled representation learning for singing voice conversion
- #Audio, Speech & NLP
Sound directivity by PT-symmetric acoustic dipoles
- #Audio, Speech & NLP
PT-symmetric Helmholtz resonator dipoles for sound directivity
- #Audio, Speech & NLP
Sound Quality Improvement of MPEG-H 3D Audio Encoder
- #Audio, Speech & NLP
Similarity-and-Independence-Aware Beamformer: Method for Target Source Extraction using Magnitude Spectrogram as Reference
- #Audio, Speech & NLP
Analytic error control methods for efficient rotation in dynamic binaural rendering of Ambisonics
- #Audio, Speech & NLP
Directional Dependency of Subjective Sound Pressure Perception on Three-Dimensional Sound
- #Audio, Speech & NLP
Metric Learning with Background Noise Class for Few-shot Detection of Rare Sound Events
- #Audio, Speech & NLP
Improving Voice Separation by Incorporating End-To-End Speech Recognition
- #Audio, Speech & NLP
Array-Geometry-Aware Spatial Active Noise Control Based on Direction-of-Arrival Weighting
- #Audio, Speech & NLP