Speech and Computer [E-Book] : 25th International Conference, SPECOM 2023, Dharwad, India, November 29 - December 2, 2023, Proceedings, Part II / edited by Alexey Karpov, K. Samudravijaya, K. T. Deepak, Rajesh M. Hegde, Shyam S. Agrawal, S. R. Mahadeva Prasanna.
The two-volume proceedings set LNAI 14338 and 14339 constitutes the refereed proceedings of the 25th International Conference on Speech and Computer, SPECOM 2023, held in Dharwad, India, during November 29-December 2, 2023. The 94 papers included in these proceedings were carefully reviewed and sele...
Saved in:
Full text |
|
Personal Name(s): | Agrawal, Shyam S., editor |
Deepak, K. T., editor / Hegde, Rajesh M., editor / Karpov, Alexey, editor / Prasanna, S. R. Mahadeva, editor / Samudravijaya, K., editor | |
Edition: |
1st edition 2023. |
Imprint: |
Cham :
Springer,
2023
|
Physical Description: |
XXVI, 568 pages 195 illustrations, 141 illustrations in color (online resource) |
Note: |
englisch |
ISBN: |
9783031483127 |
DOI: |
10.1007/978-3-031-48312-7 |
Series Title: |
/* Depending on the record driver, $field may either be an array with
"name" and "number" keys or a flat string containing only the series
name. We should account for both cases to maximize compatibility. */?>
Lecture Notes in Artificial Intelligence ;
14339 /* Depending on the record driver, $field may either be an array with "name" and "number" keys or a flat string containing only the series name. We should account for both cases to maximize compatibility. */?> Lecture Notes in Computer Science |
Subject (LOC): |
- Industrial Speech and Language Technology
- Analysing Breathing Patterns in Reading and Spontaneous Speech
- Audio-Visual Speaker Verification via Joint Cross Attention
- A Novel Scheme to Classify Read and Spontaneous Speech
- Analysis of a Hinglish ASR System's Performance for Fraud Detection
- Anomaly Detection in Speech: A Comprehensive Approach for Enhanced Speech Analysis
- CAPTuring Accents: An Approach to Personalize Pronunciation Training for Learners with Different L1 Backgrounds
- Speech Technology for Under-Resourced Languages
- Improvements in Language Modeling, Voice Activity Detection, and Lexicon in OpenASR21 Low Resource Languages
- Phone Durations Modeling for Livvi-Karelian ASR
- Significance of Indic Self-Supervised Speech Representations for Indic Under-Resourced ASR
- Study of Various End-to-End Keyword Spotting Systems on the Bengali language under Low-Resource Condition
- Bridging the Gap: Towards Linguistic Resource Development for the Low-Resource Lambani Language
- Studying the Effect of Frame-Level Concatenation of GFCC and TS-MFCC Features on Zero-Shot Children's ASR
- Code-Mixed Text-to-Speech Synthesis under Low-Resource Constraints
- An End-to-End TTS Model in Chhattisgarhi, a Low-Resource Indian Language
- An ASR Corpus in Chhattisgarhi, a Low Resource Indian Language
- Cross Lingual Style Transfer using Multiscale Loss Function for Soliga: A Low Resource Tribal Language
- Preliminary Analysis of Lambani Vowels and Vowel Classification using Acoustic Feature
- Curriculum Learning based Approach for Faster Convergence of TTS Model
- Rhythm Measures and Language Endangerment: the Case of Deori
- Konkani Phonetic Transcription System 1.0
- Speech Analysis and Synthesis
- E-TTS: Expressive Text-to-Speech Synthesis for Hindi using Data Augmentation
- Direct vs Cascaded Speech-to-Speech Translation using Transformer
- Deep Learning based Speech Quality Assessment Focusing on Noise Effects
- Quantifying the Emotional Landscape of Music with Three Dimensions
- Analysis of Mandarin vs. English Language for Emotional Voice Conversion
- Audio DeepFake Detection Employing Multiple Parametric Exponential Linear Units
- A Comparison of Learned Representations with Jointly Optimized VAE and DNN for Syllable Stress Detection
- On the Asymptotic Behaviour of the Speech Signal
- Improvement of Audio-Visual Keyword Spotting System Accuracy using Excitation Source Feature
- Developing a Question Answering System on the material of Holocaust survivors' testimonies in Russian
- Enhancing Children's Short Utterance based ASV using Data Augmentation Techniques and Feature Concatenation Approach
- Studying the Effectiveness of Data Augmentation and Frequency-Domain Linear Prediction Coefficients in Children's Speaker Verification under Low-Resource Conditions
- Constant-Q based Harmonic and Pitch Features for Normal vs Pathological Infant Cry Classification
- Robustness of Whisper Features for Infant Cry Classification
- Speaker and Language Identification, Verification, and Diarization
- I-MSV 2022: Indic-Multilingual and Multi-Sensor Speaker Verification Challenge
- Multi-Task Learning over Mixup Variants for the Speaker Verification Task
- Exploring the Impact of Different Approaches for Spoken Dialect Identification of Konkani Language
- Adversarially Trained Hierarchical Attention Network for Domain-Invariant Spoken Language Identification
- Ensemble of Incremental System Enhancements for Robust Speaker Diarization in Code-Switched Real-Life Audios
- Enhancing Language Identification in Indian Context through Exploiting Learned Features with Wav2Vec2.0
- Design and Development of Voice OTP Authentication System
- End-to-End Native Language Identification using a Modified Vision Transformer(ViT) from L2 English Speech
- Dialect Identification in Ao using Modulation-based Representation
- Self-Supervised Speaker Verification Employing Augmentation Mix and Self-Augmented Training-based Clustering. .