How This AI Model Generates Singing Avatars From Lyrics. Post date: August 7, 2025. Post author: Phonology Technology. Post categories: ai-generated-music, autoregressive-transformer, human-motion-synthesis, multimodal-transformers, rapverse-dataset, smpl-x-motion-data, text-to-motion-ai, text-to-speech-models
Joint Modeling of Text, Audio, and 3D Motion Using RapVerse. Post date: August 7, 2025. Post author: Phonology Technology. Post categories: ai-generated-music, autoregressive-transformer, human-motion-synthesis, multimodal-transformers, rapverse-dataset, smpl-x-motion-data, text-to-motion-ai, text-to-speech-models
This AI Turns Lyrics Into Fully Synced Song and Dance Performances. Post date: August 7, 2025. Post author: Phonology Technology. Post categories: ai-generated-music, autoregressive-transformer, human-motion-synthesis, multimodal-transformers, rapverse-dataset, smpl-x-motion-data, text-to-motion-ai, text-to-speech-models
Text-to-Rap AI Turns Lyrics Into Vocals, Gestures, and Facial Expressions. Post date: August 7, 2025. Post author: Phonology Technology. Post categories: ai-generated-music, autoregressive-transformer, human-motion-synthesis, multimodal-transformers, rapverse-dataset, smpl-x-motion-data, text-to-motion-ai, text-to-speech-models
A Multimodal Dataset for Synthesizing Rap Vocals and 3D Motion. Post date: August 7, 2025. Post author: Phonology Technology. Post categories: ai-generated-music, autoregressive-transformer, human-motion-synthesis, multimodal-transformers, rapverse-dataset, smpl-x-motion-data, text-to-motion-ai, text-to-speech-models
The RapVerse Dataset: A New Benchmark in Text-to-Music and Motion Generation. Post date: August 7, 2025. Post author: Phonology Technology. Post categories: ai-generated-music, autoregressive-transformer, human-motion-synthesis, multimodal-transformers, rapverse-dataset, smpl-x-motion-data, text-to-motion-ai, text-to-speech-models
A Single Prompt Will Have This AI Rapping and Dancing. Post date: August 7, 2025. Post author: Phonology Technology. Post categories: ai-generated-music, autoregressive-transformer, hackernoon-top-story, human-motion-synthesis, multimodal-transformers, rapverse-dataset, text-to-motion-ai, text-to-speech-models
What It Takes to Train a Versatile Speech AI System. Post date: June 20, 2025. Post author: Phonology Technology. Post categories: audio-language-model, automatic-speech-recognition, generalization-capability, instruction-finetuning, multimodal-learning, multitask-learning, speech-processing, zero-shot-learning
How We Pre-Trained a 300M Parameter Audio Encoder With Random Quantization. Post date: June 19, 2025. Post author: Phonology Technology. Post categories: audio-language-model, automatic-speech-recognition, generalization-capability, instruction-finetuning, multimodal-learning, multitask-learning, speech-processing, zero-shot-learning
A Unified Multimodal Approach to Speech Processing with LLMs. Post date: June 19, 2025. Post author: Phonology Technology. Post categories: audio-language-model, automatic-speech-recognition, generalization-capability, instruction-finetuning, multimodal-learning, multitask-learning, speech-processing, zero-shot-learning
Adaptive Attacks Expose SLM Vulnerabilities and Qualitative Insights. Post date: February 6, 2025. Post author: Phonology Technology. Post categories: adversarial-attacks, black-box-attacks, jailbreaking, large-language-models, multimodal-models, robustness-countermeasures, speech-language-models, white-box-attacks
Transfer Attacks Reveal SLM Vulnerabilities and Effective Noise Defenses. Post date: February 6, 2025. Post author: Phonology Technology. Post categories: adversarial-attacks, black-box-attacks, jailbreaking, large-language-models, multimodal-models, robustness-countermeasures, speech-language-models, white-box-attacks
Cross-Prompt Attacks and Data Ablations Impact SLM Robustness. Post date: February 6, 2025. Post author: Phonology Technology. Post categories: adversarial-attacks, black-box-attacks, jailbreaking, large-language-models, multimodal-models, robustness-countermeasures, speech-language-models, white-box-attacks
Safety Alignment and Jailbreak Attacks Challenge Modern LLMs. Post date: February 6, 2025. Post author: Phonology Technology. Post categories: adversarial-attacks, black-box-attacks, jailbreaking, large-language-models, multimodal-models, robustness-countermeasures, speech-language-models, white-box-attacks
Audio Encoder Pre-training and Evaluation Enhance SLM Safety. Post date: February 6, 2025. Post author: Phonology Technology. Post categories: adversarial-attacks, black-box-attacks, jailbreaking, large-language-models, multimodal-models, robustness-countermeasures, speech-language-models, white-box-attacks
Integrated Speech Language Models Face Critical Safety Vulnerabilities. Post date: February 6, 2025. Post author: Phonology Technology. Post categories: adversarial-attacks, black-box-attacks, jailbreaking, large-language-models, multimodal-models, robustness-countermeasures, speech-language-models, white-box-attacks
SpeechVerse Unites Audio Encoder and LLM for Superior Spoken QA. Post date: February 6, 2025. Post author: Phonology Technology. Post categories: adversarial-attacks, black-box-attacks, jailbreaking, large-language-models, multimodal-models, robustness-countermeasures, speech-language-models, white-box-attacks
Unified Speech and Language Models Can Be Vulnerable to Adversarial Attacks. Post date: February 6, 2025. Post author: Phonology Technology. Post categories: adversarial-attacks, black-box-attacks, jailbreaking, large-language-models, robustness-countermeasures, speech-language-models, spoken-question-answering, white-box-attacks
SLMs Outperform Competitors Yet Suffer Rapid Adversarial Jailbreaks. Post date: February 6, 2025. Post author: Phonology Technology. Post categories: adversarial-attacks, black-box-attacks, jailbreaking, large-language-models, multimodal-models, robustness-countermeasures, speech-language-models, white-box-attacks
Adversarial Settings and Random Noise Reveal Speech LLM Vulnerabilities. Post date: February 6, 2025. Post author: Phonology Technology. Post categories: adversarial-attacks, black-box-attacks, jailbreaking, large-language-models, multimodal-models, robustness-countermeasures, speech-language-models, white-box-attacks
Datasets and Evaluation Define the Robustness of Speech Language Models. Post date: February 6, 2025. Post author: Phonology Technology. Post categories: adversarial-attacks, black-box-attacks, jailbreaking, large-language-models, multimodal-models, robustness-countermeasures, speech-language-models, white-box-attacks
Adversarial Attacks Challenge the Integrity of Speech Language Models. Post date: February 6, 2025. Post author: Phonology Technology. Post categories: adversarial-attacks, black-box-attacks, jailbreaking, large-language-models, multimodal-models, robustness-countermeasures, speech-language-models, white-box-attacks
AccentFold: Enhancing Accent Recognition – Conclusion, Limitations, and References. Post date: August 28, 2024. Post author: Phonology Technology. Post categories: accentfold, african-accents, asr-adaptation, computational-linguistics, language-technology, linguistic-embeddings, phonological-patterns, speech recognition
AccentFold: Enhancing Accent Recognition – Empirical Study of AccentFold. Post date: August 28, 2024. Post author: Phonology Technology. Post categories: accentfold, african-accents, asr-adaptation, computational-linguistics, language-technology, linguistic-embeddings, phonological-patterns, speech recognition
AccentFold: Enhancing Accent Recognition – What Information Does AccentFold capture? Post date: August 28, 2024. Post author: Phonology Technology. Post categories: accentfold, african-accents, asr-adaptation, computational-linguistics, language-technology, linguistic-embeddings, phonological-patterns, speech recognition
AccentFold: Enhancing Accent Recognition – AccentFold. Post date: August 28, 2024. Post author: Phonology Technology. Post categories: accentfold, african-accents, asr-adaptation, computational-linguistics, language-technology, linguistic-embeddings, phonological-patterns, speech recognition
AccentFold: Enhancing Accent Recognition – Related Work. Post date: August 28, 2024. Post author: Phonology Technology. Post categories: accentfold, african-accents, asr-adaptation, computational-linguistics, language-technology, linguistic-embeddings, phonological-patterns, speech recognition
AccentFold: Enhancing Accent Recognition – Abstract and Introduction. Post date: August 28, 2024. Post author: Phonology Technology. Post categories: accentfold, african-accents, asr-adaptation, computational-linguistics, language-technology, linguistic-embeddings, phonological-patterns, speech recognition