BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//AES GERMANY - ECPv6.16.2//NONSGML v1.0//EN
CALSCALE:GREGORIAN
METHOD:PUBLISH
X-ORIGINAL-URL:https://aesgermany.org
X-WR-CALDESC:Events for AES GERMANY
REFRESH-INTERVAL;VALUE=DURATION:PT1H
X-Robots-Tag:noindex
X-PUBLISHED-TTL:PT1H
BEGIN:VTIMEZONE
TZID:Europe/Berlin
BEGIN:DAYLIGHT
TZOFFSETFROM:+0100
TZOFFSETTO:+0200
TZNAME:CEST
DTSTART:20230326T010000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:+0200
TZOFFSETTO:+0100
TZNAME:CET
DTSTART:20231029T010000
END:STANDARD
BEGIN:DAYLIGHT
TZOFFSETFROM:+0100
TZOFFSETTO:+0200
TZNAME:CEST
DTSTART:20240331T010000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:+0200
TZOFFSETTO:+0100
TZNAME:CET
DTSTART:20241027T010000
END:STANDARD
BEGIN:DAYLIGHT
TZOFFSETFROM:+0100
TZOFFSETTO:+0200
TZNAME:CEST
DTSTART:20250330T010000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:+0200
TZOFFSETTO:+0100
TZNAME:CET
DTSTART:20251026T010000
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTART;TZID=Europe/Berlin:20241214T000000
DTEND;TZID=Europe/Berlin:20241214T235959
DTSTAMP:20260525T021030
CREATED:20240829T142507Z
LAST-MODIFIED:20240829T142508Z
UID:2878-1734134400-1734220799@aesgermany.org
SUMMARY:NeurIPS 2024 Workshop on Audio Imagination: AI-Driven Speech\, Music\, and Sound Generation
DESCRIPTION:See: https://audio-imagination.github.io/ \n\n\n\nAudio Imagination Workshop \n\n\n\nGenerative AI has been at the forefront of AI research in recent times\, with numerous studies showcasing remarkable and surprising generation capabilities across various modalities such as text\, image\, and audio. Audio Imagination Workshop at NeurIPS 2024 aims to bring the latest advancements in generative AI focusing on audio generation. Audio generation presents unique challenges due to the nature of the audio signal\, its perception by humans\, and its relationship with other modalities like text and visuals. Modern generative methods have brought about new opportunities for solving well-studied audio generation problems\, such as text-to-speech synthesis\, while also leading to explorations of exciting new problems. The workshop seeks to bring together researchers working on different audio generation problems and facilitate concentrated discussions on the topic. It will feature engaging invited talks\, high-quality papers presented through oral and poster sessions\, and a demo session to showcase the current state of audio generation methods. \n\n\n\nCall For Papers \n\n\n\nWe invite submissions for Main Paper and Demo Tracks. Please go to Submission Page for more details. \n\n\n\nFeel free to contact the organizers if you have any question regarding the workshop. \n\n\n\nThe Audio Imagination Workshop at the Thirty-Eighth Annual Conference on Neural Information Processing Systems (NeurIPS 2024) aims to bring together researchers working in the field of generative AI for audio\, speech\, music\, including multimodal generative AI with audio as one of the modalities. \n\n\n\nWe invite researchers to submit papers focusing on\, but not limited to\, the following topics related to audio generation: \n\n\n\n\nTextual prompts and natural language inputs based generation and editing of audio\, such as text-to-speech (i.e.\, speech synthesis)\, text-to-music and text-to-sound\n\n\n\nAudio/Speech in LLMs/Multimodal LLMs\n\n\n\nConnection of audio generation with text generation\, including similarities and differences.\n\n\n\nVideo to Audio/Speech/Music Generation\n\n\n\nMultimodal generation of audio – going beyond unimodal inputs (text/video/audio) to audio — using multiple modalities for generating audio\n\n\n\nData for audio/speech/music generative AI\n\n\n\nGenerative methods for and its impact on established speech tasks such as speech enhancement\, source separation\, voice conversion\, speech to speech translation\, to mention a few\n\n\n\nGeneration of spatial audio and experiences driven by spatial audio.\n\n\n\nGeneration of audio for virtual or augmented reality (VR/AR)\n\n\n\nSynchronized Generation of audio along with visuals\n\n\n\nImpact of generative audio on media and content creation technologies\n\n\n\nInterpretability in generative AI for audio/speech/music.\n\n\n\nResponsibility in generative AI for audio/speech/music.\n\n\n\nNovel applications of audio/speech/music generation\n\n\n\n\nWe welcome submissions from researchers in academia and industry. The workshop will provide a platform for discussing the latest advances in the field and identifying future research directions. \n\n\n\nWe invite submission in two tracks\, Main Paper Track and Demo Track. The submission process and details are outlined below. Please reach out to the organizers for any questions/confusion. \n\n\n\nMain paper track \n\n\n\nThe main paper track is the primary submission track for the Audio Imagination workshop and will facilitate discussions on relevant topics. Accepted papers will be presented through oral talks or poster sessions. Please note that Audio Imagination is an in-person workshop and papers are expected to be presented in person. \n\n\n\nDemo Session \n\n\n\nA key component of the Audio Imagination workshop is that we will also hold a demo session\, where participants will have a chance to showcase their advanced audio generation methods and technologies. The demo track will enable listening experiences for workshop participants which is critical to understand\, evaluate and contextualize generated audio. The demo session will be conducted alongside poster sessions. \n\n\n\nPlease Check Out the Submissions Page for details on paper formatting and submission details. \n\n\n\nImportant Dates \n\n\n\n\nSeptember 18th – Main Paper Submission Deadline\n\n\n\nSeptember 21st – Demo Paper Submission Deadline\n\n\n\nOctober 9th – Paper & Demo Acceptance Notification\n\n\n\nDecember 14th – Workshop \n\n\n\n\nOrganisers: \n\n\n\nAnurag Kumar\, Research Lead and Scientist at Meta\, USA \n\n\n\nZhaoheng Ni\, Research Scientist at Meta\, USA \n\n\n\nYapeng Tian\, Assistant Professor at The University of Texas at Dallas\, USA \n\n\n\nBerrak Sisman\, Assistant professor at The University of Texas at Dallas\, USA \n\n\n\nWenwu Wang\, Professor at University of Surrey\, United Kingdom \n\n\n\nShinji Watanabe\, Associate Professor at Carnegie Mellon University\, USA \n\n\n\nPlease feel free to circulate this call information. Many thanks.   \n\n\n\nBest wishes\,Wenwu  —Wenwu Wang \n\n\n\nProfessor of Signal Processing and Machine Learning \n\n\n\nCentre for Vision Speech and Signal Processing (CVSSP) \n\n\n\n& Surrey Institute for People Centred AI \n\n\n\nUniversity of Surrey \n\n\n\nGuildford\, GU2 7XHUnited KingdomPhone: +44 (0) 1483 686039Fax: +44 (0) 1483 686031Email: w.wang@surrey.ac.uk \n\n\n\nhttps://personalpages.surrey.ac.uk/w.wang/
URL:https://aesgermany.org/event/neurips-2024-workshop-on-audio-imagination-ai-driven-speech-music-and-sound-generation
END:VEVENT
END:VCALENDAR