We introduce JavisDiT, a novel, state-of-the-art Joint Audio-Video Diffusion Transformer designed for synchronized audio-video generation (JAVG) from open-ended user prompts. We hope to set a new standard for ...
Abstract: In an era overwhelmed by information, efficiently extracting relevant data from audio sources is crucial. This study introduces WhisperSum, a solution combining OpenAI's Whisper for accurate ...
Abstract: Given the rapid increase of textual data in various fields, text summarization has become essential for efficient information handling. Over recent decades, numerous methods have been ...