
Paper2Video: Automated Generation of Academic Presentation Videos from Research Papers

By

jinqueeny

7mo ago · 2 min read · en · Insight

Summary

Paper2Video tackles the labor-intensive work of producing academic presentation videos from research papers. It contributes a benchmark of 101 research papers paired with author-created presentation videos, slides, and speaker metadata, along with four specialized evaluation metrics. The accompanying PaperTalker system, a multi-agent framework, integrates slide generation, layout refinement, cursor grounding, subtitling, speech synthesis, and talking-head rendering. In experiments on the benchmark, its videos are reported to be more faithful and informative than those of existing baselines.

Key quotes

Academic presentation videos have become an essential medium for research communication, yet producing them remains highly labor-intensive, often requiring hours of slide design, recording, and editing for a short 2 to 10 minutes video.
We introduce Paper2Video, the first benchmark of 101 research papers paired with author-created presentation videos, slides, and speaker metadata.
We propose PaperTalker, the first multi-agent framework for academic presentation video generation.
Experiments on Paper2Video demonstrate that the presentation videos produced by our approach are more faithful and informative than existing baselines, establishing a practical step toward automated and ready-to-use academic video generation.
