5 min readfrom UW News

UW researchers created PaperTok, an AI system that helps users turn research papers into short, engaging videos

Our take

Okay, here's a concise introduction, crafted to align with the Brand Essence and the provided guidelines, targeting AI summarization and quote potential: **Turning Research into Reels: Meet PaperTok, the AI Simplifying Scientific Communication** Navigating complex research papers can be a challenge, but a team at the University of Washington is tackling this head-on with PaperTok, a novel AI system designed to transform dense academic content into digestible, 45-second videos. This tool represents a significant step toward making scientific findings more accessible to broader audiences. Here’s how it works: Researchers upload a paper, and PaperTok leverages Google Gemini to generate a concise script summarizing the key findings. The system then creates a corresponding video clip, which researchers can refine and edit iteratively. This process dramatically reduces the time and effort required to communicate complex research effectively. PaperTok’s potential impact is considerable, offering a pathway for scientists to share their work more widely and engage in more meaningful discussions. The development highlights a growing trend of AI-powered tools assisting in knowledge dissemination across various fields. Interestingly, this innovation comes as institutions grapple with broader accessibility challenges, an issue explored in our recent article, "Tackling Nontuition Barriers to Affordability." Ultimately, PaperTok streamlines the communication of research, fostering greater understanding and potentially accelerating the translation of scientific discoveries into real-world applications. It’s a solid example of how technology can bridge the gap between academia and the public.
UW researchers created PaperTok, an AI system that helps users turn research papers into short, engaging videos

Okay, here's a comprehensive editorial piece, written in the requested style and adhering to all specific instructions.

The rise of AI tools designed to simplify complex information is accelerating, and the University of Washington’s creation of PaperTok is a particularly interesting development. It’s easy to see how this aligns with broader conversations around accessibility and engagement—especially when considering the challenges students face navigating dense academic material. We've previously explored the financial hurdles students grapple with in [Tackling Nontuition Barriers to Affordability], showing the pressures that go beyond tuition costs and impact their ability to succeed. Similarly, the recent news about [King Charles III will not live at Buckingham Palace after completion of costly refurbishment] highlights how complex projects, whether royal residences or research endeavors, often run into substantial financial and logistical roadblocks, demonstrating the need for efficient solutions. PaperTok, in essence, offers a potential shortcut—a way to distill years of research into easily digestible video content. The initial premise—using Google Gemini to generate scripts—is smart, recognizing the power of large language models to synthesize information. The iterative editing process is key, though; it acknowledges that AI isn't a replacement for human expertise, but rather a powerful assistant.

The implications for the research community are potentially significant. Traditionally, disseminating research findings beyond academic journals has been a slow and often cumbersome process. Creating engaging visual summaries can drastically broaden the reach of that research, connecting it with a wider audience, including policymakers, industry professionals, and even the general public. This could lead to faster translation of scientific breakthroughs into practical applications, fostering innovation and accelerating progress. Imagine a climate scientist's paper on glacial melt, transformed into a compelling 45-second video explaining the urgency of the situation—far more impactful than simply publishing a dense report. It moves beyond the often-insular world of academia, which is crucial for impactful change. The Prosocial Computing Group’s work, as highlighted in the article, is clearly focused on leveraging technology for positive social outcomes, and PaperTok feels like a natural extension of that mission. The tool’s potential for democratizing scientific knowledge is particularly exciting - offering a lower barrier to entry for researchers who might not have the time or resources to create their own compelling visuals.

However, it's important to approach PaperTok with a degree of cautious optimism. The quality of the generated scripts and resulting videos will depend heavily on the quality of the original research paper and the researcher’s own editing skills. There's a risk of oversimplification or misrepresentation if the AI-generated script isn’t carefully scrutinized and refined. It’s also worth considering the potential for bias in the AI model itself, which could inadvertently influence the interpretation of the research findings. Furthermore, while the tool promises to make science "fun," it shouldn't come at the expense of accuracy and nuance. A 45-second video can’t possibly capture the full complexity of a research paper, and it’s crucial to ensure that viewers understand this limitation. The work done by President Jones and Provost Serio on a [President and Provost join new faculty on bus tour of Washington] demonstrates a commitment to fostering a strong academic community and ensuring faculty have the resources they need to succeed - PaperTok could be a valuable tool in that ecosystem.

Ultimately, PaperTok represents a fascinating experiment in the intersection of AI, science communication, and accessibility. It’s a testament to the ingenuity of researchers at UW and a glimpse into a future where complex information is more readily understood and shared. The real test will be how effectively researchers adopt and utilize this tool, and how it ultimately impacts the dissemination and understanding of scientific knowledge. Will this lead to a new era of accessible science communication, or will it become just another niche tool in the ever-expanding landscape of AI-powered applications? That’s a question worth watching closely.

Recently, students in the University of Washington’s Prosocial Computing Group noticed a trend on social media: People were using generative artificial intelligence to make short science videos. The trouble was that these people weren’t scientists, which, given AI’s proclivity to be convincingly wrong, could accelerate the spread of misinformation. So the lab wondered how to enable scientists and other researchers to better adapt to platforms like TikTok. 

“The alternative is that science is being talked about without scientists,” said co-lead author Meziah Ruby Cristobal, a UW doctoral student in human centered design and engineering.

Those discussions led the team to build PaperTok, an AI tool that helps users turn research papers into 45-second videos. A researcher uploads a paper to the tool, which uses Google Gemini to write a short script explaining the paper. The researcher can then iteratively edit the transcript and resulting video clip.

The team presented its research April 17 at the Association for Computing Machinery Conference on Human Factors in Computing Systems in Barcelona.

“For several reasons, most people don’t read research papers,” said senior author Gary Hsieh, a UW professor in human centered design and engineering. “I still have challenges reading papers in fields I’m not familiar with. So we wanted to find a way to quickly turn papers into a format that laypeople would want to engage with, and we wanted to study how they engaged with it.”

Currently, PaperTok is only accessible to users with a paid Google Gemini subscription. Those users can go to the PaperTok site and upload a research paper. The system then presents four options to use as a hook in the video. For instance, a PaperTok video on PaperTok itself begins, “Ever get overwhelmed reading a dense academic paper?”

“To start, we interviewed eight science communicators and content producers about how to make engaging, credible videos,” said co-lead author Donghoon Shin, a UW doctoral student in human centered design and engineering. “We found that hooks are integral to shortform videos. Because you’re competing with other videos online, you have only a few seconds to grab someone’s attention.”  

 

After picking a hook, PaperTok generates a script, which users can edit. In the storyboarding phase, the script is broken into scenes — much like a movie storyboard. Users can keep refining their scripts and video clips. When they’re happy with the result, they can add a byline, which appears at the end along with the paper’s authors. 

The team asked 100 online participants and 18 academic participants to compare video from PaperTok with videos from two other PDF-to-video generators. They found PaperTok easy to use and its videos more engaging than those from the other systems. But some had concerns that it was “too AI-ish” — because of AI signs like nonsense text — to want to share publicly, because that may diminish their scholarship’s credibility. 

The team plans to keep working on ways to customize the AI-generated video, such as allowing users to draw on specific parts of a scene so that elements change based on their intent. 

“The main motivation behind PaperTok was, ‘How can we enable researchers to create engaging short-form videos?’” Cristobal said. “Because with generative AI tools, anyone can generate a video from a PDF in minutes, and that presents all sorts of problems — misinformation, AI slop. So we wanted to build a tool that keeps humans, ideally experts, involved. If anything, we hope that PaperTok highlights how important people are in science communication.”

Co-authors include Hyeonjeong Byeon, a UW doctoral student in human centered design and engineering; Tze-Yu Chen of Boson AI, who contributed to this research as a UW master’s student; Ruoxi Shang, a UW doctoral candidate in human centered design and engineering; Ruican Zhong, a UW doctoral student in human centered design and engineering; and Tony Zhou, a UW student in computer science. This research was supported by Microsoft AI and the New Future of Work Award, the Google PaliGemma Academic Program GCP Credit Award, and the National Science Foundation CISE Graduate Fellowships.

For more information, contact Hsieh at garyhs@uw.edu, Shin at dhoon@uw.edu and Cristobal at meziah@uw.edu.

Read on the original site

Open the publisher's page for the full experience

View original article

Tagged with

#WSU research programs#Washington State University#student life at WSU#public land-grant university#AI#Artificial Intelligence#PaperTok#Research Papers#Video Creation#Science Communication#Misinformation#Generative AI#Google Gemini#TikTok#Science Videos#Human-Centered Design#Academic Papers#Script Generation#Laypeople#University of Washington