Logo image
FSFVE: Few Shot Compressed Face Video Enhancement

FSFVE: Few Shot Compressed Face Video Enhancement

Varun Ramesh Jois, Antonella DiLillo James Storer
DCC (Los Alamitos, Calif.), pp.133-142
03/24/2026
Central Processing Unit Equations face enhancement Faces Modeling Printing Radio access networks realtime Regional area networks Timing Training videoconferencing Videos
Videocalling has become a popular form of communication in the world today, with many companies providing free services for it. However, there are still millions of people around the world that experience poor quality videocalls due to limitations in bandwidth. This despite, most people having the required hardware. In this paper we present a novel framework for enhancing highly compressed videocalls. We show, that with as little as 10 frames of the face, we can rapidly (in under 100 seconds) train a model to enhance that instance of the videocall. The model can be trained either prior to or during the call, enhancing the rest of the call by producing better quality video. The video conferencing application need not be modified - it can be off the shelf with our system as a layer on top that trains quickly then simply lets the video conferencing application (e.g. Zoom) run as usual, where our system intercepts and improves images before they are displayed. The model is designed to run in realtime on low-compute devices such as a typical laptop CPU. Experimentally, we show that the model significantly improves quality of compressed face video both quantitatively as well as perceptually. Code can be found at https://github.com/varun-jois/FSFVE.
1
Logo image