SYSTEMS AND METHODS FOR CAPTURING AND PROCESSING SCREEN-RECORDED USER-SPECIFIC RECOMMENDED OUTPUT, DIGITAL ADVERTISEMENTS, AND AI-GENERATED MIXED MEDIA
Inventors
MANH NGUYEN, BRICE GOWER, AKASH CHOUDHARY, ETHAN CABRAL, MENGSHU NIE, ANNE-MARIE MULUMBA, WASALA Rankothge Waruna Gayan KULAWANSHA, JOSEPH ALBI, SAACHI BAGDE, HO SHING KWONG, Kareem Rahaman, Akash Sidhu, Amir Ali Vahid Kassiri
Abstract
A system and method for analyzing screen-recorded personalized digital content using on-device computer vision and generative AI. A user captures content from a recommender system interface, such as screen activity or browser-rendered content, optionally with concurrent voice commentary. On-device processing generates intermediate representations using CLIP-style embeddings, OCR text, and transcribed audio tokens, flagging advertisements, changes in content, diversity in content, and similarities in content. A compact generative AI model produces metadata summaries, which users may annotate with tags or comments. A composite metadata package is transmitted to a cloud system, while raw media is deleted. The invention enables privacy-preserving, bandwidth-efficient insight into recommender-based media and user feedback.
CPC Classifications
Filing Date
2025-07-09
Application No.
19264714