ChatCam: Embracing LLMs for Contextual Chatting-to-Camera with Interest-Oriented Video Summarization

Keywords: Automatic summarization, Computer science, Multimedia, Artificial intelligence
Authors
Kaijie Xiao, Yi Gao, Fu Li, Weifeng Xu, P. H. Chen, Weifeng Xu
Source
Journal: Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies [Association for Computing Machinery]
Volume (Issue): 8 (4): 1-34
Identifier
DOI: 10.1145/3699731
Abstract

Cameras are ubiquitous in society, with users increasingly looking to extract insights about the physical world. Current human-to-camera interaction methods, while advanced, do not yet support the intuitive, conversational interaction one would expect in human-to-human communication. To achieve a more natural interaction between humans and cameras, we propose a novel contextual chatting-to-camera paradigm. This paradigm allows users to interact with the camera in natural language, including expressing interests and asking questions. In response, the camera can customize specific tasks tailored to these interests and attempt to answer the questions asked. We design ChatCam, which embraces LLMs for contextual chatting-to-camera with interest-oriented video summarization. Using a novel prompt with an actor-critic LLM approach, ChatCam can understand users' interests and translate them into tasks and objects. ChatCam can also customize relevant models on the resource-constrained edge, with the help of a multi-modal large language model and deep reinforcement learning, while maintaining high accuracy. Results show that ChatCam achieves an improvement of up to 43.9% in understanding user interests and 21.1% in model accuracy compared to state-of-the-art methods in multiple settings. Various examples and a user study also demonstrate the effectiveness of ChatCam in practice.
