466 |
Jheng, Duen-Chian ; Harchan, Bill Louis ; Kostka de Sztemberg, Berenika Nawoja ; Hsu, Jen-Hao ; Hu, Min-Chun |
Badminton Footwork Practice via an Immersive Virtual Reality System |
468 |
Wattasseril, Jobin Idiculla; Döllner, Jürgen |
SelectSum: Topic-Based Selective Summarization of Speech-Based Videos |
469 |
Hamanaka, Masatoshi |
Real-time Visualizer for Turntablist Performance |
470 |
Gan, Wenbin; Dao, Minh-Son; Zettsu, Koji |
DriveCoach: Smart Driving Assistance with Multimodal Risk Prediction and Risk Adaptive Behavior Recommendation |
472 |
Fernandez Roblero, Jaime Boanerjes ; Ali, Muhammad Intizar |
System Demo of Modeling Smart University Campus Virtual Environments |
473 |
Mohamed Serouis, Ibrahim; Sèdes, Florence |
AMDA: Advancing Multimedia Data Annotation for human-centric situations |
475 |
Peng, Hung-Yao; Zhong, Zi-Heng; Tsai, Cheng-Chih; Chiang, Ching-Yeh; Pan, Tse-Yu |
FencBuddy: Action-aware Depth Perception Training for Fencing Attacks |
477 |
Izumi, Kota; Yanai, Keiji |
WaveFontStyler: Font Style Transfer Based on Sound |
479 |
Korb, Martin; Bailer, Werner |
Training a Segmentation-based Visual Anonymization Service for Street Scenes |
480 |
Kawanishi, Yasutomo; Nakamura, Yutaka; Shintani, Taiken; Ishi, Carlos T.; Kawano, Seiya; Yoshino, Koichiro; Minato, Takashi; Minoh, Michihiko |
RoboDJ: Live Commentary Robots System Driven by Physical- and Cyber-world Observations |
481 |
Chiang, Yung-Chu ; Tang, Zi-Xian ; Luo, Yi-Ching ; Chang, Jason S. |
CleverFox: Integrating Visual Mnemonics with AI for Enhanced Language Learning |
482 |
Iino, Nami ; Iino, Akinaru |
Fingering Prediction for Classical Guitar: Dataset Creation and Model Development |
483 |
Kitahara, Tetsuro ; Tsutsumi, Takuya ; Nagoshi, Takaaki ; Suzuki, Taizan |
An Implementation of Networked JamSketch |
485 |
Garcia Contreras, Angel Fernando ; Chang, Wen-Yu ; Kawano, Seiya ; Chen, Yun-Nung ; Yoshino, Koichiro |
Using Language Models to Generate and Forget the Narrative Memories of an Assistive Robot |
486 |
Borgli, Hanna ; Stensland, Håkon Kvale ; Halvorsen, Pål |
Better Image Segmentation with Classification: Guiding Zero-Shot Models Using Class Activation Maps |
487 |
Li, Bohan ; Li, Xingyi ; Liang, Yangwen ; Wang, Shuangquan ; Song, Kee-Bong |
Leveraging Latent Diffusion in 3D Gaussian Splatting for Novel View Synthesis |
488 |
Limberg, Christian ; Zhang, Zhe ; Kastner, Marc A. |
Transformer-Based Audio Generation Conditioned by 2D Latent Maps: A Demonstration |
489 |
Yuan, Honghui; Yanai, Keiji |
KuzushijiFontDiff: Diffusion Model for Japanese Kuzushiji Font Generation |
490 |
Yuan, Honghui; Yanai, Keiji |
SceneTextStyler: Editing Text with Style Transformation |
492 |
Lynch, Kelley ; Rim, Kyeongmin ; King, Owen ; Pustejovsky, James |
Multimodal Interoperability with the CLAMS Platform |
493 |
Kontostathis, Ioannis; Apostolidis, Evlampios; Apostolidis, Konstantinos; Mezaris, Vasileios |
Enhancing User Control in AI-Based Video Summarization for Social Media |
494 |
Khan, Omar Shahbaz ; Duane, Aaron ; Hasnan, Hariz ; Blavec, Noé Le ; Ouvrard, Pierre ; Verdon, Johan ; d’Orazio, Laurent ; Thierry, Constance ; Jónsson, Björn Þór |
Multi-Dimensional Exploration of Media Collection Metadata |
496 |
Huang, Wei-Lun ; Hidayati, Shintami Chusnul ; Pan, Tse-Yu |
Movie Retrieval Systems Using Genre-guided Multimodal Learning Techniques |
497 |
Kongmeesub, Onanong; Gurrin, Cathal; Nie, Dongyun |
A User Identification and Reading Style Detection System Based on Eye Movement Patterns During Reading |
484 |
Le, Duy-Dong ; Huynh, Duy-Thanh ; Bao, Pham The |
Federated Learning with Multimodal-Sensing and Knowledge Distillation: An application on real-world benchmark dataset |
499 |
Vu, Dang ; Dang, Tien ; Nguyen, Quoc-Trung ; Pham, Tan |
Efficient Deployment of Multimodal AI Models: Leveraging Pruning, Quantization and Multi-Objective Optimization for Edge Computing |