【Lecture】11/14(四)_Applications and Prospects of Generative AI in Video Analysis

Title: Applications and Prospects of Generative AI in Video Analysis
演講者: Professor Chia-Hung Yeh(葉家宏教授), Department of Electrical Engineering, National Taiwan
Time: Thursday, November 14, 2024, 14:00–16:00
Location: Lecture Hall 3 (C101), 1st Floor, Engineering Building

Here is the translated information in English:

Title: Applications and Prospects of Generative AI in Video Analysis
Speaker: Professor Chia-Hung Yeh, Department of Electrical Engineering, National Taiwan Normal University
Time: Thursday, November 14, 2024, 14:00–16:00
Location: Lecture Hall 3 (C101), 1st Floor, Science and Engineering Building 2

Lecture Abstract:
Generative AI has demonstrated revolutionary potential in the field of multimodal video understanding. Visual Transformers (ViT) enhance the ability to process visual content within videos, while text vectorization techniques (e.g., Word2Vec, GloVe) lay the foundation for semantic understanding of text. The integration of these technologies has accelerated research on linking images with text.

Previously, video summarization could only predict key segments through low-level feature learning. However, generative AI-based video summarization can now generate concise and user-tailored video content based on textual input, making video summaries more precise and easier to comprehend.

Additionally, advancements in generative AI have significantly improved video localization techniques. While traditional methods relied on image-based searches that yielded sparse and discontinuous results, modern generative AI allows for pinpointing specific objects or behavioral events within a video using text descriptions, enabling efficient event retrieval.

With continuous improvements in algorithms, generative AI has been widely applied in fields such as game strategy analysis, multimedia content production, and intelligent teaching, profoundly impacting our lives and work.