Visual Question Answering [E-Book] : From Theory to Application / by Qi Wu, Peng Wang, Xin Wang, Xiaodong He, Wenwu Zhu.
Visual Question Answering (VQA) usually combines visual inputs like image and video with a natural language question concerning the input and generates a natural language answer as the output. This is by nature a multi-disciplinary research problem, involving computer vision (CV), natural language p...
Saved in:
Full text |
|
Personal Name(s): | Wu, Qi, author |
He, Xiaodong, author / Wang, Peng, author / Wang, Xin, author / Zhu, Wenwu, author | |
Edition: |
1st edition 2022. |
Imprint: |
Singapore :
Springer,
2022
|
Physical Description: |
XIII, 238 pages 104 illustrations, 92 illustrations in color (online resource) |
Note: |
englisch |
ISBN: |
9789811909641 |
DOI: |
10.1007/978-981-19-0964-1 |
Series Title: |
/* Depending on the record driver, $field may either be an array with
"name" and "number" keys or a flat string containing only the series
name. We should account for both cases to maximize compatibility. */?>
Advances in Computer Vision and Pattern Recognition
|
Subject (LOC): |
- 1. Introduction
- 2. Deep Learning Basics
- 3. Question Answering (QA) Basics
- 4. The Classical Visual Question Answering
- 5. Knowledge-based VQA.