Publications | Osaka University Institute for Datability Science

Zongshang Pang, Yuta Nakashima, Mayu Otani, Hajime Nagahara (2024). Revisiting Pixel-Level Contrastive Pre-Training on Scene Images. Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision.

Ryosuke Kawamura, Hideaki Hayashi, Noriko Takemura, Hajime Nagahara (2024). MIDAS: Mixing Ambiguous Data With Soft Labels for Dynamic Facial Expression Recognition. Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision.

Jiahao Zhang, Bowen Wang, Liangzhi Li, Yuta Nakashima, Hajime Nagahara (2024). Instruct Me More! Random Prompting for Visual In-Context Learning. Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision.

Ryoya Mizuno, Keita Takahashi, Michitaka Yoshida, Chihiro Tsutake, Toshiaki Fujii, Hajime Nagahara (2024). Compressive Acquisition of Light Field Video Using Aperture-Exposure-Coded Camera. ITE Transactions on Media Technology and Applications.

Noa Garcia, Yusuke Hirota, Yankun Wu, Yuta Nakashima (2023). Uncurated image-text datasets: Shedding light on demographic bias. Proc. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

Mayu Otani, Riku Togashi, Yu Sawai, Ryosuke Ishigami, Yuta Nakashima, Esa Rahtu, Janne Heikkilä, Shin’ichi Satoh (2023). Toward verifiable and reproducible human evaluation for text-to-image generation. Proc. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

Yankun Wu, Yuta Nakashima, Noa Garcia (2023). Not only generative art: Stable diffusion for content-style disentanglement in art analysis. Proc. 2023 ACM International Conference on Multimedia Retrieval (ICMR).

Zekun Yang, Yuta Nakashima, Haruo Takemura (2023). Multi-modal humor segment prediction in video. Multimedia Systems.

Yusuke Hirota, Yuta Nakashima, Noa Garcia (2023). Model-agnostic gender debiased image captioning. Proc. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

Bowen Wang, Liangzhi Li, Yuta Nakashima, Hajime Nagahara (2023). Learning bottleneck concepts in image classification. Proc. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

Guillaume Habault, Minh-Son Dao, Michael Alexander Riegler, Duc Tien Dang Nguyen, Yuta Nakashima, Cathal Gurrin (2023). ICDAR’23: Intelligent Cross-Data Analysis and Retrieval. Proc. ACM International Conference on Multimedia Retrieval.

Bowen Wang, Liangzhi Li, Yuta Nakashima, Ryo Kawasaki, Hajime Nagahara (2023). Real-time estimation of the remaining surgery duration for cataract surgery using deep convolutional neural networks and long short-term memory. BMC Medical Informatics and Decision Making.

Chenhao Li, Trung Thanh Ngo, Hajime Nagahara (2023). Inverse Rendering of Translucent Objects using Physical and Neural Renderers. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.

Thuong Nguyen Canh, Trung Thanh Ngo, Hajime Nagahara (2023). Human-Imperceptible Identification With Learnable Lensless Imaging. IEEE Access.

Kiichi Goto, Taikan Suehara, Tamaki Yoshioka, Masakazu Kurata, Hajime Nagahara, Yuta Nakashima, Noriko Takemura, Masako Iwasaki (2023). Development of a vertex finding algorithm using Recurrent Neural Network. Nuclear Instruments and Methods in Physics Research Section A: Accelerators, Spectrometers, Detectors and Associated Equipment.

Chenhao Li, Yuta Taniguchi, Min Lu, Shin'ichi Konomi, Hajime Nagahara (2023). Cross-language font style transfer. Applied Intelligence.

Zongshang Pang, Yuta Nakashima, Mayu Otani, Hajime Nagahara (2023). Contrastive Losses Are Natural Criteria for Unsupervised Video Summarization. Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision.

Liangzhi Li, Manisha Verma, Bowen Wang, Yuta Nakashima, Hajime Nagahara, Ryo Kawasaki (2023). Automated grading system of retinal arterio-venous crossing patterns: A deep learning approach replicating ophthalmologist’s diagnostic process of arteriolosclerosis. PLOS Digital Health.

Yusuke Hirota, Yuta Nakashima, Noa Garcia (2022). Quantifying Societal Bias Amplification in Image Captioning. Proc. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

Ryoya Mizuno, Keita Takahashi, Michitaka Yoshida, Chihiro Tsutake, Toshiaki Fujii, Hajime Nagahara (2022). Acquiring a Dynamic Light Field Through a Single-Shot Coded Image. Proc. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

Anh-Khoa Vo, Yuta Nakashima (2022). Tone Classification for Political Advertising Video using Multimodal Cues. Proceedings of the 3rd ACM Workshop on Intelligent Cross-Data Analysis and Retrieval.

Manisha Verma, Yuta Nakashima, Noriko Takemura, Hajime Nagahara (2022). Multi-label disengagement and behavior prediction in online learning. Artificial Intelligence in Education: 23rd International Conference, AIED 2022, Durham, UK, July 27–31, 2022, Proceedings, Part I.

Bowen Wang, Liangzhi Li, Manisha Verma, Yuta Nakashima, Ryo Kawasaki, Hajime Nagahara (2022). Match them up: visually explainable few-shot image classification. Applied Intelligence.

Minh-Son Dao, Michael Alexander Riegler, Duc-Tien Dang-Nguyen, Cathal Gurrin, Yuta Nakashima, Mianxiong Dong (2022). ICDAR'22: Intelligent Cross-Data Analysis and Retrieval. Proceedings of the 2022 International Conference on Multimedia Retrieval.

Haruya Suzuki, Sora Tarumoto, Tomoyuki Kajiwara, Takashi Ninomiya, Yuta Nakashima, Hajime Nagahara (2022). Emotional Intensity Estimation based on Writer’s Personality. Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing: Student Research Workshop.

Sudhakar Kumawat, Manisha Verma, Yuta Nakashima, Shanmuganathan Raman (2022). Depthwise spatio-temporal STFT convolutional neural networks for human action recognition. IEEE Trans. Pattern Analysis and Machine Intelligence.

Hitoshi Teshima, Naoki Wake, Diego Thomas, Yuta Nakashima, Hiroshi Kawasaki, Katsushi Ikeuchi (2022). Deep Gesture Generation for Social Robots Using Type-Specific Libraries. 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

Koji Tanaka, Chenhui Chu, Tomoyuki Kajiwara, Yuta Nakashima, Noriko Takemura, Hajime Nagahara, Takao Fujikawa (2022). Corpus Construction for Historical Newspapers: A Case Study on Public Meeting Corpus Construction Using OCR Error Correction. SN Computer Science.

Tianran Wu, Noa Garcia, Mayu Otani, Chenhui Chu, Yuta Nakashima, Haruo Takemura (2021). Transferring domain-agnostic knowledge in video question answering. Proc. British Machine Vision Conference (BMVC).

Bowen Wang, Liangzhi Li, Yuta Nakashima, Takehiro Yamamoto, Hiroaki Ohshima, Yoshiyuki Shoji, Kenro Aihara, Noriko Kando (2021). Image Retrieval by Hierarchy-aware Deep Hashing Based on Multi-task Learning. Proc. ACM International Conference on Multimedia Retrieval (ICMR).

Cheikh Brahim El Vaigh, Noa Garcia, Benjamin Renoust, Chenhui Chu, Yuta Nakashima, Hajime Nagahara (2021). GCNBoost: Artwork Classification by Label Propagation Through a Knowledge Graph. Proc. ACM International Conference on Multimedia Retrieval (ICMR).

Zechen Bai, Yuta Nakashima, Noa Garcia (2021). Explain me the painting: Multi-topic knowledgeable art description generation. Proc. IEEE/CVF International Conference on Computer Vision (ICCV).

Yiming Qian, Cheikh Brahim El Vaigh, Yuta Nakashima, Benjamin Renoust, Hajime Nagahara, Yutaka Fujioka (2021). Built year prediction from Buddha face with heterogeneous labels. Proc. Workshop on Structuring and Understanding of Multimedia Heritage Contents (SUMAC).

Akihiko Sayo, Diego Thomas, Hiroshi Kawasaki, Yuta Nakashima, Katsushi Ikeuchi (2021). PoseRN: A 2D pose refinement network for bias-free multi-view 3D human pose estimation. Proc. International Conference on Image Processing (ICIP).

Manisha Verma, Yuta Nakashima, Hirokazu Kobori, Ryota Takaoka, Noriko Takemura, Tsukasa Kimura, Hajime Nagahara, Masayuki Numao, Kazumitsu Shinohara (2021). Learners' efficiency prediction using facial behavior analysis. Proc. International Conference on Image Processing (ICIP).

Jules Samaran, Noa Garcia, Mayu Otani, Chenhui Chu, Yuta Nakashima (2021). Attending self-attention: A case study of visually grounded supervision in vision-and-language transformers. Proc. Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing: Student Research Workshop.

Zekun Yang, Noa Garcia, Chenhui Chu, Mayu Otani, Yuta Nakashima, Haruo Takemura (2021). A comparative study of language Transformers for video question answering. Neurocomputing.

Tomoyuki Kajiwara, Chenhui Chu, Noriko Takemura, Yuta Nakashima, Hajime Nagahara (2021). WRIME: A new dataset for emotional intensity estimation with subjective and objective annotations. Proc. Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT).

Bowen Wang, Liangzhi Li, Yuta Nakashima, Ryo Kawasaki, Hajime Nagahara, Yasushi Yagi (2021). Noisy-LSTM: Improving temporal awareness for video semantic segmentation. IEEE Access.

Yuta Kayatani, Zekun Yang, Mayu Otani, Noa Garcia, Chenhui Chu, Yuta Nakashima, Haruo Takemura (2021). The laughing machine: Predicting humor in video. Proceedings - IEEE/CVF Winter Conference on Applications of Computer Vision (WACV).

Noboru Babaguchi, Isao Echizen, Junichi Yamagishi, Naoko Nitta, Yuta Nakashima, Kazuaki Nakamura, Kazuhiro Kono, Seiko Myojin, Fuming Fang, Zhenzhong Kuang, Huy H Nguyen, Ngoc-Dung T Tieu (2021). Preventing fake information generation against media clone attacks. IEICE Transactions on Information and Systems.

Isao Echizen, Noboru Babaguchi, Junichi Yamagishi, Naoko Nitta, Yuta Nakashima, Kazuaki Nakamura, Kazuhiro Kono, Fuming Fang, Seiko Myojin, Zhenzhong Kuang, Huy H Nguyen, Ngoc-Dung T Tieu (2021). Generation and detection of media clones. IEICE Transactions on Information and Systems.

Kohei Sakai, Yasutaka Inagaki, Keita Takahashi, Toshiaki Fujii, Hajime Nagahara (2021). CFA Handling and Quality Analysis for Compressive Light Field Camera. ITE Transactions on Media Technology and Applications.