Publications

You can also find my articles on my Google Scholar profile.

ABMAMBA: Multimodal Large Language Model with Aligned Hierarchical Bidirectional Scan for Efficient Video Captioning

D. Yashima, S. Kurita, Y. Oda, S. Suzuki, S. Otsuki, and K. Sugiura

ICPR 2026 • 2026 (h5-index: 68)

Paper

HiFlow: Tokenization-Free Scale-Wise Autoregressive Policy Learning via Flow Matching

D. Yashima, K. Seno, S. Kurita, Y. Oda, and K. Sugiura

arXiv • 2026

Paper

AnoleVLA: Lightweight Vision-Language-Action Model with Deep State Space Models for Mobile Manipulation

Y. Takagi, M. Kambara, D. Yashima, K. Seno, K. Tokura, and K. Sugiura

arXiv • 2026

Paper

ReMoRa: Multimodal Large Language Model based on Refined Motion Representation for Long-Video Understanding

D. Yashima, S. Kurita, Y. Oda, and K. Sugiura

CVPR 2026 • 2026 (Acceptance Rate: 25.42%, h5-index: 450)

Paper

NaiLIA: Multimodal Nail Design Retrieval Based on Dense Intent Descriptions and Palette Queries

K. Amemiya, D. Yashima, K. Katsumata, T. Komatsu, R. Korekata, S. Otsuki, and K. Sugiura

CVPR 2026 Findings • 2026 (Acceptance Rate (main + findings): 36%, h5-index: 450)

Paper Code

AIRoA MoMa Dataset: A Large-Scale Hierarchical Dataset for Mobile Manipulation

R. Takanami, P. Khrapchenkov, S. Morikuni, J. Arima, Y. Takaba, S. Maeda, T. Okubo, G. Sano, S. Sekioka, A. Kadoya, M. Kambara, N. Nishiura, H. Suzuki, T. Yoshimoto, K. Sakamoto, S. Ono, H. Yang, D. Yashima, A. Horo, T. Motoda, K. Chiyoma, H. Ito, K. Fukuda, A. Goto, K. Morinaga, Y. Ikeda, R. Kawada, M. Yoshikawa, N. Kosuge, Y. Noguchi, K. Ota, T. Matsushima, Y. Iwasawa, Y. Matsuo, and T. Ogata

arXiv • 2025

Paper Code

Open-Vocabulary Mobile Manipulation Based on Double Relaxed Contrastive Learning With Dense Labeling

D. Yashima, R. Korekata, and K. Sugiura

IEEE RA-L • 2025 (IF: 5.2, h5-index: 132)

Paper Code

Mobile Manipulation Instruction Generation from Multiple Images with Automatic Metric Enhancement

K. Katsumata, M. Kambara, D. Yashima, R. Korekata, and K. Sugiura

IEEE RA-L • 2025 (IF: 5.2, h5-index: 132)

Paper Code

Daichi Yashima

Publications