DocKylin

Visual Document Understanding Library

A reimplementation of key DocKylin modules to improve visual document understanding with efficient visual processing

[AAAI 2025] DocKylin: A Large Multimodal Model for Visual Document Understanding with Efficient Visual Slimming

GitHub

9 stars
2 watching
0 forks
Language: Python
last commit: about 1 month ago