DocKylin
Visual Document Understanding Library
A reimplementation of the key DocKylin modules for visual document understanding with efficient visual processing
[AAAI 2025] DocKylin: A Large Multimodal Model for Visual Document Understanding with Efficient Visual Slimming
9 stars
2 watching
0 forks
Language: Python
Last commit: about 1 month ago