Hello there! I'm Xiang An.
I'm a computer engineer.
  
- Fully Open Framework for Democratized Multimodal Training:
  LLaVA-OneVision-1.5: Fully Open Framework for Democratized Multimodal Training
- Enhancing the OCR and localization capabilities of large pretrained Vision Transformers (ViT), ICCV 2025:
  RICE: Region-based Cluster Discrimination for Visual Representation Learning
- Large-scale vision model trained on LAION400M and COYO700M with weak supervision, ECCV 2024:
  MLCD: Multi-label Cluster Discrimination for Visual Representation Learning
- Large-scale vision model trained on LAION400M with weak supervision, ICLR 2023:
  UNICOM: Universal and Compact Representation Learning for Image Retrieval
- Distributed and hybrid-parallel large-scale classification algorithm, trained on WebFace260M, CVPR 2022:
  Partial FC: Efficient and Robust Training of Face Recognition CNNs by Partial FC

If you are interested in joining our team, please send your resume to xiangan@deepglint.com. We look forward to advancing the frontiers of Vision and Video Pretraining together with you!





