Multimodal Side-Tuning for Document Classification