上一条: Deep Video Harmonization with Color Mapping Consistency
下一条: Action-Aware Embedding Enhancement for Image-Text Retrieval