My main research is on the intersection of natual language processing (NLP), computer vision (CV), and speech.