Mentoring

Undergrad and MS students whom I co-mentored with Prof. C.V. Jawahar.

2021 – 2023

High level understanding of videos; VQA over images with text — DocVQA, InfographicVQA, etc.

2021 – 2023

Video VQA — text-based video question answering and understanding video scenes through text.

2020

Medical VQA

2018 – 2021

Road Text — recognition of text on roads and its application to navigation / autonomous driving.

2016 – 2018

Handwritten text recognition

2016 – 2018

Unconstrained scene text recognition in a seq2seq framework

2015 – 2016

Scene text detection and recognition

Summer 2014

RNN + CTC on GPU

Dec 2013

DAISY Audio Book Library and Playback — web and desktop apps