About me

Hello, I’m Pankaj Choudhury, currently pursuing my PhD at the Centre for Linguistic Science and Technology, IIT Guwahati. My research topic is “Automatic Image Caption Generation in the Assamese Language”. My thesis supervisors are Dr. Prithwijit Guha and Prof. Sukumar Nandi.

Education

Ph.D., CLST, Indian Institute of Technology Guwahati
M.Tech., CSE, Assam University, Silchar (May 2016)
B.Tech., CSE, Don Bosco College of Engg & Tech, Guwahati (July 2012)

Research

Automatic image caption generation is the process of generating a textual description of an image. It combines Computer Vision and Natural Language Processing: given an input image, the model uses Computer Vision to understand the image content and then uses Natural Language Processing to generate a semantically and syntactically correct description. My research goal is to generate captions in Assamese, a low-resource language. I am also interested in related topics such as visual question answering and face detection.

Publications

P. Choudhury, P. Guha, S. Nandi, “Image Caption Synthesis for Low Resource Assamese Language using Bi-LSTM with Bilinear Attention.” 37th Pacific Asia Conference on Language, Information and Computation (PACLIC 37), 2023 (Accepted)
P. Choudhury, P. Guha, S. Nandi, “Relevance of Language-Specific Training on Image Caption Synthesis for Low Resource Assamese Language.” 8th International Conference on Asian Language Processing (IALP-2023), 2023 (Accepted)
Y. Aggarwal, P. Choudhury, P. Guha, “Face Detection in Challenging Scenes with a Customized Backbone.” 8th International Conference on Computer Vision & Image Processing (CVIP-2023), 2023 (Accepted)

Contact

Address: CLST Central Library 3rd floor,

IIT Guwahati,

Assam-781039

Email: pankajchoudhury@iitg.ac.in