I am Jillur Rahman Saurav, a PhD student at UT Arlington. I am currently working on large-scale medical image processing at the Luber Lab. I earned a B.Sc. (Engg.) in Computer Science and Engineering from Shahjalal University of Science and Technology, Sylhet, Bangladesh.

I was very fortunate to be supervised by Professor M. Shahidur Rahman for my bachelor's thesis. I also worked as a software engineer at pipilika.com, the very first search engine in Bangladesh, where I researched different Bangla NLP problems under the guidance of Dr. Farida Chowdhury and Dr. Ruhul Amin.

  • Deep Learning
  • Data Science
  • Natural Language Processing
  • Computer Vision
  • Computational Social Science

  • PhD in Computer Science, 2021-present
    University of Texas at Arlington

  • B.Sc. (Engg.) in Computer Science, 2018
    Shahjalal University of Science & Technology

  • Software engineer, 2018-2021

Current Research

Observing the Unobserved: A Newspaper Based Dengue Surveillance System for the Low-Income Regions of Bangladesh

Nazia Tasnim, Md. Istiak Hossain Shihab, Moqsadur Rahman, Jillur Rahman Saurav, Sheikh Rabiul Islam and Mohammad Ruhul Amin.
FLAIRS-34 | Florida Artificial Intelligence Research Society Conference 2021


Word Completion and Sequence Prediction in Bangla Language Using Trie and a Hybrid Approach of Sequential LSTM and N-gram
Soumik Sarker, Md. Ekramul Islam, Jillur Rahman Saurav, Md Mahadi Hasan Nahid.
ICAICT'20 | International Conference on Advanced Information and Communication Technology (ICAICT)

Query Expansion for Bangla Search Engine Pipilika
Md. Rezaul Islam, Jillur Rahman Saurav, Mahbubur Rub Talha, Farida Chowdhury.
TENSYMP'20 | IEEE Region 10 Symposium

End to End Parts of Speech Tagging and Named Entity Recognition in Bangla Language
Jillur Rahman Saurav, Summit Haque, Farida Chowdhury.
ICBSLP'19 | International Conference on Bangla Speech and Language Processing (ICBSLP) 2019

Bangla Speech Recognition for Voice Search
Jillur Rahman Saurav, Shakhawat Amin, Shafkat Kibria, M. Shahidur Rahman.
ICBSLP'18 | International Conference on Bangla Speech and Language Processing (ICBSLP) 2018


News Aggregator Service for Bangla Newspapers
Tasks included designing the architecture, developing a generic parser, clustering and categorizing news, and extracting summaries.
Technology: Django, Scrapy, Elasticsearch, Keras, Redis, Docker

Knowledge graph based on Bangladesh’s national portals data
Built a knowledge graph from the data on Bangladesh’s national portals (5,552 web portals) by analyzing the text of the entities on the websites using K-means clustering and the nearest-neighbour method.
Technology: Python, Elasticsearch, sklearn, Docker

Context-aware spell checker for Bangla language
Worked as a team member developing a BK-tree and n-gram based spell checker for the Bangla language.
Technology: Spring Boot, Apache Solr

Query Analysis
Developed a deep-learning-based query classifier to understand search queries, and implemented autocomplete and related-search features.
Technology: Flask, Keras, Elasticsearch

Data Analytics for a COVID-19 Self-Screening Tool
Performed statistical analyses on data from a COVID-19 self-screening tool (535,291 participants), including association analysis among symptoms, symptom clustering, identification of danger zones, and correlation with COVID-19 case counts.
Technology: Pandas, Sklearn, Matplotlib

Sentiment Analysis Dataset for Bangla language
Worked as a team member developing the largest sentiment analysis dataset for the Bangla language. Tasks included scraping data from various sources, cleaning the data, and selecting data for annotation.
Technology: Scrapy, Pandas

Computer Vision Projects
Worked on several computer vision projects. Tasks included real-time object detection, reverse image search, and image captioning in the Bangla language.
Technology: OpenCV, Keras