Posts by Collection

publications

TEAM-Atreides at SemEval-2022 Task 11: On leveraging data augmentation and ensemble to recognize complex Named Entities in Bangla

Published in SemEval, 2022

A SemEval team paper on Bangla named entity recognition using augmentation and model ensembles.

Recommended citation: Tasnim, N., Shihab, M. I., Sushmit, A., Bethard, S., Sadeque, F. (2022). "TEAM-Atreides at SemEval-2022 Task 11: On leveraging data augmentation and ensemble to recognize complex Named Entities in Bangla." SemEval.
Download Paper

Cardiac CT motion artifact grading via semi-automatic labeling and vessel tracking using synthetic image-augmented training data

Published in Journal of X-Ray Science and Technology, 2022

A method for grading motion artifacts in cardiac CT using synthetic image augmentation and vessel tracking.

Recommended citation: Xu, Y., Sushmit, A., Lyu, Q., …, Yu, H. (2022). "Cardiac CT motion artifact grading via semi-automatic labeling and vessel tracking using synthetic image-augmented training data." Journal of X-Ray Science and Technology.
Download Paper

Towards Santali linguistic inclusion: Building the first Santali-to-English translation model using mT5 Transformer and data augmentation

Published in ACL Workshop, 2024

The first Santali-to-English translation model built with mT5 and data augmentation.

Recommended citation: Billah, S. M. M., Subarna, A. A., …, Sushmit, A. (2024). "Towards Santali linguistic inclusion: Building the first Santali-to-English translation model using mT5 Transformer and data augmentation." ACL Workshop.
Download Paper

IPA transcription of Bengali texts

Published in Speech and Language, 2024

IPA transcription tools and evaluation for Bengali text data.

Recommended citation: Fatema, K., Haider, F. D., …, Sushmit, A. (2024). "IPA transcription of Bengali texts." Speech and Language.
Download Paper

Abugida normalizer and parser for Unicode texts

Published in Software/Toolkit, 2025

An Abugida normalizer and parser for Unicode texts across Indic scripts.

Recommended citation: Ansary, M. N., Adib, Q. A. R., …, Sushmit, A. (2025). "Abugida normalizer and parser for Unicode texts." Software/Toolkit.
Download Paper

talks

Workshop: MATLAB and Image Processing

A six-day hands-on workshop on image processing and deep learning, teaching practical MATLAB workflows and AI methods to university students.

Champion, IEEE VIP CUP 2018

Won the IEEE VIP CUP 2018 and presented a 3D CT lung tumor detection and segmentation pipeline at ICIP 2018 in Greece.

Runner-up, IEEE VIP CUP 2019

Runner-up at IEEE VIP CUP 2019 while coaching the BUET Synapticans and tutoring six undergraduate students on a privacy-aware activity classification system using first-person video.

NVIDIA Titan Xp GPU Grant

Awarded an NVIDIA Titan Xp GPU through the NVIDIA GPU Grant program to support data science research and experimentation.

Bengali Language Technology Roadmap

A conference presentation on Bengali language technology history, research challenges, and an inclusive roadmap for regional AI development.

Guest Lecture: Introduction to Data Science

Guest lecture and course design support for SWE 227 Introduction to Data Science, covering core concepts in data science, model evaluation, and practical applications.

Public Lecture: Demystifying AI

A public lecture on demystifying artificial intelligence, covering AI ethics, applied machine learning, and implications for policy and society in Bangladesh.

Kaggle Best Community Competition Award

Awarded the Kaggle Best Community Competition Award for organizing the DL Sprint competition on Bengali Automatic Speech Recognition, which drew 59 participating teams.

Host, National ASR Hackathon

Served as host, data curator, and judge for the National ASR Hackathon, organized by Bengali.AI and BUET CSE.

Selected Delegate, ITU-T AI Capacity Building Workshop

Selected as a delegate for an ITU-T AI capacity building workshop, engaging with regulators and ministry officials on AI ethics, risk governance, and the integration of ITU-T recommendations into national digital strategy.

teaching

Coordinator, Bengali.AI

Research Leadership and Mentorship, Bengali.AI, 2020

Coordinated community-driven AI research initiatives, launched large-scale Bengali datasets and competitions, crowdsourced speech and sign language corpora, and mentored emerging researchers on dataset creation, annotation protocols, and reproducible research workflows.

VC Fellow and Lecturer

University teaching, Brac University, Department of Computer Science and Engineering, 2021

Taught courses in Artificial Intelligence, Complex Variables & Laplace Transformations, and Numerical Analysis. Supervised research, graded coursework, and provided student consultation and academic support.

Graduate Research & Teaching Assistant

Graduate teaching assistantship, Rensselaer Polytechnic Institute, Biomedical Engineering, 2021

Supported undergraduate and graduate teaching activities in biomedical imaging, including lab sessions, grading, academic mentoring, and collaborative research supervision.

Technical Trainer

Government capacity building, Bangladesh Korea Institute of Information & Communication Technology (BKIICT), Bangladesh Computer Council, 2026

Delivered public-sector capacity building to over 300 government officials on national digital policy, system interoperability, ICT ordinance implementation, and regulatory framework adoption. Designed and delivered a course on Data Analytics and Data-Driven Decision Making covering data collection methodologies, statistical analysis, and real-time visualization. Instructed public-sector professionals on cybersecurity, personal data protection, and digital hygiene to strengthen institutional risk posture. Fostered strategic data literacy by teaching participants to evaluate data source integrity, challenge operational assumptions, and translate evidence into organizational decisions.