Samiul Alam

PhD Candidate, Ohio State University.


Hi there! I am a PhD candidate at Ohio State University, where I work on improving privacy and fairness in deep learning. Until 2021, I worked as a Software Engineer at Samsung Research and Development Institute in Bangladesh. I graduated from Bangladesh University of Engineering and Technology in 2017.

I am one of the co-founders of bengali.ai. I lead the collection and standardisation efforts behind popular open-source Bengali natural language and OCR datasets such as NumtaDB, Bengali Graphemes, and the Common Voice Bengali Speech Dataset.

I am deeply passionate about my work and regularly look for novel applications of it. As a result, I have worked across a wide range of application areas. If you have any questions about my research or have any relevant ideas in mind, feel free to e-mail me.

news

Sep 15, 2025 Two of my papers have been accepted at NeurIPS 2025!
Aug 15, 2025 Happy to announce that I wrapped up my PhD internship at Google.
May 17, 2024 My work on multi-state filtering has been published in PLOS ONE.
May 12, 2024 Our survey paper on Efficient LLMs has been accepted at TMLR!
Dec 14, 2023 My research on multi-channel skin conductance deconvolution was published in the IEEE Open Journal of Engineering in Medicine and Biology.

selected publications

  1. Efficient Large Language Models: A Survey
    Zhongwei Wan, Xin Wang, Che Liu, and 9 more authors
    Transactions on Machine Learning Research, 2024
    Survey Certification
  2. Sparse Multichannel Decomposition of Electrodermal Activity With Physiological Priors
    Samiul Alam, Md. Rafiul Amin, and Rose T. Faghih
    IEEE Open Journal of Engineering in Medicine and Biology, 2023
  3. OOD-Speech: A Large Bengali Speech Recognition Dataset for Out-of-Distribution Benchmarking
    Fazle Rabbi Rakib, Souhardya Saha Dip, Samiul Alam, and 11 more authors
    In Proc. INTERSPEECH 2023, 2023
  4. FedRolex: Model-Heterogeneous Federated Learning with Rolling Sub-Model Extraction
    Samiul Alam, Luyang Liu, Ming Yan, and 1 more author
    In Advances in Neural Information Processing Systems, 2022
  5. Bengali Common Voice Speech Dataset for Automatic Speech Recognition
    Samiul Alam, Asif Sushmit, Zaowad Abdullah, and 6 more authors
    arXiv preprint arXiv:2206.14053, 2022
  6. A Large Multi-Target Dataset of Common Bengali Handwritten Graphemes
    Samiul Alam, Tahsin Reasat, Asif Shahriyar Sushmit, and 4 more authors
    2021
  7. NumtaDB - Assembled Bengali Handwritten Digits
    Samiul Alam, Tahsin Reasat, Rashed Mohammad Doha, and 1 more author
    arXiv preprint arXiv:1806.02452, 2018