Md Akil Raihan Iftee

I am a Research Assistant at the Center for Computational & Data Sciences (CCDS), Independent University, Bangladesh. I completed my Bachelor's degree in the Department of Computer Science and Engineering, Khulna University of Engineering & Technology, Khulna, with a CGPA of 3.73/4.00. I have hands-on experience in both academic research and teaching, gained through roles as a Research Assistant and Teaching Assistant/Lecturer at several institutions.
My research is supervised by Prof. Dr. Amin Ahsan Ali and Prof. Dr. AKM Mahbubur Rahman from CCDS. Currently, my research and projects focus on Multimodal Learning (LLM-based foundation models), Trustworthy ML, Federated Learning, and Continual/Adaptive Learning Systems. I am fortunate to collaborate with and receive mentorship from Md Mofijul Islam, Aman Chadha, Sajib Mistri, and Ankur Sarker (all collaborators of CCDS).
I am actively seeking fully funded graduate (Ph.D./Master's) or Visiting Student Researcher positions in strong CS research labs around the globe.
News
| Date | News |
|---|---|
| Jul 10, 2025 | Our SloMo-Fast and pFedBBN papers are now available online. Please have a look. |
| Jul 10, 2025 | My research papers have reached 200+ citations. See my Google Scholar. |
| May 23, 2025 | Two of our papers, BD Open LULC Map and RGC-BENT, have been accepted at the IEEE International Conference on Image Processing (ICIP 2025). |
| Mar 31, 2025 | Our FedCTTA paper has been accepted at IJCNN 2025. |
| Nov 27, 2024 | Three of my papers have been accepted for presentation at the 27th International Conference on Computer and Information Technology (ICCIT 2024). |
| Nov 01, 2024 | Joined the Center for Computational & Data Sciences as a Research Assistant. |
| Mar 11, 2024 | Started an internship at the Center for Computational & Data Sciences. |
| Feb 24, 2024 | Successfully defended my undergraduate thesis under the supervision of Sk. Imran Hossain. |
Selected Publications
Research Areas
Multimodal Learning:
- Context-Aware Cross-Modal Alignment for HAR Using LLMs and Wearable Sensors (pdf)
A multimodal (video, language, sensor) model in which video features come from a context-prompt-guided Video-LLaVA, with Keyless Attention used for sensor feature fusion (a minimal sketch of the attention module follows).
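Keyless attention pools a variable-length sensor sequence into a single vector without an external query: each timestep scores itself, and the sequence collapses into a weighted sum. A minimal PyTorch sketch; the dimensions and the `KeylessAttention` module name are illustrative, not taken from the paper:

```python
import torch
import torch.nn as nn

class KeylessAttention(nn.Module):
    """Attention pooling without an external query: each timestep scores
    itself, and the sequence is collapsed into a weighted sum."""
    def __init__(self, feat_dim: int, hidden_dim: int = 64):
        super().__init__()
        self.score = nn.Sequential(
            nn.Linear(feat_dim, hidden_dim),
            nn.Tanh(),
            nn.Linear(hidden_dim, 1),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, time, feat_dim)
        weights = torch.softmax(self.score(x), dim=1)  # (batch, time, 1)
        return (weights * x).sum(dim=1)                # (batch, feat_dim)

# Hypothetical usage: pool accelerometer windows into one vector per window.
acc = torch.randn(8, 50, 128)  # 8 windows, 50 timesteps, 128-d features
fused = KeylessAttention(128)(acc)
print(fused.shape)  # torch.Size([8, 128])
```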
Generative AI:
- Diffusion-Based Image Editing with Vision-Language Instructions (pdf)
An instruction-driven image-editing tool with three stages: a language processor (LLaVA), a segmenter (SAM), and an image editor (Stable Diffusion); a pipeline sketch follows.
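A rough sketch of such a three-stage pipeline, assuming the `segment_anything` and `diffusers` libraries. The `locate_edit_target` function is a hypothetical stand-in for the LLaVA stage, and the file names, checkpoint path, and click point are made up for illustration:

```python
import numpy as np
from PIL import Image
from segment_anything import sam_model_registry, SamPredictor
from diffusers import StableDiffusionInpaintPipeline

def locate_edit_target(image, instruction):
    """Placeholder for the language-processor stage: a VLM such as LLaVA
    would parse the instruction and return the object to edit, a rewritten
    prompt for the editor, and a rough click point on the object."""
    return "dog", "a golden retriever wearing a red scarf", (256, 300)

image = Image.open("scene.png").convert("RGB").resize((512, 512))
target, edit_prompt, point = locate_edit_target(image, "put a red scarf on the dog")

# Segmenter stage: SAM turns the click point into a pixel mask of the target.
sam = sam_model_registry["vit_b"](checkpoint="sam_vit_b.pth")
predictor = SamPredictor(sam)
predictor.set_image(np.array(image))
masks, scores, _ = predictor.predict(
    point_coords=np.array([point]), point_labels=np.array([1])
)
mask = Image.fromarray((masks[scores.argmax()] * 255).astype(np.uint8))

# Editor stage: Stable Diffusion inpaints only the masked region.
pipe = StableDiffusionInpaintPipeline.from_pretrained(
    "runwayml/stable-diffusion-inpainting"
)
edited = pipe(prompt=edit_prompt, image=image, mask_image=mask).images[0]
edited.save("edited.png")
```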
Trustworthy ML, Security & Privacy:
- A Backpropagation-Free Jailbreak Attack on Multimodal Large Language Models
We introduce a black-box jailbreak framework that leverages reinforcement learning with bandit feedback to attack multimodal LLMs without gradient access.
- Privacy-Preserving NSFW Image Generation via Diffusion Unlearning
Protects diffusion models (Stable Diffusion, Flux) from NSFW jailbreaking by forgetting harmful knowledge through machine unlearning.
- The Dark Side of Prompt Tuning: Poisoning Attacks on Vision-Language Models at Test Time
PGD-based poisoning of unlabeled inference data can corrupt on-the-fly prompt tuning in CLIP-based vision-language models.
- Data Stealing Attacks in Federated Learning for Satellite Communication Systems
Investigated the vulnerability of federated learning models in satellite communication systems to data-stealing attacks (model inversion, membership inference).
- Federated Unlearning Attack
Designed a black-box attack in which a malicious client sends unlearning requests targeting other clients' important data, identified via membership inference attacks.
- Gradient Inversion Attack in Test Time Adaptation (pdf)
Investigates privacy vulnerabilities during model adaptation by recovering input data from gradient signals.
- Federated Adversarial Attack in Test-Time Adaptation
Designed a gradient inversion attack that retrieves local client data during test-time continual learning, and proposed a defense that encrypts gradients before they are sent to the aggregator (a minimal inversion sketch follows).
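Gradient inversion (in the spirit of Deep Leakage from Gradients) recovers inputs by optimizing a dummy example until its gradient matches the one the attacker observed. A minimal sketch on a toy linear model; the model, shapes, and iteration count are illustrative, not the papers' setups:

```python
import torch
import torch.nn as nn

model = nn.Sequential(nn.Flatten(), nn.Linear(28 * 28, 10))
loss_fn = nn.CrossEntropyLoss()

# Victim side: the gradient the server (attacker) observes for one example.
x_true = torch.rand(1, 1, 28, 28)
y_true = torch.tensor([3])
true_grads = torch.autograd.grad(loss_fn(model(x_true), y_true),
                                 model.parameters())

# Attacker side: recover the input by matching gradients.
x_dummy = torch.rand_like(x_true, requires_grad=True)
y_dummy = torch.randn(1, 10, requires_grad=True)  # soft label, also optimized
opt = torch.optim.LBFGS([x_dummy, y_dummy])

def closure():
    opt.zero_grad()
    dummy_grads = torch.autograd.grad(
        loss_fn(model(x_dummy), y_dummy.softmax(dim=-1)),
        model.parameters(), create_graph=True)
    # Distance between the dummy gradient and the observed gradient.
    loss = sum(((dg - tg) ** 2).sum()
               for dg, tg in zip(dummy_grads, true_grads))
    loss.backward()
    return loss

for _ in range(10):
    opt.step(closure)
print((x_dummy - x_true).abs().mean())  # shrinks as reconstruction improves
```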
Federated Learning:
- When a Modality is Missing: A Cross-Modal Recovery for Federated Multimodal LLM Models (pdf)
Introduces a modality recovery approach for clients missing either image or text in federated multimodal learning.
- FedBalanceTTA – Federated Learning with Balanced Test-Time Adaptation (pdf)
Combines test-time adaptation with federated learning to balance performance across domains and clients (see the aggregation sketch after this list).
- Federated Personalized Scanpath Prediction for Privacy-Preserving UI Optimization (pdf)
Proposes personalized scanpath learning to optimize user interfaces while preserving eye-tracking privacy.
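All three items build on a server that aggregates client updates. A minimal sketch of the standard FedAvg aggregation step; the sample counts and tiny model are hypothetical, and the papers' balancing and personalization logic would replace the plain weighted average:

```python
import copy
import torch

def fedavg(client_states, client_sizes):
    """Weighted FedAvg: average client state dicts, weighting each client
    by how many samples it trained on."""
    total = sum(client_sizes)
    avg = copy.deepcopy(client_states[0])
    for key in avg:
        avg[key] = sum(
            state[key] * (n / total)
            for state, n in zip(client_states, client_sizes)
        )
    return avg

# Hypothetical round: three clients fine-tune copies of a shared model
# locally, then the server averages their weights.
global_model = torch.nn.Linear(10, 2)
states = [global_model.state_dict() for _ in range(3)]  # stand-ins for local updates
global_model.load_state_dict(fedavg(states, client_sizes=[120, 80, 200]))
```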
Human-Computer Interaction:
- Voice-Activated Gaze-Enhanced Multimodal Interaction for Accessibility (pdf)
Combines voice and gaze input to support accessible multimodal interaction for users with limited mobility (a minimal fusion sketch appears after this list).
- Controllable 3D UI Exploration and its Future (pdf)
Explores new paradigms for interacting with 3D user interfaces, focusing on control and user experience.
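One simple way to fuse the two channels is to resolve each spoken command against the gaze sample recorded closest to the start of the utterance, so a user can say "open that" while looking at a target instead of pointing. A minimal sketch, not the paper's actual architecture; all class and field names are made up:

```python
import time

class GazeVoiceController:
    """Fuses a speech command with the gaze point sampled nearest to when
    the utterance began, yielding an action grounded at a screen location."""
    def __init__(self, gaze_window_s=0.5):
        self.gaze_window_s = gaze_window_s
        self.gaze_log = []  # (timestamp, x, y) samples from the eye tracker

    def on_gaze(self, x, y):
        self.gaze_log.append((time.time(), x, y))

    def on_voice(self, command, spoken_at):
        # Pick the gaze sample nearest the start of the utterance.
        t, x, y = min(self.gaze_log, key=lambda s: abs(s[0] - spoken_at))
        if abs(t - spoken_at) <= self.gaze_window_s:
            return {"action": command, "target": (x, y)}
        return None  # gaze too stale to ground the command

ctrl = GazeVoiceController()
ctrl.on_gaze(412, 188)
print(ctrl.on_voice("open", spoken_at=time.time()))
```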
IoT, Embedded Systems, Wearable/Wireless Sensors:
- Wearable Sensor Feature Alignment (Accelerometer, Gyro, Orientation) for Human Activity Recognition (pdf)
Aligns sensor features using projection techniques to enhance HAR in low-resource or noisy environments (a projection sketch follows).
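A minimal sketch of projection-based alignment: each sensor stream gets its own linear head into a shared embedding space, and a cosine loss pulls co-occurring windows together. The dimensions and names are illustrative, not the paper's configuration:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class ModalityProjector(nn.Module):
    """Projects each sensor stream (accelerometer, gyroscope, orientation)
    into one shared embedding space so features can be compared and fused."""
    def __init__(self, in_dims, shared_dim=64):
        super().__init__()
        self.heads = nn.ModuleList(nn.Linear(d, shared_dim) for d in in_dims)

    def forward(self, feats):
        return [F.normalize(h(f), dim=-1) for h, f in zip(self.heads, feats)]

def alignment_loss(embs):
    # Pull embeddings of the same activity window together across modalities.
    loss = 0.0
    for i in range(len(embs)):
        for j in range(i + 1, len(embs)):
            loss = loss + (1 - F.cosine_similarity(embs[i], embs[j]).mean())
    return loss

# Hypothetical windows: acc (6-d), gyro (3-d), orientation (4-d) features.
proj = ModalityProjector([6, 3, 4])
embs = proj([torch.randn(32, 6), torch.randn(32, 3), torch.randn(32, 4)])
print(alignment_loss(embs))
```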
Medical Imaging:
- LLM-Driven Question Answering and Captioning for 3D CT and MRI Data (pdf)
A vision encoder extracts feature embeddings from 3D brain MRI and abdominal CT volumes; the embeddings are aligned with language context via question answering and captioning through a large language model (Qwen-4B VLM).
- Organ-Seg: A Vision-Language and LLM-Enhanced Framework for User-Guided Abdominal Organ Segmentation
Instruction-guided medical image segmentation framework that combines LLaVA and SAM to deliver accurate, context-aware, user-driven segmentation, even under false-premise instructions.
- 3D Cerebrovascular Segmentation Using a Semi-Supervised Approach (pdf)
Semi-supervised, uncertainty-aware knowledge distillation for brain-vessel segmentation from 3D MRA data (ITK-TubeTK dataset); an uncertainty-masking sketch follows.
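Uncertainty-aware distillation typically estimates teacher confidence with Monte Carlo dropout and masks the consistency loss where the teacher is unsure. A minimal sketch for binary vessel masks; `n_mc`, the entropy threshold, and the sigmoid/entropy choices are assumptions, not the paper's exact recipe:

```python
import torch

def uncertainty_masked_consistency(student_logits, teacher_model, x_unlabeled,
                                   n_mc=8, threshold=0.5):
    """Consistency loss on unlabeled volumes, down-weighted where the teacher
    is uncertain (uncertainty estimated via Monte Carlo dropout)."""
    teacher_model.train()  # keep dropout active for MC sampling
    with torch.no_grad():
        probs = torch.stack(
            [torch.sigmoid(teacher_model(x_unlabeled)) for _ in range(n_mc)]
        )
        mean_p = probs.mean(dim=0)
        # Predictive entropy as a per-voxel uncertainty map.
        entropy = -(mean_p * torch.log(mean_p + 1e-6)
                    + (1 - mean_p) * torch.log(1 - mean_p + 1e-6))
        mask = (entropy < threshold).float()  # keep only confident voxels
    mse = (torch.sigmoid(student_logits) - mean_p) ** 2
    return (mask * mse).sum() / mask.sum().clamp(min=1.0)

# Toy usage: a dropout-equipped 3D teacher on tiny hypothetical MRA patches.
teacher = torch.nn.Sequential(torch.nn.Conv3d(1, 1, 3, padding=1),
                              torch.nn.Dropout3d(0.2))
x_u = torch.randn(2, 1, 8, 16, 16)
student_logits = torch.randn(2, 1, 8, 16, 16, requires_grad=True)
print(uncertainty_masked_consistency(student_logits, teacher, x_u))
```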