Ebelechukwu Nwafor

About
Publications
Research
Students
Teaching

About
Publications
Research
Students
Teaching

Work

Research Areas

My lab works across security, language, and health — building systems that are trustworthy and equitable. Click any area to learn more.

Health / NLP

Low-resource Languages

Nigeria is one of the most populous and linguistically diverse countries in Africa, with over 500 languages spoken. Yet the vast majority of NLP research focuses on high-resource languages like English and Mandarin, leaving Nigerian language speakers severely underserved by modern AI tools.

Our work in this area focuses on creating high-quality, machine-readable datasets for low-resource Nigerian languages — particularly Igbo and Nigerian Pidgin — and developing neural machine translation (NMT) models that can bridge these languages with English and other widely-spoken languages.

We survey the landscape of machine translation research on Nigerian languages, identify gaps in existing datasets and methodologies, and propose future directions for growing the community of researchers working on African language technologies.

Keywords

NLPMachine TranslationLow-resource LanguagesIgboNigerian PidginDataset Creation

Related Publications

2025

Fostering Digital Inclusion for Low-Resource Nigerian Languages: A Case Study of Igbo and Nigerian Pidgin

Proceedings of the Eighth Workshop on Technologies for Machine Translation of Low-Resource Languages (LoResMT 2025)

2022

A Survey of Machine Translation Tasks on Nigerian Languages

Proceedings of the Thirteenth Language Resources and Evaluation Conference (LREC), 2022

View all publications →

Health / NLP

Health Informatics

Health informatics sits at the intersection of computing and public health, leveraging data-driven methods to improve outcomes, inform policy, and understand population-level trends. My research in this area applies modern NLP techniques to real-world health datasets.

One focus area is understanding public sentiment around vaccines — particularly COVID-19 vaccines — by analyzing social media data across geographic regions. This work helps public health communicators understand hesitancy patterns and tailor their messaging accordingly.

We are also exploring predictive models for health forum data, working to identify early indicators of disease exacerbation in patient communities discussing conditions like asthma, and developing privacy-preserving techniques for sensitive medical data.

Keywords

Health InformaticsNLPSentiment AnalysisCOVID-19Vaccine DiscoursePredictive ModelingPrivacy

Related Publications

2023

Privacy-Preserving Intrusion Detection System for Internet of Vehicles using Split Learning

IEEE/ACM 10th International Conference on Big Data Computing, Applications and Technologies (BDCAT), 2023

2021

Covid vaccine sentiment analysis by geographic region

IEEE International Conference on Big Data (Big Data), 2021

2018

Anomaly-based intrusion detection of IoT device sensor data using provenance graphs

1st International Workshop on Security and Privacy for the Internet-of-Things, 2018

View all publications →

Security / IoT

IoT & Vehicular Security

Modern vehicles and IoT deployments are increasingly interconnected, making them attractive targets for cyberattacks. Controller Area Network (CAN) buses — the communication backbone of most vehicles — were designed without security in mind, leaving them vulnerable to injection and replay attacks.

Our work applies large language models (CANBERT) to detect intrusions on in-vehicle networks by treating CAN bus traffic as a language and learning normal communication patterns. Anomalies in this 'language' signal potential attacks.

We extend this to broader IoT settings using graph-based representation learning to model device communication patterns, and federated learning to enable collaborative anomaly detection across devices without sharing raw data — preserving privacy while improving detection accuracy.

Keywords

IoT SecurityVehicular NetworksCAN BusBERTIntrusion DetectionGraph LearningFederated Learning

Related Publications

2024

Evaluating Large Language Models for Enhanced Intrusion Detection in Internet of Things Networks

IEEE Global Communications Conference (GLOBECOM), 2024

2023

FedCime: An Efficient Federated Learning Approach For Clients in Mobile Edge Computing

IEEE International Conference on Edge Computing and Communications (EDGE), 2023

2023

Privacy-Preserving Intrusion Detection System for Internet of Vehicles using Split Learning

IEEE/ACM 10th International Conference on Big Data Computing, Applications and Technologies (BDCAT), 2023

2022

CANBERT: A Language-based Intrusion Detection Model for In-vehicle Networks

IEEE 21st International Conference on Machine Learning and Applications (ICMLA), 2022

View all publications →

Security / IoT

Data Provenance for Cyber-Physical Systems

Data provenance — the ability to trace the origin and history of data as it moves through a system — is a powerful tool for security and accountability. In cyber-physical systems (CPS) and IoT environments, where devices are often resource-constrained and interconnected, provenance tracking introduces unique challenges.

My doctoral work established trace-based provenance collection frameworks for IoT devices, enabling lightweight capture of data flow information even on embedded systems with limited memory and processing power.

By modeling provenance as graphs, we can apply anomaly detection algorithms to identify unusual data flows that may indicate compromise, misconfiguration, or attack — providing a fundamentally new lens for CPS security that complements traditional signature-based approaches.

Keywords

Data ProvenanceCyber-Physical SystemsIoTAnomaly DetectionProvenance GraphsEmbedded Systems

Related Publications

2023

IoT-MGSec: Mitigating Man-in-the-Middle Attacks in IoT Networks Using Graph-Based Learning

IEEE 22nd International Conference on Machine Learning and Applications (ICMLA), 2023

2021

Dynamic load sharing in memory constrained devices: a survey

IEEE 7th World Forum on Internet of Things (WF-IoT), 2021

2021

Detecting network traffic intrusions on memory constrained embedded systems

IEEE International Symposium on Technologies for Homeland Security (HST), 2021

2019

Towards an Interactive Visualization Framework for IoT Device Data Flow

IEEE, 2019

View all publications →

Systems / ML

Federated Learning for Edge Computing

Federated learning enables multiple devices to collaboratively train machine learning models without sharing their raw data — a crucial property in settings where privacy matters and data is sensitive. At the mobile edge, however, participating devices face severe constraints: limited battery life, unstable connectivity, and heterogeneous hardware.

FedCime, one of our contributions in this space, addresses the challenge of efficient federated learning for mobile edge clients by reducing communication overhead and adapting to varying client capabilities without sacrificing model quality.

We also apply federated learning to vehicular edge networks, where vehicles cooperate on tasks like task offloading decisions using deep reinforcement learning — enabling energy-efficient collaborative intelligence across a highly dynamic network topology.

Keywords

Federated LearningEdge ComputingMobile NetworksVehicular NetworksDeep Reinforcement LearningPrivacy

Related Publications

2023

Deep Reinforcement Learning for Energy-Efficient Task Offloading in Cooperative Vehicular Edge Networks

IEEE 21st International Conference on Industrial Informatics (INDIN), 2023

2023

FedCime: An Efficient Federated Learning Approach For Clients in Mobile Edge Computing

IEEE International Conference on Edge Computing and Communications (EDGE), 2023

2023

Privacy-Preserving Intrusion Detection System for Internet of Vehicles using Split Learning

IEEE/ACM 10th International Conference on Big Data Computing, Applications and Technologies (BDCAT), 2023

2018

Anomaly-based intrusion detection of IoT device sensor data using provenance graphs

1st International Workshop on Security and Privacy for the Internet-of-Things, 2018

View all publications →

Systems / IoT

Load Sharing in Memory-Constrained Devices

Many IoT deployments rely on microcontrollers and embedded systems with kilobytes of RAM and flash storage — far too limited for the complex workloads modern applications demand. Dynamic load sharing offers a path forward: intelligently distributing computation across devices in a network to collectively handle tasks that no single device could manage alone.

Our survey work maps the landscape of existing load-sharing approaches for memory-constrained devices, identifying gaps in current techniques and opportunities for improvement. We examine strategies ranging from task migration to cooperative caching.

Building on this, we design and simulate load-sharing protocols that account for the real-world constraints of IoT environments: intermittent connectivity, heterogeneous hardware, and strict energy budgets — enabling richer functionality without requiring hardware upgrades.

Keywords

IoTEmbedded SystemsLoad SharingMemory ConstraintsResource ManagementEdge Computing

Related Publications

2024

Simulating Load Sharing for Resource Constrained Devices

IEEE Access, Volume 12, 2024

2023

FedCime: An Efficient Federated Learning Approach For Clients in Mobile Edge Computing

IEEE International Conference on Edge Computing and Communications (EDGE), 2023

2023

IoT-MGSec: Mitigating Man-in-the-Middle Attacks in IoT Networks Using Graph-Based Learning

IEEE 22nd International Conference on Machine Learning and Applications (ICMLA), 2023

2021

Dynamic load sharing in memory constrained devices: a survey

IEEE 7th World Forum on Internet of Things (WF-IoT), 2021

View all publications →

Ebelechukwu Nwafor

© 2024 · Associate Professor, Villanova University