Naeem Shahabi Sani

Bio & Experience

I'm a PhD candidate in Computer Science at the University of Oklahoma, advised by Dr. Lan and Dr. Hougen. My research focuses on machine learning, artificial intelligence, and optimization, with particular emphasis on federated learning, recommender systems, randomized algorithms, low-rank matrix factorization, and learning under missing data. I develop scalable ML systems that are both theoretically grounded and effective in practice. My work spans feature engineering, dimensionality reduction, and model optimization (PCA, autoencoders, learned feature selection), including neural network design choices such as activation functions, with pipelines achieving up to 10× runtime speedups while maintaining accuracy. I've validated these approaches on real-world datasets including MovieLens, FilmTrust, and BookCross. My expertise covers supervised and unsupervised learning, deep learning, and privacy-preserving distributed systems including differential privacy and large-scale matrix recovery. Previously, I completed an M.Sc. in Artificial Intelligence, publishing peer-reviewed research on community detection and recommender systems using multi-objective evolutionary algorithms.

News

Our paper is nominated for the Best Paper Award at ACM GECCO 2026 (San José, Costa Rica)
Paper accepted at ACM GECCO 2026 (San José, Costa Rica): "Evolving Multi-Channel Confidence-Aware Activation Functions for Missing Data with Channel Propagation"
Paper accepted at SCRS ITAI 2026 (Boston, USA): "One-Shot Federated Low-Rank Matrix Factorization"

Research Interests

Bio-Inspired Optimization Feature Engineering Swarm Intelligence Randomized Algorithms Machine Learning LLM GenAI

Active Researches

Knowledge-Guided Neural Architecture Search with Large Language Models

Paper in Progress

Neural architecture search (NAS) is typically driven by reinforcement learning or evolutionary algorithms that explore enormous design spaces with little prior knowledge. We investigate how large language models can guide this search by proposing, mutating, and reasoning about candidate architectural components rather than sampling them blindly. Building on our work evolving multi-channel activation functions, we use an LLM to seed and refine genetic programming populations, injecting domain knowledge into the search and accelerating convergence toward high-performing, interpretable network components. This bridges bio-inspired optimization with LLM-driven reasoning to make architecture search more sample-efficient and explainable.

Randomized Least Square for Efficient Low-Rank Matrix Recovery

Under Review (2026)

Low-rank matrix recovery is a fundamental problem. Popular recovery techniques such as alternate least square (ALS) or nuclear norm minimization all require iterative optimization of the recovered matrix, which limits their efficiency. In this paper, we propose an extremely fast recovery technique based on randomized least square (RLSMR). It does not require iterative optimization, for it randomly assigns one low-rank factor and only optimizes another low-rank factor once (which are then used to recover the matrix). In experiment, we show RLSMR runs substantially faster than other techniques while maintaining competitive performance.

A Dragonfly and Ants at Work: A Hybrid Bio-Inspired Framework for Active Learning in Community Detection

Paper in Progress

We introduce a bio-inspired hybrid approach that combines Ant Colony Optimization and the Dragonfly algorithm for feature selection and community detection in complex networks. By integrating an active learning component, the framework reduces labeling costs while enhancing model robustness. Preliminary results on benchmark datasets (e.g., Zachary's Karate Club) demonstrate improved modularity, labeling efficiency, and stability compared to baseline methods.

Projects

Featured Project

CiteSage — Agentic Research Copilot

A self-correcting RAG system built on a multi-agent architecture that autonomously handles academic research tasks. Routes queries intelligently across local papers, arXiv, and Semantic Scholar, generates citation-grounded responses with page-level attribution, and validates outputs through hallucination detection with automatic retry.

Python, LangGraph, LangChain, ChromaDB, MCP, Streamlit, Docker

Featured Project

Enterprise RAG Agent Platform

A backend-first enterprise RAG platform built around the parts most demos skip: permission-aware retrieval that filters chunks by role before they reach the LLM, citation-grounded answers with page-level attribution, evidence checks that refuse to answer when support is weak, and query logging with latency, evidence, and groundedness metrics. Runs fully offline with local embeddings and a mock LLM, or against OpenAI/Anthropic/Ollama.

Python, FastAPI, LangGraph, ChromaDB, JWT/RBAC, Streamlit, Docker

LLM Compare — Multi-Provider Model Dashboard

An interactive dashboard that sends the same prompt to up to four models at once across OpenAI, Anthropic, Google Gemini, and local Ollama, then compares their responses side-by-side. Surfaces latency, token usage, estimated cost, and auto-scored response quality through Plotly charts, with prompt templates and CSV export for systematic model evaluation.

Python, Streamlit, Plotly, OpenAI, Anthropic, Gemini, Ollama

Probabilistic Random Forest (PRF)

An uncertainty-aware Random Forest implementation that handles measurement uncertainties in both input features and labels, making it significantly more robust in real-world scenarios where data inherently contains noise.

Python

Job-Shop Accounting Database System

Full-stack manufacturing tracker using Java + Azure SQL that takes you from customer order to cost accounting and job close-out.

Java, Azure SQL

Selected Publications

Evolving Multi-Channel Confidence-Aware Activation Functions for Missing Data with Channel Propagation

N. Shahabi Sani, Ferial Najiantabriz, Shayan Shafaei, Dean Frederick Hougen

ACM Genetic and Evolutionary Computation Conference (ACM GECCO), 9 pages, July 2026

Recently Accepted (2026) Best Paper Nominee

Paper

We propose a method to handle missing data in neural networks by developing custom activation functions. Rather than using standard functions like ReLU that only consider feature values, we employ genetic programming to create multivariate functions incorporating the feature value, missingness indicators, and confidence scores. We introduce ChannelProp, an algorithm that tracks these reliability signals throughout the network. Testing on datasets with various types of missingness shows our approach improves classification performance compared to conventional methods that don't account for data quality indicators.

One-Shot Federated Low-Rank Matrix Factorization

N. Shahabi Sani, Shayan Shafaei, Shangqing Zhao, Qi Li, Dean F. Hougen, Chao Lan

SCRS International Conference on Information Technology and Artificial Intelligence (SCRS ITAI), 9 pages, March 2026

Recently Accepted (2026)

Low-rank matrix factorization (LRMF) is a fundamental problem with broad applications. In recent years, federated LRMF has gained intensive interests due to the increasing ethical concerns on data privacy and advancement of IoT and edge computing infrastructure. However, current federated LRMF algorithms are based on gradient descent, which requires repeated communication between distributed clients and a center. Such repeated communication is extremely inefficient and has many downsides from the energy and security perspectives. In this paper, we propose the first one-shot federated LRMF algorithm called OFMF. It only requires one round of client-center communication, by leveraging the idea of random matrix factorization. Efficiency analysis suggests OFMF is more efficient than current algorithms in both communication and local computation. Through experiments on three real-world data sets, we show OFMF achieves similar or better factorization performance than prior algorithm, while substantially improves the overall factorization efficiency (e.g., 40 times faster). We also evaluate a differentially private version of OFMF and show it performs similarly, which means OFMF additionally gains differential privacy for free.

A multi-objective ant colony optimization algorithm for community detection in complex networks

N. Shahabi Sani, Mohammad Manthouri, Faezeh Farivar

Journal of Ambient Intelligence and Humanized Computing, 2018

Paper

Studying the structure of the evolutionary communities in complex networks is essential for detecting the relationships between their structures and functions. Recent community detection algorithms often use the single-objective optimization criterion. One such criterion is modularity which has fundamental problems and disadvantages and does not illustrate complex networks' structures. In this study, a novel multi-objective optimization algorithm based on ant colony algorithm (ACO) is recommended to solve the community detection problem in complex networks. In the proposed method, a Pareto archive is considered to store non-dominated solutions found during the algorithm's process. The proposed method maximizes both goals of community fitness and community score in a trade-off manner to solve community detection problem. In the proposed approach, updating the pheromone in ACO has been changed through Pareto concept and Pareto Archive. So, only non-dominated solutions that have entered the Pareto archive after each iteration are updated and strengthened through global updating. In contrast, the dominated solutions are weakened and forgotten through local updating. This method of updating the Pheromone will improve algorithm exploration space, and therefore, the algorithm will search and find new solutions in the optimal space. In comparison to other algorithms, the results of the experiments show that this algorithm successfully detects network structures and is competitive with the popular state-of-the-art approaches.

A new strategy in trust-based recommender system using k-means clustering

N. Shahabi Sani, Ferial Najian Tabriz

International Journal of Advanced Computer Science and Applications (IJACSA)

Paper

Recommender systems are among the most important parts of online systems, including online stores such as Amazon, Netflix that have become very popular in the recent years. These systems lead users to finding desired information and goods in electronic environments. Recommender systems are one of the main tools to overcome the problem of information overload. Collaborative filtering (CF) is one of the best approaches for recommender systems and are spreading as a dominant approach. However, they have the problem of coldstart and data sparsity. Trust-based approaches try to create a neighborhood and network of trusted users that demonstrate users' trust in each other's opinions. As such, these systems recommend items based on users' relationships. In the proposed method, we try to resolve the problems of low coverage rate and high RMSE rate in trust-based recommender systems using k-means clustering and ant colony algorithm (TBRSK). For clustering data, the k-means method has been used on MovieLens and Epinion datasets and the rating matrix is calculated to have the least overlapping.

Teaching

Graduate Teaching Assistant, University of Oklahoma, School of Computer Science

Mentored students through office hours and targeted help sessions, emphasizing rigorous reasoning and clear problem decomposition.
Provided algorithm walkthroughs (e.g., how/why an approach works, edge cases, and complexity reasoning) to strengthen students' conceptual understanding.
Evaluated assignments and exams with standardized rubrics; delivered constructive feedback to improve future performance.
Supported course logistics by answering questions, resolving confusion early, and ensuring consistent expectations across sections.

Cryptography

Spring 2026

Discrete Structures

Fall 2025

Algorithm Analysis

Spring 2025

Discrete Structures

Fall 2024

Machine Learning

Spring 2024

Theory of Computation

Fall 2023

Skills & Tools

Languages

Python Java SQL C++ MATLAB

ML / AI

PyTorch TensorFlow Scikit-learn NumPy Pandas LangChain Hugging Face ChromaDB

Tools

Git Docker LaTeX Azure SQL FastAPI Streamlit

Contact

For inquiries, feel free to email me at shahabi@ou.edu or use the form below.