Rahul Babu

Hello! I am Rahul, Software Engineer with experience in building AI agents, backend development, and cloud-native solutions. Skilled in building LLM-powered research assistants, NLP pipelines, and scalable APIs that process millions of records daily.

At Hovian Inc., Integrated AI and UX enhancements into an Electron + React desktop app by building reusable UI components, REST API integrations, and a Python-based LLM text-to-animated GIF generation pipeline, plus dark mode and native notifications.

Previously at ASU's Decision Theater, Built and deployed LLM + ML systems (RAG, automated eval tooling, and an opioid-risk forecasting dashboard) by automating data pipelines, integrating public datasets, and shipping interactive apps on Jetstream2 and Google Cloud Run.

Core Skills: Python, FastAPI, Django, Flask, React.js, LangChain, Hugging Face, Scikit-learn, PostgreSQL, MongoDB, AWS (EC2, RDS, S3, Glue, ECS), Docker, CI/CD. Certification: AWS Solutions Architect - Associate.

I have 3 years of experience building AI-driven full-stack applications and analytics solutions using React, JavaScript, Python, and Java. I have also deployed containerized microservices on AWS (EC2, RDS, S3, DocumentDB) with CI/CD and monitoring through CloudWatch.

Years Experience

3

Projects

16

Publications

2

Certifications

10

Rahul Babu portrait photo

Projects

16 projects shown

Interactive Portfolio

Next.js, TypeScript, Three.js, R3F, GraphQL, Hygraph | Dec 2025 - Current

Live Demo Code

  • Built and deployed an interactive 3D portfolio experience using Next.js and TypeScript with Three.js and React Three Fiber (Drei, Rapier) for immersive UI and animations.
  • Integrated Hygraph (GraphCMS) using GraphQL and Apollo Client to manage portfolio content through a CMS-driven workflow.

ClimateActionPolicy RAG Application

Streamlit, LangChain, ChromaDB, LLaMA 3, Transformers | July 2024 - Sept 2024

Live Demo Code

  • Deployed a Retrieval-Augmented Generation application that retrieves relevant policy documents and generates context-enriched responses for climate action recommendations.
  • Built an interactive Streamlit chat interface with multi-session history so users can manage and review conversations.

Opioid Overdose Forecasting Data Visualizer

React, TypeScript, FastAPI, Docker | April 2025 - May 2025

Live Demo Code

  • Developed an interactive web application to visualize Arizona opioid-risk data at the Census Block Group level using a FastAPI backend and React + TypeScript frontend.
  • Designed backend filtering and API serving and packaged the system with a unified Dockerfile for streamlined deployment.

Fake-Job-Detection

Python, Machine Learning, NLP, Fraud Detection | Feb 2025 - May 2025

Developed a fake job posting predictor to identify fraudulent listings that mimic legitimate roles and help improve trust and safety for job seekers and hiring platforms.

Code

  • Built a machine learning workflow to classify job posts as real or fraudulent using textual and structured listing signals.
  • Framed the system for real-world safety use cases, such as flagging suspicious listings before applicants share personal information or make payments.

Python

Machine Learning

NLP

Fraud Detection

Classification

Distributed NoSQL Database System for an E-Commerce Platform

Kafka, Redis, MongoDB, FastAPI | Sep 2024 - Dec 2024

Code

  • Designed a distributed NoSQL database system using Kafka, Redis, and MongoDB, achieving 100% consistency post-recovery and under 500 ms average response time for e-commerce data operations in fault-tolerant scenarios.
  • Optimized cross-region replication with FastAPI middleware, ensuring zero data loss during node failures and 248 ms response times during outages through Kafka-backed asynchronous recovery.

Kafka

Redis

MongoDB

FastAPI

Distributed Systems

NoSQL

Fault Tolerance

Cross-Region Replication

User Identification, Authentication, and Interactions for Social Media Data Analytics using AI

AI Research, Social Media Analytics, Team Leadership | Sep 2024 - Dec 2024

Led a research project as the team lead for an 8-member group and documented a 60-page research paper on user identification, authentication, and interaction analysis for social media data analytics using AI.

Code

  • Directed project planning, research coordination, and technical execution across an 8-member team.
  • Compiled and authored a comprehensive 60-page technical research report covering system design, experimentation, and findings.

Artificial Intelligence

Social Media Analytics

User Authentication

Research Leadership

Technical Documentation

Movie Review WebApp

React Search UI, Elastic App Search, Node.js, npm | Aug 2024 - Sep 2024

Built a configurable movie-review search experience using Elastic Search UI with App Search engine integration, designed for quick setup, customization, and static deployment.

Code

  • Implemented engine-driven search configuration via JSON (`engine.json`) for fields, facets, sorting, and query suggestions without regenerating the app.
  • Set up reproducible local development with Node 16.13.0 and npm, and enabled static asset build-and-embed workflow for production hosting.

React

Elastic Search UI

Elastic App Search

Node.js

npm

JSON Configuration

Static Build

Netlify Deployment

LLM Response Aggregation Tool

Selenium IDE, Python, LLM Evaluation, Data Automation | May 2024 - Jul 2024

Built a web automation and evaluation pipeline to run prompts through multiple LLMs, aggregate responses in structured sheets, and benchmark output quality with NLP metrics.

Code

  • Automated prompt execution with Selenium IDE across ChatGPT, Gemini, Claude, Perplexity, Meta AI, and HuggingChat using a localhost prompt-reader workflow.
  • Captured model responses in Google Forms and exported to Excel/CSV for downstream analysis and reporting.
  • Implemented Python-based evaluation workflows (Response_Evaluation.ipynb) using BLEU, ROUGE, cosine similarity, Distinct-1/2, sentiment, response-length analysis, and word clouds.
  • Included data-scraping support for 790 climate policy articles from c40knowledgehub.org to build evaluation and comparison datasets.

Selenium IDE

Python

LLM Benchmarking

BLEU

ROUGE

Cosine Similarity

Distinct-1/2

Data Automation

Sentiment Analysis on Twitter Datasets (NLP)

Python, Amazon SageMaker | Jan 2024 - May 2024

Code

  • Implemented SVM, Logistic Regression, Naive Bayes, XGBoost, BiLSTM, and CNN models on 416K tweets, achieving 97% accuracy with CNN using TF-IDF/word-embedding preprocessing and stratified sampling.
  • Identified SVM as the most robust model (94.15% accuracy) through comparative ROC and confusion matrix analysis.

Python

Amazon SageMaker

NLP

SVM

Logistic Regression

Naive Bayes

XGBoost

BiLSTM

CNN

TF-IDF

IoT Device Manager using Blockchain

Jan 2023 - May 2023

This project focuses on establishing a robust and secure framework for the management of IoT devices, underpinned by the principles of data immutability and transparency, all facilitated by the innovative use of blockchain technology.

Code

  • Implemented a blockchain-based solution for IoT device management.
  • Recorded critical transactions including device registration, management, and ownership transfers.
  • Developed a user-friendly web application using ReactJS.

MetaMask

Truffle Framework

Ganache

Solidity

React.js

Forest Monitoring using Hyperspectral Imaging

Nov 2022 - May 2023

This project focuses on utilizing hyperspectral imaging for forest terrain monitoring, aiming to provide detailed insights into forest contents and potential risks. It utilizes the Indian Pines dataset, which captures hyperspectral data from a forested area, and employs a hybrid classification method, including Support Vector Machines (SVM), 2D Convolutional Neural Networks (CNN), 3D CNN, M3D-CNN, SSRN, and a Hybrid model for precise forest analysis.

Code

  • Utilized Indian Pines dataset for hyperspectral imaging.
  • Employed hybrid classification methods combining SVM, 2D CNN, 3D CNN, M3D-CNN, SSRN, and a Hybrid model.
  • Provided comprehensive insights including vegetation cover percentage, building presence, and hazard detection.

Scikit-Learn

Machine Learning

TensorFlow

Exploratory Data Analysis

NumPy

Data Science

Data Analysis

Python

Pandas

Web Phishing Detection

Aug 2022 - Nov 2022

The Web Phishing Detection project is a Python-based solution deployed on the IBM Cloud platform, focused on the detection of phishing websites. It incorporates machine learning techniques to enhance accuracy.

Code

  • Trained 10 machine learning models for phishing detection.
  • Deployed the highest accuracy model on IBM Cloud.
  • Developed a website interface for user interaction with the phishing detection system.

Machine Learning

Exploratory Data Analysis

Data Science

Data Analysis

Python

Path Navigation using Deep Q Learning

Jan 2022 - Sep 2022

An application developed using Kivy and PyTorch Framework. The application simulates an autonomous vehicle navigating a map I designed, complete with manually placed obstacles.

Code

  • Used Deep Q Networks (DQNs) with relu activation functions for navigation.
  • Designed a reward system to improve obstacle avoidance capabilities.
  • Received approval for publication in ADCOM Volume 133.

Machine Learning

NumPy

Python

Tournament Manager

Jul 2021 - Nov 2021

The Tournament Manager is a Java-based application designed to facilitate the efficient organization of sports tournaments.

Code

  • Designed and configured tournaments for different divisions and sports.
  • Streamlined team enrollment and player information updates.
  • Developed using Java and MySQL.

Java

Harmful Comments Classifier

Jan 2021 - Apr 2021

The "Harmful Comments Classifier" is a Python-based Natural Language Processing (NLP) project, executed within the Google Colab environment.

Code

  • Developed a deep learning model for classifying comments based on toxicity.
  • Utilized embedding techniques, bi-directional LSTM, and dense models.
  • Executed within the Google Colab environment.

Machine Learning

Exploratory Data Analysis

Data Science

Data Analysis

Python

EATERIO

Jan 2020 - May 2020

"EATERIO" is a web application built using HTML, CSS, JavaScript, and MongoDB, designed to facilitate food ordering and management within organizations.

Code

  • Streamlined the food ordering process within organizations.
  • Developed using HTML, CSS, JavaScript, and MongoDB.
  • Provided a user-friendly interface for placing and managing food orders.

HTML

CSS

JavaScript

MongoDB

Blogs

Opioid Risk Geo Dashboard

React, TypeScript, FastAPI

Wrote about building an Arizona overdose-risk visualizer at Census Block Group resolution, covering the geospatial UI, API integration, and product decisions behind the dashboard.

Live Demo Medium Code

Climate Policy RAG App

Streamlit, LangChain, ChromaDB

Documented how the climate-policy assistant was built using retrieval, reranking, post-processing, and analytics to improve answer quality and make the app more useful for policy research workflows.

Live Demo Medium Code

Work Experience

Hovian Inc.

Cucamanga, California, USA

Software Engineer

July 2025 - Dec 2025

  • Built and maintained an Electron + React single-page desktop application using React, JavaScript/TypeScript, HTML5, and CSS3, supporting multiple interactive UI views for notification and workflow management.
  • Designed and shipped 20+ modular, reusable React components (chat UI, prompt inputs, settings panels, toggles) to support dynamic features such as AI chat, theme switching, and workflow navigation.
  • Managed client-side state and UI transitions (component-level state + shared app state), reducing UI-related bugs and improving interaction responsiveness and UX consistency.
  • Integrated frontend modules with backend services via RESTful APIs, implementing reliable data fetching, loading states, and error handling for daily user workflows.
  • Developed Python-based image processing utilities to power an AI "Animated GIF" feature (sprite sheet slicing, pixelation with PIL, GIF assembly with imageio), delivering animated outputs from text prompts.
  • Implemented dark mode with persisted user preferences and OS theme sync, and integrated native desktop notifications via Electron APIs to ensure alerts triggered reliably even when the app was minimized.
  • Implemented a hybrid backend communication layer using Node.js with REST (HTTP) APIs and WebSockets to stream real-time Outlook notifications and support AI-generated responses.
  • Built and maintained an Electron + Node.js desktop app with tabbed and multi-view navigation using JavaScript, HTML5, and CSS3 to support notification and workflow management.

ASU Decision Theater

Tempe, Arizona, USA

Software Engineer

May 2024 - May 2025

  • Designed and implemented a machine learning pipeline using Python and R to analyze 50K+ census and public healthdatasets for regional risk prediction.
  • Trained and evaluated predictive models using XGBoost and ensemble techniques, focusing on accuracy and interpretability.
  • Built a real-time interactive risk dashboard using React and TypeScript, supporting 10+ dynamic visual components and deployed with FastAPI and Docker on Google Cloud Run.
  • Developed backend API services using FastAPI to serve dashboard data, enabling low-latency access for real-time filtering and visualization.
  • Automated large-scale data collection using Python and Selenium, gathering 1,000+ healthcare survey responses and 795 climate policy documents, followed by structured analysis and evaluation to measure LLM resilience on heat and healthcare prompts.
  • Deployed a RAG-based chatbot using Llama 3, LangChain, and Chroma on Jetstream2 HPC to summarize 100+ interview transcripts for the Maricopa Association of Governments and support ASU's LLM resilience research with climate policy recommendations.
  • Performed geospatial opioid risk analysis using PostGIS by writing spatial SQL queries (proximity and region joins, hotspot-style aggregations) to map overdose patterns and generate region-level insights for the dashboard.

InnoHat Systems

Chennai, TN, India

Software Developer

Jan 2022 - July 2023

  • Designed hybrid storage using PostgreSQL for transactions and MongoDB for product metadata and user reviews, reducing schema-change overhead while keeping financial data ACID-compliant.
  • Built Flask REST APIs for checkout, refunds, and shipment events, with 200+ test cases in pytest to reach 85% coverage and reduce regression bugs.
  • Orchestrated ETL pipelines in Python (pandas, SQLAlchemy, and PyMongo) to merge order history with review sentiment data, optimizing runtime by 40% on 20M+ rows and 5M+ documents.
  • Containerized services with Docker and deployed on AWS (EC2, RDS, S3, DocumentDB) with CloudWatch monitoring to maintain stability during seasonal traffic spikes.
  • Developed a Django admin panel for managing promotions, vendors, and warehouse sync, replacing manual spreadsheet workflows and reducing operations effort by 12 hours per week.
  • Created a React.js customer portal for browsing catalogs and personalized deals, integrating cache and lazy loading to reduce page load time by 35%.
  • Trained a scikit-learn recommendation engine using transactional PostgreSQL and behavioral MongoDB data, improving upsell conversion by 11% in pilot rollout.

Skills

Publication

Performance within the Foundation Paradox

Feb 2026

The Case for Small, Qualitative Evaluations (SQEs) for Grading LLM Chat Responses for Extreme Heat Adaptation and Other Less-Bonded Fields

Submitted (In Progress)

Co-authored a research study introducing the Desert Language Model Evaluation Framework (DLEF) to assess LLM performance in extreme heat adaptation and show domain-specific strengths and limitations that large-scale benchmarks can miss.

An Automatic Path Navigation for Visually Challenged People using Deep Q Learning

Jan 2023

Advances in Computers, Elsevier, Volume 132 (2024), Pages 205-218

Published a study on autonomous navigation that uses a Deep Q-Learning model with a reward-penalty mechanism to train an agent in a generated environment for real-world obstacle avoidance.

Certification

AWS Certified Solutions Architect - Associate

Amazon Web Services (AWS)

Issued May 2025 - Expires May 2028

Show Credential

AIG - Actuarial Analyst

Forage

Issued Sep 2025

Credential ID: 6D2BARv5wgwb9AqEx

Skills: Data Analysis, Visual Analytics

Show Credential

COEBC - Code of Ethical Business Conduct Training

CITI Program

Issued Jun 2024

Credential ID: 60593265

Show Credential

RCR - Graduate Student and Postdoctoral Researcher Responsible Conduct of Research

CITI Program

Issued Jun 2024

Credential ID: 63090511

Show Credential

IRB - Social & Behavioral Research (Group 2)

CITI Program

Issued Jan 2024 - Expires Jan 2028

Credential ID: 60593339

Show Credential

Computer Architecture and Computer Organization Masterclass

Udemy

Issued Feb 2024

Credential ID: UC-4a6796d5-5009-4e25-af33-bcd88d61649b

Skills: Computer Architecture

Show Credential

Applied Data Science Capstone

Coursera

Issued Aug 2023

Credential ID: 4LQXS9ZN4373

Skills: Exploratory Data Analysis, Visual Analytics, Data Science, Data Analysis

Show Credential

Foundations of User Experience (UX) Design

Coursera

Issued Aug 2023

Credential ID: LMHXK6GDTHZ9

Skills: User Experience (UX), Wireframing

Show Credential

DevOps Beginners to Advanced with Projects - 2023

Udemy

Issued Aug 2023

Credential ID: UC-09351ca5-91b3-4cda-8beb-b87d7e80a6d9

Skills: Scripting, Docker, Linux, Kubernetes, Amazon Web Services (AWS)

Show Credential

Angular - The Complete Guide (2023 Edition)

Udemy

Issued Jun 2023

Credential ID: UC-1bc4bd69-2022-4696-a861-135f3688bb57

Skills: AngularJS, Microsoft Visual Studio Code, Angular

Show Credential

Education

Arizona State University

Tempe, AZ, USA

Master of Science in Computer Science

Aug 2023 - May 2025

CGPA: 3.83/4

Madras Institute of Technology, Anna University

Chennai, TN, India

Bachelor of Engineering in Computer Science

July 2019 - June 2023

CGPA: 8.69/10

Contact

Let us build something useful.

Send a Message