Hello, I'm Mayank Vyas

Brewing Software with AI Solutions. β˜•οΈ

As a Machine Learning Researcher, I leverage Natural Language Processing and Large Language Models to build impactful AI solutions.

Professional Experience

Arizona State University logo
NLP Researcher

Arizona State University

Jan 2025 - Present

Developing efficient Table retrieval RAG pipeline to reduce user query latency and better inference.

  • πŸš€ Served 500+ users- a RAG prototype on 160K+ NQ tables with 10s query time and 98% retrieval accuracy.
  • πŸš€ Built a ranking algorithm using S-Bert to rank query specific gold tables with 98% Accuracy.
  • πŸš€ Conducting research on improving document question-answering pipelines using sparse and learned embeddings (SPALDE) and Contrastive learning techniques for more accurate retrieval and reduced context noise.
  • πŸš€ Working on hierarchical chunking methods to optimize embeddings and improve information retrieval recall.
  • πŸš€ Designing pruning algorithms to discard irrelevant table segments for better recall and reduced hallucinations

Improved data processing and analysis for IoT applications.

  • πŸš€ IoT Infrastructure Development: Engineered a LoRa-based fog computing framework for smart agriculture, reducing sensor energy consumption by 40% and optimizing data transmission using regression models.
  • πŸš€ Data Efficiency: Deployed APAEs (Analytical Prediction Algorithm) across edge-fog-cloud layers, cutting data transmissions by 93.6% while maintaining <10% MAE.
  • πŸš€ System Integration: Streamlined sensor data collection (temperature, humidity, soil moisture) using Arduino and LoRa, achieving 98% irrigation efficiency.
Indian Institute of Information Technology Design & Manufacturing Kancheepuram logo
Machine Learning Assistant - Kalman Filter

Indian Institute of Information Technology Design & Manufacturing Kancheepuram

May 2023 - January 2024
Indian Institute of Information Technology Design & Manufacturing Kancheepuram logo
Research Intern - Machine Learning Framework development

Indian Institute of Information Technology Design & Manufacturing Kancheepuram

April 2023 - January 2023

Developed an IoT machine learning framework using TensorFlow Lite and Decision Trees, enabling real-time actuation during internet outages with minimum transmission costs.

  • πŸš€ Designed a Regressive Prediction Data Forwarding Model (RPDM) using TensorFlow Lite, reducing bandwidth usage by 85% in IoT networks.
  • πŸš€ Achieved 99.97% prediction accuracy with Decision Trees, enabling real-time actuation on edge devices during internet outages.
  • πŸš€ Implemented lightweight model compression for deployment on Raspberry Pi/Arduino, reducing power consumption by 82.89%.

Developed IoT data aggregation and real-time monitoring systems, optimizing efficiency and publishing findings in IEEE AINA 2023

  • πŸš€ Designed a Ward's method clustering algorithm to compress IoT sensor data by 57.39%, deployed on fog nodes to reduce cloud transmission costs by 38%.
  • πŸš€ Integrated with The Things Network, achieving 1.1s latency for real-time field monitoring, improving response time by 35% over traditional cellular networks.
  • πŸš€Published in IEEE AINA 2023 and tested on a 20-acre testbed, cutting energy consumption by 82.89% at tolerance thresholds (Ξ΅=1.0)
Indian Institute of Information Technology Design & Manufacturing Kancheepuram logo
Research Intern - IoT Innovator

Indian Institute of Information Technology Design & Manufacturing Kancheepuram

May 2022 - August 2022
Portfolio

Featured Projects

Showcasing innovative solutions that blend cutting-edge technology with real-world impact

πŸ“Œ Enterprise Sales Analytics Dashboard
February 2025

πŸ“Œ Enterprise Sales Analytics Dashboard

Developed an enterprise-grade Power BI dashboard implementing DAX measures and advanced data modeling techniques to transform raw sales data into actionable business intelligence. The solution features multi-dimensional analysis capabilities with drill-through functionality for granular insights.

Power BI DAX
Data Modeling
ETL Pipeline
+1
4 key achievements
πŸ›’ Intel Automated Checkout System (OSS Contribution)
January 2025 - Present

πŸ›’ Intel Automated Checkout System (OSS Contribution)

Engineered a microservices-based observability solution for Intel's retail edge computing platform that processes real-time computer vision data. Implemented comprehensive telemetry capturing CPU/GPU utilization, inference latency, and throughput metrics critical for retail deployment reliability.

Docker Compose
Grafana Dashboards
MQTT Protocol
+1
4 key achievements
🌱 MaskRoot: Computer Vision for Agricultural Phenomics
April 2023 - April 2024

🌱 MaskRoot: Computer Vision for Agricultural Phenomics

Engineered an instance segmentation pipeline utilizing Mask R-CNN architecture to automate root phenotyping at scale. The system overcomes occlusion challenges through a custom-designed loss function and transfer learning from MS COCO weights to compensate for limited agricultural training data.

TensorFlow 2.x
OpenCV
Mask R-CNN
+1
3 key achievements
πŸ“‘ DASA: Distributed Agricultural Sensing Architecture
May 2022 - August 2022

πŸ“‘ DASA: Distributed Agricultural Sensing Architecture

Designed a hierarchical IoT architecture leveraging LoRaWAN's low-power wide-area network capabilities for agricultural monitoring in remote areas. Implemented a novel fog computing layer using edge devices to perform data preprocessing, anomaly detection, and compression before cloud transmission.

Apache Spark Streaming
LoRaWAN Protocol
Ward Hierarchical Clustering
+1
3 key achievements
🚦 Deep Reinforcement Learning for Urban Traffic Control
January 2024 - April 2024

🚦 Deep Reinforcement Learning for Urban Traffic Control

Developed an adaptive traffic signal control system using Deep Q-Networks (DQN) in the SUMO traffic simulation environment. The system leverages vehicle-to-infrastructure (V2I) communication to optimize traffic flow based on real-time density and waiting time metrics.

RLlib Framework
SUMO Traffic Simulator
TensorFlow
+1
3 key achievements
🧠 Multi-Layer Perceptron Implementation from First Principles
August 2024 - November 2024

🧠 Multi-Layer Perceptron Implementation from First Principles

Built a neural network framework from mathematical foundations without reliance on deep learning libraries. Implemented forward propagation, backpropagation, gradient descent optimization, and regularization techniques to demonstrate core principles of neural computation.

NumPy Vectorization
Computational Graphs
Gradient Descent Optimization
+1
3 key achievements
πŸ“Ά RPDM: Resource-efficient Predictive Decision Model for IoT
August 2023 - January 2024

πŸ“Ά RPDM: Resource-efficient Predictive Decision Model for IoT

Designed an ultra-lightweight machine learning inference system for resource-constrained IoT devices that optimizes when to transmit sensor data based on predictive value. The framework uses model quantization and pruning techniques to enable ML on microcontrollers with severe memory constraints.

TensorFlow Lite for Microcontrollers
Decision Tree Ensemble
FlatBuffers Serialization
+1
3 key achievements
πŸ› οΈ Scalable Data Processing Pipeline for Time-Series Analytics
August 2024 - October 2024

πŸ› οΈ Scalable Data Processing Pipeline for Time-Series Analytics

Architected a distributed ETL pipeline for processing high-frequency sensor data from industrial equipment. The system handles data ingestion, cleansing, transformation, and aggregation while maintaining data lineage for regulatory compliance and audit purposes.

PySpark Structured Streaming
Pandas DataFrames
SQL Window Functions
+1
3 key achievements
πŸ“Š Geospatial Market Intelligence Platform for Tucson Businesses
August 2024 - December 2024

πŸ“Š Geospatial Market Intelligence Platform for Tucson Businesses

Developed a comprehensive market intelligence platform integrating geospatial, demographic, and economic data sources to identify growth patterns and market opportunities in Arizona's urban centers. Utilized advanced spatiotemporal analysis to reveal hidden business patterns.

PySpark Geospatial
Power BI DirectQuery
Tableau Spatial Visualization
+1
3 key achievements

Hackathon Adventures

Interview Unlocked: Agentic AI Interview Prep Framework
Interview Unlocked: Agentic AI Interview Prep Framework

Agentic AI Hackathon

Software Development club at ASU

April 2025

Built an Agentic AI system using Langchain and LangGraph for automated, personalized interview prepβ€”cut manual effort by 90% via modular orchestration, relevant question generation, and evaluation with feedback.

Achieved 90% user satisfaction by generating personalized prep plans and using LLMs for response evaluation, creating a comprehensive interview preparation ecosystem.

Implemented intelligent question generation algorithms that adapt to user skill level and target role requirements, ensuring relevant and challenging practice sessions.

Developed automated feedback mechanisms that provide detailed analysis of responses, highlighting strengths and areas for improvement with actionable insights.

Technologies:

LangChain
LangGraph
Python
LLM
AI Agents
Natural Language Processing
Machine Learning

Designed an end-to-end NLP candidate search engine using BERT (Hugging Face Transformers) and FAISS to convert natural-language queries into embeddings, enabling real-time semantic matching across 10,000+ profiles with <100ms latency.

Engineered a scalable data pipeline using BeautifulSoup, to scrape, clean, and structure 10,000+ GitHub profiles, extracting features like project complexity, commit frequency, and tech stack relevance, which improved candidate-match accuracy by 40% for hiring teams.

Developed a holistic applicant evaluation portal (React frontend + FastAPI backend) where candidates showcase GitHub activity (stars, forks, PRs) alongside resumes. Integrated a Popularity Index algorithm to auto-rank talent, cutting recruiter screening time by 60% while boosting candidate visibility for niche roles.

Technologies:

React
FastAPI
BERT
FAISS
Docker
AWS
GitHub API
BeautifulSoup
Pandas
SQL
Hire-Smart: AI-Powered Technical Recruitment Platform
Hire-Smart: AI-Powered Technical Recruitment Platform

DevHacks x Stratergy Hackathon

DevHacks and Stratergy

March 2025
Gamify: Interactive Learning through Automated Quiz Generation
Gamify: Interactive Learning through Automated Quiz Generation

Zoom App Hackathon

Zoom

April 2025

Developed an innovative Zoom application that leverages real-time transcription of lecture content to automatically generate interactive quizzes for students.

Integrated Zoom's Real-Time Messaging System (RTMS) to capture and process lecture transcripts as they happen, ensuring immediate content relevance.

Implemented Gemini AI to analyze transcriptions and intelligently generate contextually appropriate quiz questions based on the lecture material.

Built an intuitive user interface using React, TypeScript, and Tailwind CSS that seamlessly integrates with the Zoom platform as pop-up quizzes.

Created a backend infrastructure with Supabase for user authentication, quiz storage, and performance analytics.

Technologies:

React
TypeScript
Tailwind CSS
Zoom API
RTMS
Gemini AI
Supabase

Revolutionized industrial digital twin creation by developing a system that generates complete digital twin environments from natural language prompts in under 60 seconds.

Integrated Gemini AI to interpret complex prompts like "Build a 3D model of a 10-assembly-line factory" and translate them into actionable outputs.

Engineered automatic Boto3 script generation that dynamically builds assets, hierarchies, and 3D scenes within AWS IoT TwinMaker and SiteWise.

Implemented real-time data monitoring through AWS SiteWise telemetry integration with LLM interaction capabilities for instant insights.

Reduced digital twin setup time from hours to seconds through complete end-to-end automation.

Technologies:

Gemini AI
AWS IoT TwinMaker
AWS SiteWise
Boto3
Python
LLM
IoT
TwinGenius: AI-Powered Digital Twin Generator
TwinGenius: AI-Powered Digital Twin Generator

Devils Invent Hackathon

Honeywell & Arizona State University

April 2025

Interactive Dashboards

Explore my data visualization projects and analytical insights through interactive dashboards

Capstone Project Analytics

Comprehensive data visualization and analysis dashboard showcasing project metrics, performance indicators, and key insights.

Power BI
Project Analytics

Video Content

Explore my video content showcasing projects, tutorials, and technical insights

Productiviy Tool Demo

Showcasing how this google chrome extension improves your productivity by tracking jobs directly to your google sheets.

2024
1:47
Productivity Tool Demo
Hire-Smart: AI-Powered Technical Recruitment Platform

Showcasing how this AI-powered technical recruitment platform improves the hiring process by using AI to find the best candidates.

2024
6:58
Productivity Tool Demo

About Me

As a Data Science master's student at ASU, I architect intelligent systems by specializing in RAG (Retrieval-Augmented Generation) pipelines for LLMs and developing sophisticated AI Agents. My core expertise lies in Natural Language Processing, where I design high-performance retrieval algorithms to power next-generation AI applications.

I translate complex theory into real-world impact. My project experience includes analyzing Time Series data to build robust IoT Pipelines for smart agriculture and engineering a production-ready, dockerized pipeline for Intel's automated self-checkout system to visualize critical data on Grafana.

I also engineered a Masked R-CNN pipeline to intelligently detect the primary root length of plant species like wheat, brassica napus, and arabidopsis thaliana, enabling biologists to study the root phenome more effectively.

Technical Stack
Python
Java
HTML
C++
TensorFlow
PyTorch
MongoDB
PostgreSQL
React
Django
Power BI
Tableau
12+ Technologies & Growing

Education

Arizona State University logo
Arizona State University

Master of Science in Data Science

Aug 2024 - May 2026

Tempe, Arizona

GPA: 3.6/4.0
Institute of Infrastructure Technology Research and Management logo
Institute of Infrastructure Technology Research and Management

Bachelors of Science in Electrical Engineering

Aug 2020 - May 2024

Ahmedabad, India

GPA: 3.7/4.0