I am Aisha Malik

|

Hey there! I’m a Computer Science student at CUNY Hunter College, passionate about building AI systems that are both powerful and equitable. My work spans machine learning research, software development, and environmental data science — from fine-tuning financial question-answering models to developing AI for dermatology. Along the way, I’ve interned at Microsoft, Cognizant, KPMG, and the NSF, presented award-winning research on satellite-based climate analysis, and built full-stack projects (shoutout to SBench — my capstone project!). Feel free to explore my portfolio and connect with me on LinkedIn!

Profile border
Aisha Malik

Featured Projects

A showcase of some of my favorite work.

Research Experience

A showcase of my research projects and contributions.

Research Wallpaper
Research Presentation

Click to see what I presented at the Emerging Researchers National Conference (ERN) in Washington D.C.!

Poster Presentations Wallpaper

Poster Presentations

Analysis Of Dissolved Organic Content And Water-Color Trends In The Adirondacks Using Satellite Remote Sensing

The American Geophysical Union (AGU) National Conference, 2023

Analysis Of Dissolved Organic Content And Water-Color Trends In The Adirondacks Using Satellite Remote Sensing

Celebrating Research Excellence Symposium at CityTech, 2023

Assessment of Inland Lake Water Color and Dissolved Organic Content via Satellite-Based Machine Learning

The Emerging Researchers National (ERN) Conference, 2024

Assessment of Inland Lake Water Color and Dissolved Organic Content via Satellite-Based Machine Learning

The Mathematical Association of America (MAA) Conference, 2023

Satellite-Based Analysis of Water-Color and Dissolved Organic Content Within Inland Lakes

The American Meteorological Society (AMS) National Conference, 2024

Analyzing Atmospheric Correction Algorithms for Climate Change Impact on Water Quality in Clear Lakes Using Landsat

American Geophysical Union (AGU), 2024

Analysis of Atmospheric Correction Algorithms to Assess Climate Change Impacts on Lake Water Quality

The Emerging Researchers National (ERN) Conference, 2025

Evaluating Atmospheric Correction Algorithms for Accurate Water Quality Detection in Lakes Using Landsat: A Comparative Analysis

The American Meteorological Society (AMS) National Conference, 2025

Python

Python

TypeScript

TypeScript

JavaScript

JavaScript

HTML

HTML

CSS

CSS

SQL

SQL

C++

C++

Python

Python

TypeScript

TypeScript

JavaScript

JavaScript

HTML

HTML

CSS

CSS

SQL

SQL

C++

C++

R

R

PostgreSQL

PostgreSQL

Dart

Dart

Python

Python

TypeScript

TypeScript

JavaScript

JavaScript

HTML

HTML

CSS

CSS

SQL

SQL

C++

C++

Python

Python

TypeScript

TypeScript

JavaScript

JavaScript

HTML

HTML

CSS

CSS

SQL

SQL

C++

C++

R

R

PostgreSQL

PostgreSQL

Dart

Dart

React

React

Tailwind CSS

Tailwind CSS

Raspberry Pi

Raspberry Pi

Firebase

Firebase

Trello

Trello

Jupyter

Jupyter

Vercel

Vercel

TensorFlow

TensorFlow

Vite

Vite

HuggingFace

HuggingFace

Next.js

Next.js

PowerBi

PowerBi

Git

Git

Material-UI

Material-UI

Scikit-learn

Scikit-learn

Keras

Keras

LangChain

LangChain

Flutter

Flutter

Node.js

Node.js

React

React

Tailwind CSS

Tailwind CSS

Raspberry Pi

Raspberry Pi

Firebase

Firebase

Trello

Trello

Jupyter

Jupyter

Vercel

Vercel

TensorFlow

TensorFlow

Vite

Vite

HuggingFace

HuggingFace

Next.js

Next.js

PowerBi

PowerBi

Git

Git

Material-UI

Material-UI

Scikit-learn

Scikit-learn

Keras

Keras

LangChain

LangChain

Flutter

Flutter

Node.js

Node.js

Experience

My professional journey and roles

May - July 2025

Data Science Intern

Company Logo

Microsoft

»

Reproduced key findings from the paper “Community-Based Fact-Checking on Twitter’s Birdwatch Platform” by replicating core data visualizations.

»

Built logistic regression models using 10K+ public data points to analyze predictors of helpfulness ratings and total votes on Community Notes, reporting standardized estimates with 99% confidence intervals.

»

Automated data collection and cleaning workflows using R and Unix Shell scripting to ensure full reproducibility.

»

Conducted an independent regression analysis to predict daily Citibike trip volumes using multiple linear regression with polynomial and interaction terms, applying k-fold cross-validation for model robustness.

RStatistical ModelingRegression AnalysisData VisualizationShell ScriptingReproducible ResearchCross-ValidationData Cleaning
Jun 2023 - Present

Machine Learning Engineer Intern

Company Logo

National Science Foundation (NSF)

»

Collaborated with NASA research teams to develop predictive models for harmful algal blooms (HABs) in freshwater lakes.

»

Applied atmospheric correction algorithms to enhance satellite-derived water quality data for CDOM analysis.

»

Built advanced machine learning models to estimate Dissolved Organic Carbon (DOC) using multi-sensor satellite data (Landsat 5/7/8, Sentinel-2) and 30+ years of in-situ measurements from 40+ Adirondack lakes.

»

Achieved strong predictive performance (R² = 0.90, MSE = 0.66 mg/L) using stacking regressors and neural networks.

»

Conducted trend analyses on water temperature across 130+ lakes, revealing long-term ecological shifts linked to climate and land-use change.

PythonGoogle Earth EngineMachine LearningTensorFlowScikit-learnJavaScriptRemote SensingData Analysis
January 2025

Software Engineer Intern

Company Logo

Cognizant

»

Built and deployed 3+ AI agents for automation tasks including text summarization, survey analytics, and documentation generation using Copilot Studio, reducing manual workload by 60%.

»

Developed a full-stack web application with React, Node.js, and a Flask-based API to process PDFs, DOCX, and images for automated workflow extraction and diagram generation using Gemini 2.0 Flash and Mermaid.js.

»

Integrated a Vue.js frontend for seamless file uploads and real-time interaction with the backend through secure, CORS-enabled endpoints.

ReactVue.jsNode.jsFlaskCopilot StudioGemini 2.0 FlashMermaid.jsAgile DevelopmentPythonTrello
Aug 2024 - Dec 2024

AI Engineer Intern

Company Logo

Anote

»

Fine-tuned multiple lightweight Hugging Face transformer models (Gemma-2, Phi-2, TinyLLaMA, DeepSeek) using LoRA for financial question-answering tasks, achieving the highest numeric accuracy of ~25% with the best single model.

»

Evaluated zero-shot, context-augmented, and retrieval-augmented generation (RAG) strategies across Hugging Face and GUFF model families, identifying zero-shot RAG as the top performer for GUFF models with ~12% accuracy on numeric financial data.

»

Developed retrieval-augmented pipelines with LangChain and FAISS for efficient context chunking and similarity search, improving both answer relevance and semantic accuracy.

PythonHugging FaceTransformersLoRALangChainFAISSRAGMachine Learning
Jun 2024 - Aug 2024

IT & Tech Support Intern

Company Logo

NYC Department of Education

»

Delivered comprehensive technical support by diagnosing and resolving complex hardware, software, and network issues, minimizing downtime and ensuring business continuity for end-users.

»

Led the setup of IT infrastructure across 7+ locations, deploying and configuring over 300 laptops.

»

Collaborated with cross-functional teams and IT specialists to integrate and deploy technology solutions seamlessly, actively contributing to system improvements, and participating in strategic meetings to align on project goals and IT priorities.

»

Assisted in developing and maintaining detailed IT documentation, including network infrastructure diagrams, user manuals, and troubleshooting protocols, to streamline future support operations.

Information TechnologyNetworking TroubleshootingHardware SUpportSystem ConfirgurationSoftware SupportIT Infrastructure Management
Jan 2024

Data Science Intern

Company Logo

KPMG

»

Collaborated with the KPMG Advisory team on analyzing 40,000+ NYC asthma cases, uncovering correlations and proposing partnerships with 3 external organizations.

»

Evaluated statistical and predictive models (OLS Linear Regression, Random Forest), achieving 87% accuracy in forecasting the impact of increased health insurance access on asthma prevalence in 25 Bronx neighborhoods.

PythonPandasNumpyMatplotlibSeabornScikit-learnPowerBI