Hi, I'm Zain.

A
Self-driven, quick starter, passionate programmer with a curious mind who enjoys solving complex and challenging real-world problems.

About

I am a PhD Fellow at PCAI and CopeNLU group at the University of Copenhagen. I enjoy problem-solving and coding. I bring a unique blend of technical expertise and a passion for cutting-edge technology to the table. I have always sought out competitive and innovative environments that offer fresh challenges with each new project. This drive fuels my constant quest for learning and skill enhancement. My research interests span a wide spectrum of NLP applications, including fact-checking, factual text generation, news media profiling, and large language models.

  • Languages: Python, JavaScript, C, C++, HTML/CSS, Bash
  • Databases: MySQL, PostgreSQL, MongoDB
  • Libraries: Numpy, Pandas, Librosa, Scikit-Learn, NLTK, OpenCV
  • Frameworks: Flask, Django, Vue.js, Node.js, Selenium, Keras, TensorFlow, PyTorch, Bootstrap
  • Tools & Technologies: Git, Docker, AWS, GCP, Heroku, JIRA

Experience

Ph.D. Fellow

Responsibilities: Conduct research on factual text generation.

October 2024 - Present | Abu Dhabi, United Arab Emirates
Research Assistant - NLP

Responsibilities: Conduct research on profiling news media outlets based on their bias and factuality. Evaluating and correcting the factuality of LLMs.

July 2024 - October 2024 | Abu Dhabi, United Arab Emirates
Flight Data Modeling Engineer

Responsibilities: Utilize NLP techniques to analyze internal incident reports, predict potential consequences, assess Event Risk Categories (ERCs) and Hazards, and investigate and identify root causes to prevent future incidents.

May 2023 - June 2023 | Abu Dhabi, United Arab Emirates
Software Engineer

Responsibilities: Design, code, and debug software. Maintain and upgrade existing products to improve their performance. Train other developers and manage the technical aspects of projects.


Tools: Vue.js, Python, Flask, MongoDB
October 2021 - June 2023 | Chatham, New Jersey
Research Engineer

Responsibilities: Research and development of an end-to-end Urdu Text-to-Speech system using deep learning, full stack development of an automated Urdu broadcast media content extraction and analytics system using Django framework, development of an NLP module to generate phonetic stream from plain Urdu text for the production of natural and intelligible speech, development of a statistical POS tagger for Urdu using different machine learning approaches, creation of an installation package for the deployment of HMM based Urdu SAPI voice in Windows environment, and implementation of Fernet encryption for Urdu lexicon.


Tools: Python, Django, Keras, Tensorflow, PyTorch, MySQL
December 2020 - October 2021 | Lahore, Pakistan
Internee Engineer

Responsibilities: Predicting the failure of Power Transformers based on dissolved gas analysis (DGA) data, creating an alert system to send notifications and emails when the parameters exceed the threshold, and supervising repair, winding, insulation, and testing sections of the workshop.


Tools: Python, Flask, JavaScript
June 2019 - July 2019 | Lahore, Pakistan

Projects

speech emotion recognitions
Speech Emotion Recognition

A CNN, LSTM and Attention based model to recognize human emotions.

Accomplishments
  • Tools: Scikit-Learn, Librosa, PyTorch
  • A CNN, LSTM and Attention based model is made using PyTorch.
  • The model is trained using RAVDESS dataset.
  • The algorithm is able to predict a total of 8 emotions with upto 96% accuracy.
sentiment analysis
Sentiment Analysis Deployment

An RNN based deployed model for sentiment analysis using AWS.

Accomplishments
  • Tools: Pickle, Scikit-Learn, NLTK, PyTorch, AWS Sagemaker, AWS Lambda, EC2, API Gateway, Endpoints
  • Designed an RNN in PyTorch and trained this model on movie reviews from IMDB.
  • Deployed its web application as user interface using AWS.
Face generation
Face Generation

A deep convolutional generative adversarial network for generating faces of celebrities.

Accomplishments
  • Tools: Pickle, Matplotlib, PyTorch
  • Implemented a DC-GAN which is trained on the faces of celebrities.
  • The trained model can generate new realistic faces.
TV Script Generation
TV Script Generation

An RNN based model trained on TV Scripts to generate new TV Scripts which can be used by production companies.

Accomplishments
  • Tools: Numpy, Pickle, PyTorch
  • Implemented a Recurrent Neural Network which is trained on many existing TV scripts.
  • This algorithm generates new TV Scripts which can be used by a TV show production company.
Income Classification Web App
Income Classification Web App

A web application to predict whether a person earns more than a threshold or not.

Accomplishments
  • Tools: Pandas, Seaborn, Scikit-Learn, Streamlit, Heroku
  • Several classifiers from SkLearn library were used to predict whether a person earns more than a threshold or not.
  • Best model was then deployed as a web application using Streamlit on Heroku.
Bike sharing Pattern
Predicting Bike-Sharing Patterns

A deep neural network for bike sharing company.

Accomplishments
  • Tools: Numpy, Pandas
  • In this project I Implemented a Deep Neural Network from scratch.
  • The algorithm predicts how many bikes are needed by the company to fulfil the customer requirements on any given day by using the historical data.
Dog Breed Classifier
Dog Breed Classifier

A fine-tuned CNN (VGG-16) to predict the breed of a dog.

Accomplishments
  • Tools:Seaborn, Scikit-Learn, PyTorch
  • I implemented a pre-trained Convolutional Neural Network (VGG-16) using transfer learning, fine-tuned it by training on different dog breeds.
  • The algorithm can classify 113 dog breeds efficiently.
Bill Projection
Bill Projection

A linear regression model hourly/monthly bill projection.

Accomplishments
  • Tools: Matlab
  • This project takes Date, Time, PV, and Temperature as input data and with the help of machine learning algorithms predicts the monthly residential load demand, which is then further used to calculate the bill of that certain month.
Metabolite Identificatoin
NMRware

A GUI based software package for Linux for Metabolite Identificatoin.

Accomplishments
  • Tools: Shell Scripting
  • This software contains the tools for Metabolite Identification. This software was designed using shell programming and is being currently open sourced.

Skills

Languages and Databases

Python
Go
C++
Shell Scripting
HTML5
CSS3
JS
MySQL
PostgreSQL

Libraries

NumPy
Pandas
Matplotlib
Seaborn
Plotly
Scikit-Learn
Librosa
OpenCV
NLTK

Frameworks

Vue.js
Node.js
Django
Flask
Bootstrap
Keras
TensorFlow
PyTorch
Selenium

Other

Git
AWS
Jira
Slack
Streamlit
Heroku

Education

University of Copenhagen

Copenhagen, Denmark

Degree: Ph.D. Computer Science (Natural Language Processing)
Year: 2024 - 2027


Mohamed bin Zayed University of Artificial Intelligence

Abu Dhabi, United Arab Emirates

Degree: M.Sc. Natural Language Processing
CGPA: 3.68/4.0
Year: 2022 - 2024

Nanjing University of Science and Technology

Nanjing, China

Degree: M.Sc. Computer Science and Technology
CGPA: 3.897/4.0
Year: 2021 - 2022

University of Engineering and Technology, Lahore

Lahore, Pakistan

Degree: Bachelor of Electrical Engineering
CGPA: 3.747/4.0
Year: 2016 - 2020

Contact