Proven Data Scientist with 3+ years of experience building and deploying ML pipelines for big data using Azure tools. I leverage my expertise in predictive modeling, feature extraction, computer vision, and time series analysis to solve complex environmental challenges. My passion lies in transforming intricate data into actionable insights that drive business optimization and innovation.
I have developed an intelligent early warning system (IEWS) that utilizes machine learning and computer vision algorithms to forecast outbreaks of arboviral diseases such as Dengue, Yellow Fever, and Zika. The system analyzes environmental and meteorological conditions in tropical regions, and it also incorporates image processing to detect and quantify specific mosquito species in photographs submitted by the local community. I have used Python, OpenCV, TensorFlow, and PyTorch for efficient and accurate small object detection in real time. Read more about this project.
I have designed and implemented an online framework for calculating the shortest path for a vehicle routing system based on multiple distance formats interfaced by a rich REST API service written with ASP.Net. I have utilized deep reinforcement learning algorithms to simulate and optimize the shortest path for a vehicle routing system. The system is capable of handling multiple distance formats such as Euclidean, Manhattan, and real road distance. The system is also capable of handling vehicle capacity and time window constraints.
Semantic segmentation for images can be defined as partitioning and classifying the image into meaningful parts and classifying each part at the pixel level into one of the pre-defined classes.
One issue of this project is the small training dataset size with the total number of training images of 25 object-labeled images. To overcome this problem, we use U-Net architecture, an extended form of Fully Connected Network FCN. The main idea of this approach is to use a CNN as a powerful feature extractor while replacing the fully connected layers with convolution ones to output spatial maps instead of classification scores. Those maps are up-sampled to produce dense per-pixel output. This method allows training CNN in the end-to-end manner for segmentation with input images of arbitrary sizes. Read more
Accurately predicting earthquakes is critical for several reasons:
Predicting earthquakes, especially in the short term (hours or days in advance), is notoriously difficult. This stems from the inherent complexity of earthquakes themselves.
This project tackles this challenge by simplifying the problem. We will use a set of techniques to convert the prediction of these continuous variables into a categorical classification problem. In simpler terms, instead of pinpointing the exact magnitude, location, and time, we will aim to categorize them into predefined ranges (e.g., high magnitude, low magnitude). This approach can potentially improve the accuracy of short-term earthquake prediction. Read more