Image by author

This blog follows a first one that I named “Basic DataFrame Manipulation using Pandas” which I published a few months ago, where I tried to give some tips and show short workflows that I use on a regular basis. Most of it, if not all of the content was for beginners, so I decided to do a second one with slightly more complex functions and methods which I broke down into two parts, not to make a super long blog tedious to follow.

Once again, I will try to do this as an exercise starting from creating my own DataFrame…


Flight delays has become a very important subject for air transportation all over the world because of the associated financial loses that the aviation industry is going through. According to data from the Bureau of Transportation Statistics (BTS) of the United Stated, over 20% of US flights were delayed during 2018, which resulted in a severe economic impact equivalent to 41 billion US$.

These delays not only cause inconveniences to the airlines but also to the passengers. The result is an increase in travel time which increases the expenses associated with food and lodging and ultimately causes stress among passengers…


Recently I wrote a blog titled “Pneumonia Detection From X-ray Images Using Deep Learning Neural Network” where I presented the results of what I chose to be the best out of 15 different model architectures that I created to solve a binary classification problem. For the readers not familiar with this, what it means is that my model will predict only a “0” or a “1”. A “0” corresponds to NORMAL or NO PNEUMONIA, and a “1” to PNEUMONIA.

Before I continue, don’t worry if you have not read my previous blog, you won’t need it to understand this one…


For the last months I have been writing a few blogs about the projects that I had to do at Flatiron as part of the requirements of the Data Science bootcamp that I’m doing. Now I’m almost at the end of the program and I’m still missing a couple blogs. One is a mandatory one about our Capstone Project and the second I decided to write about something that I really enjoy in Python which is manipulating dataframes with Pandas.

If by any chance you don’t know what these two are (Pandas and dataframes), Pandas (Python Data Analysis Library) is…


This blog is probably going to be in terms of coding the most technically driven that I have written so far, and it will be about a project using Deep Learning Neural Networks to classify a set of structured X-rays images from pedriatic patients to identify whether or not, they have pneumonia. The Neural Network that I chose is the Convolutional Neural Network (CNN/ConvNet), mainly because it is known to perform really well with image classification.

This project is not intended as a study of Pneumonia itself, but for detecting it from X-rays images by using Deep Learning Neural Networks…


This is the third project that I am doing for my Data Science Bootcamp at Flatiron and as you can read on the title, this one is about the NBA. For this project we had a series of datasets to chose from, or you had the freedom to get your own data as well, so I decided to go with something that I really like and that is basketball.

I am not an expert in basketball but I have been following it for almost 30 years now so I think I know a bit about it. …


After 4 more weeks of intense bootcamp at flatiron I’m just about to submit my module 2 project so once again I need to write a blog about anything related to the project. Because I’m tired as this has taken me a lot longer than expected I will go with the easiest and that is to describe the project. At the end of my first blog I promised I was going to add more code into this second blog so I will try my best to do so.

In this project I have to create a linear regression model to…


I’m currently a full time student at flatiron doing the online data science program and as part of the final project of Module 1, every student has to post a blog of some finding, some nifty code, or something that we found interesting related to the subject of the project.

My background is in geophysics and prior to starting this intensive program I had no coding experience, so I have found this a fascinating but very complex new world.

As you can imagine, in 3 weeks of online training, the chances of me finding a proper “nifty code” are vague…

Javier Herbas

Data Scientist formerly an engineer with 20 years of experience in the energy industry. Fan of ML and DL and everything related to AI

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store