Identification of outliers is an essential step in the machine learning workflow

Photo by Will Myers on Unsplash

Outliers are anomalous points within a dataset. They are points that don’t fit within the normal or expected statistical distribution of the dataset and can occur for a variety of reasons such as sensor and measurement errors, poor data sampling techniques, and unexpected events.

Within well log measurements and petrophysics…


Part 2 in a series going from Exploratory Data Analysis to Machine Learning with Well Log Data

Photo by Markus Spiske on Unsplash

Machine learning and Artificial Intelligence are becoming popular within the geoscience and petrophysics domains. Especially over the past decade. Machine learning is a subdivision of Artificial Intelligence and is the process by which computers can learn and make predictions from data without being explicitly programmed to do so. …


An example of exploring petrophysical and well log measurements using a number of plots from Seaborn and Matplotlib

Photo by Markus Spiske on Unsplash

Machine learning and Artificial Intelligence are becoming popular within the geoscience and petrophysics domains. Especially over the past decade. Machine learning is a subdivision of Artificial Intelligence and is the process by which computers can learn and make predictions from data without being explicitly programmed to do so. …


A short guide on multiple options for renaming columns in a pandas dataframe

Photo by Giulio Gabrieli on Unsplash

Ensuring that dataframe columns are appropriately named is essential to understand what data is contained within, especially when we pass our data on to others. In this short article, we will cover a number of ways to rename columns within a pandas dataframe.

But first, what is Pandas? Pandas is…


Understand your data distribution and identify outliers in petrophysics and well log data using boxplots

Multiple boxplots with different y-axis ranges generated using matplotlib in python. Image by author.

Boxplots are a great tool for data visualisation, they can be used to understand the distribution of your data, whether it is skewed or not, and whether any outliers are present. …


Visualising well log data versus depth using the matplotlib library from Python

Well log plot created using the matplotlib Python library. Image by author.

Introduction

Well log plots are a common visualization tool within geoscience and petrophysics. They allow easy visualization of data (for example, Gamma Ray, Neutron Porosity, Bulk Density, etc) that have been acquired along the length (depth) of a wellbore. …


Use scatter plots to visualise the relationship between variables

Neutron density scatter plot / crossplot created with matplotlib in python. Image by the author.

Introduction

Scatter plots are a commonly used data visualisation tool. They allow us to identify and determine if there is a relationship (correlation) between two variables and the strength of that relationship.

Within petrophysics scatter plots, are commonly known as crossplots. …


Visualising the distribution of data with histograms

Photo by Marcin Jozwiak on Unsplash

Introduction

Histograms are a commonly used tool within exploratory data analysis and data science. They are an excellent data visualisation tool and appear similar to bar charts. However, histograms allow us to gain insights about the distribution of the values within a set of data and allow us to display a…


Examples of machine learning to enhance your petrophysical workflow

Photo by Dan Meyers on Unsplash

Several decades of hydrocarbon exploration have led to the acquisition and storage of large quantities of well related measurements, which have been used to characterise the subsurface geology and its hydrocarbon potential. The potential of these large volumes of data has been increasingly exploited over the last couple of decades…


A Python library dedicated to loading and exploring well log LAS files

Photo by ali elliott on Unsplash

The welly library was developed by Agile Geoscience to help with loading, processing, and analysing well log data from a single well or multiple wells. The library allows exploration of the metadata found within the headers of las files and also contains a plotting function to display a typical well…

Andy McDonald

Petrophysicist, Geoscientist and data Scientist with a passion for data analytics, machine learning, and artificial intelligence.

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store