Oil and Gas
Oil and Gas | Data Analytics
Introductory Machine Learning with Applications for Petroleum Engineers and Geoscientists
Business impact: This course enables participants to develop an understanding of data-driven workflows from data processing and data QA/QC to the application of various algorithms to gain insights from data that are not readily available with physics-based models. The course will provide participants with: (1) An overview of machine learning terminology and workflows (2) A discussion of use-cases covering a broad spectrum of disciplines in the oil and gas industry (3) Practical hands-on workflows in Python for several use-cases using a diverse set of machine learning algorithms.
Schedule
Duration and Training Method
A classroom or virtual classroom course. Each session begins with a lecture to provide the introduction, mathematical foundations, and theory, followed by examples and Python-based practical exercises.
Software and Datasets: Jupyter notebooks written in Python will be provided to participants with real field datasets. Participants will execute specific tasks in Python to aid in their learning experience.
Course Overview
Learning Outcomes
Participants will learn to:
- Discuss the subsurface applications of machine learning.
- Perform data clean-up, outlier detection and handling, and visualization.
- Apply a structured approach to unsupervised clustering and cluster evaluation.
- Perform supervised methods for classification and regression.
- Critically evaluate the various machine learning algorithms.
Course Content
Introduction
- Terminology related to data-driven modelling
- Key concepts related to supervised, unsupervised, and reinforcement learning
- A discussion of use cases relevant to the engineering and geoscience disciplines
Data Processing
- Basic probability, univariate, and multivariate statistics
- Exploratory data analysis, visualization and plotting multivariate data
- Data imputation for numerical and categorical data
- Outlier handling
Machine Learning Workflows
- Transformations: Scaling, dimensionality reduction
- Unsupervised learning: K-means and K-Prototype clustering, variants of Hierarchical Clustering, DBSCAN, Gaussian Mixture Models
- Supervised learning:
- Model Evaluation, Cross Validation, and Parameter Selection
- Classification: Decision Trees, Random Forests, Gradient Boosting, K-Nearest Neighbours, Support Vector Machines, Neural Networks
- Regression: K-Nearest Neighbours, Neural Networks, Decision Trees and Random Forests, Multilinear Regression
- Feature selection, ranking, and elimination
- Use cases in the E&P industry
Who Should Attend and Prerequisites
The course is designed for geoscientists and petroleum engineers with little to no prior experience in machine learning, seeking to incorporate machine learning tools in their E&P workflows. It is expected that participants have a working knowledge of programming and Python. A Python self-assessment rubric is provided in advance of the course; If participants need a Python refresher, a series of pre-course self-paced videos and exercises will prepare participants with the necessary background for the machine learning course.
Instructors
Deepak Devegowda
Background
Deepak Devegowda is Associate Professor and Mewbourne Chair of Petroleum Engineering at Mewbourne School of Petroleum and Geological Engineering at the University of Oklahoma, where he specialises in Data Analytics and Machine Learning.
Deepak began his career as Technical Project Leader of Wireline Services for Halliburton in Egypt and India, before completing an MSc and then PhD at Texas A&M University. In 2008, he took the position of Assistant Professor at the School of Petroleum and Geological Engineering at the University of Oklahoma, before undertaking his current role in 2014.
Deepak's acted as Chairperson for the Reservoir Monitoring Technical Committee at SPE Annual Technical Conference and Exhibition in 2019. He is also a technical committee member of the Reservoir Monitoring Sub-Committee for SPE ATCE.
Affiliations and Accreditation
PhD Texas A&M University - Petroleum Engineering
MSc Texas A&M University - Petroleum Engineering
SPE, SEG, SPWLA
Courses Taught
N595: Introductory Machine Learning with Applications for Petroleum Engineers and Geoscientists
N596: Advanced Machine Learning for Subsurface Applications