Oil and Gas

Oil and Gas | Data Analytics

Applied Statistical Modeling and Big Data Analytics

Course Code: N479
Instructors:  Srikanta Mishra
Course Outline:  Download
Format and Duration:
3 days
5 sessions

Next Event

Location: Virtual
Date:  5th - 9th May 2025
Start Time: 14:00 BST
Event Code: N479a25V
Fee From: USD $3,545 (exc. Tax)

Summary

This training course will provide a hands-on introduction to statistical modeling and big data analytics so that participants can use them for petroleum engineering and geoscience applications. Topics to be covered include: (a) easy-to-understand descriptions of the commonly-used techniques, (b) case studies demonstrating the applicability, limitations and value-added proposition for these methods, and (c) hands-on problems sessions using open source and/or commercial software. This course will provide engineers and geologists with practical techniques for identifying hidden patterns and relationships in large datasets and extracting data-driven insights towards actionable information that can contribute to lower cost, improved efficiency and/or increased productivity in oil and gas operations. This class will arm petroleum engineers and geoscientists with advanced capabilities to extract new insights from E&P data that can help: (a) learn hidden patterns and relationships in geologic datasets, (b) identify production sweet spots in developed plays; (c) determine factors responsible for separating good wells from poor producers wells, (d) build fast surrogate models of reservoir performance, and (e) assist in predictive maintenance by identifying failure inducing conditions from historical records.

Schedule

Event Code: N479a25V
Sessions: 5 sessions
Instructors: Srikanta Mishra
Dates: 5th - 9th May 2025
Start Time: 14:00 BST
Location: Virtual
Fee From
USD $3,545 (exc. Tax)
Good Availability
Please login to book.

Duration and Training Method

This is a classroom or virtual classroom consisting of lectures interspersed with worked examples, hands-on exercises and discussions.

The textbook for the course will be “Applied Statistical Modeling and Data Analytics: A Practical Guide for the Petroleum Geosciences” by Srikanta Mishra and Akhil Datta-Gupta (Elsevier, 2017), supplemented by various technical publications.

Course Overview

 Participants will learn to:

  1. Apply foundational concepts in probability and statistics for basic data analysis
  2. Perform linear regression for building simple input-output models
  3. Conduct multivariate data reduction and clustering for finding sub-groups of data that have similar attributes
  4. Converse with confidence about big data, data analytics and machine learning terminology and fundamental concepts, and critically review topical technical publications
  5. Apply machine learning techniques for regression and classification for developing data-driven input-output models
  6. Evaluate proxy modeling and uncertainty quantification studies for probabilistic performance forecasting

Part 1

  • Foundational Concepts
    • Big data technologies, basic data analytics and machine learning terminology/concepts
    • Data, statistics, and probability
    • Distributions (models, fitting distributions to data)
    • Inference (Confidence limits, bootstrap, significance tests)
    • Data visualization
    • Problem session
  • Basic Regression Analysis
    • Linear regression (univariate and multivariate regression)
    • Understanding regression statistics, ANOVA
    • Non-parametric regression
    • Problem session

Part 2

  • Multivariate Data Analysis
    • Dimension reduction (Principal component analysis)
    • Cluster analysis (K-means, hierarchical clustering, self-organizing maps)
    • Problem session
  • Machine Learning Basics
    • Overview of techniques
    • Evaluating model performance (model validation, goodness-of-fit, common pitfalls)
    • Variable importance
    • Model aggregation
    • Case studies

Part 3

  • Machine Learning for Regression and Classification I
    • Classification/regression trees
    • Random forest
    • Gradient boosting machine
    • Problem session

Part 4

  • Machine Learning for Regression and Classification II
    • Support vector machine
    • Neural networks and deep learning
    • Problem session

Part 5

  • Miscellaneous topics and Wrap-up
    • Experimental design and response surface analysis
    • Uncertainty quantification
    • Selected literature review
    • Key takeaways and resources
    • Data analytics do’s and don’t’s

This course is for designed for petroleum engineers, geoscientists, and managers interested in becoming smart users of statistical modeling and data analytics.

Srikanta Mishra

Background
Dr. Srikanta Mishra is currently a Research Professor at Texas A&M University, where he researches Energy Transition and Subsurface Data Analytics. He also serves as a Technical Advisor to the US-DOE NETL’s SMART Initiative (Science Informed Machine Learning to Accelerate Real-time Decisions for Subsurface Applications) and instructs courses on CO2 Geological Storage and Machine Learning.

Dr. Mishra retired in 2023 as the Technical Director for Geo-energy Modeling & Analytics at Battelle Memorial Institute. At Battelle, he led a technology portfolio related to computational modeling and data analytics for geological carbon storage, improved oil recovery projects, and shale gas/oil development. His recent work focused on full-physics and reduced-order modeling, and pressure-based monitoring, of CO2 geologic sequestration projects. He served as PI or co-PI on several CO2 geological storage and EOR projects funded by the US Department of Energy.

Dr. Mishra has presented lectures and conducted short courses on CO2 geologic sequestration in many US universities and in academic and research organizations worldwide. He is an editor of the book “CO2 Injection in the Network of Carbonate Fractures” and the author of ~200 technical publications.

Dr. Mishra is a member of the Technical Advisory Board of the SMART initiative organized by the US Department of Energy’s Carbon Storage Program. He was an SPE Distinguished Lecturer for 2018-19 on Big Data Analytics. He has also served as an Adjunct Professor of Petroleum and Geosystems Engineering at The University of Texas at Austin.

Affiliations and Accreditation
PhD Stanford University - Petroleum Engineering
MS Stanford University - Petroleum Engineering
BTech  Indian School of Mines - Petroleum Engineering

Courses Taught
N479: Applied Statistical Modeling and Big Data Analytics
N480: Introduction to Statistical Modeling and Big Data Analytics
N535: Carbon Capture Sequestration (CSS)
N567: Carbon Capture, Utilization and Storage

CEU: 2.1 Continuing Education Units
PDH: 21 Professional Development Hours
Certificate: Certificate Issued Upon Completion
RPS is accredited by the International Association for Continuing Education and Training (IACET) and is authorized to issue the IACET CEU. We comply with the ANSI/IACET Standard, which is recognised internationally as a standard of excellence in instructional practices.
We issue a Certificate of Attendance which verifies the number of training hours attended. Our courses are generally accepted by most professional licensing boards/associations towards continuing education credits. Please check with your licensing board to determine if the courses and certificate of attendance meet their specific criteria.