Python for Oil and Gas
Divyanshu vyas | dvyas13ad@gmail.com
PDPU | IIT(ISM) Dhanbad,India
Experience:
RezLytix (Intern-2020) - Reservoir Simulation | ML/Deep-Learning | Python
Founder - Petroleum From Scratch
List Of Contents in this Notebook -
1. Basics of Core Python with Petroleum Examples.
2. Data Structures with Focus on Lists and Dictionaries.
3. Data Analysis : NumPy and Pandas.
4. Data Visualization : RelPerm, Fractional Flow and Nodal Analysis examples.
5. Python based Reservoir Simulation.
6. Overview on Machine Learning in Oil and gas.
7. Learning Resources.
Useful Links-
1. Contact Me at : www.linkedin.com/in/divyanshu-vyas
2. For More of my projects : https://github.com/Divyanshu-ISM
Oil and Gas Projects: https://github.com/Divyanshu-ISM/Oil-and-Gas-data-
analysis
3. Petroleum From Scratch: https://www.linkedin.com/company/petroleum-from-
scratch/?viewAsMember=true
Welcome Everyone!
In the coming 3 Sessions, We'll be learning Reservoir Simulation a lot of interesting things in this Mini-
Course.
My Aim is to take you to a better, con dent and super comfortable place as far as Python for Oil and Gas
is concerned.
So Gear UP!
P.S - I wish I could give a little TED-Talk but We have a lot to Learn.
I will Assume you know nothing about anything I'm going to talk about!
Here goes.
Just a note on why Python?
Available.
Understandable. Accessable. multi-Utility.
Interface : Google Colab
Direct link : https://colab.research.google.com/drive/1nBMrS3SBARNQQR9s0iif3E1rnqND7FRq
PYTHON from below zero!
List of contents -
First We'll Talk about Python's Data Structures-
1. Single Values --> Variables.
2. Multiple Value containers ---> Lists.
-> Other Containers - a) Tuples b) Sets
-> tuples: https://www.w3schools.com/python/python_tuples.asp
3. Named/Labelled Containers ---> Dictionaries.
# Single Values are Stored in Variables.
# Variable names cannot start with a number or special character.
# Underscore is allowed.
#Ex - 007bond is invalid
#Ex - bond007 is valid
#Ex- bond_007 and _007_bond valid.
# 007bond = 5
bond007 = 5
#This is called initialization.
#The value from RHS gets stored into the variable at LHS.
x = 5
y = 3
name = 'dvyas'
Now let's Look at Data Manipulations.
print('value = {x + y}')
value = {x + y}
#Let's Also get used to the print function:
print('\nHello Guys!')
#Escape Sequence \n helps change line.
print('\nHello \nGirls!\n')
#displaying variable's value and string using f'...{}'
print(f'0. This is x: {x} and this is y: {y}')
#1. Addition, subtraction : x + y , x - y
print(f'1. Addition Result: {x+y}')
#Multiplication : x*y
print(f'2. Multiplication Result: {x*y}')
#Power x^y : x**y
print(f'3. Square of x is: {x**2}')
print(f'3. x raised to yth power is: {x**y}')
#Mixed operations - Forget BODMAS. Put clusters in brackets.
z = (x+y)*(x-y)/(x**2 - 2*x + y)
print(f'4. Z is = {z}')
Hello Guys!
Hello
Girls!
0. This is x: 5 and this is y: 3
1. Addition Result: 8
2. Multiplication Result: 15
3. Square of x is: 25
3. x raised to yth power is: 125
4. Z is = 0.8888888888888888
p = 800
print('The value of p is {p} or 800')
print(f'The value of p is {p}')
The value of p is {p} or 800
The value of p is 800
# x*y
# x+y
# x**2
# 5%2
#Example. Calculating hydrostaticc pressure.
rho = 15 #ppg
D = 5000 #ft
P = 0.052*rho*D
print(f'P = {P} psi')
P = 3899.9999999999995 psi
BOOLEANS and Comparisons.
This section deals with checking for equality or inequality.
print(x == y)
# == , x>y , x>=y , x<y
print(5 == 5)
print(x>y)
print(y>x)
False
True
True
False
num = int('5')
type(num)
int
if-elif-else
if 5>3:
print('five is greater than 3')
elif 5==5:
print('five is equal to five')
else:
print('this will only get executed if above 2 fail')
five is greater than 3
Let's Now talk about data structures one by one.
1. LISTS - [.... ] | Change-able
List Methods Reference - https://www.w3schools.com/python/python_ref_list.asp
l1 [ porosity , 0.25, perm , 50]
#[ ..... ]
# print(l1)
# l1[-1]
# l1[3]
#Operations (Methods) on list.
#1. Append - Adding element to the tail.
l1.append('new_rock')
print(l1)
['porosity', 0.25, 'perm', 50, 'new_rock']
#2. listname.count(a) - counts no. of occurences of a in listname.
rocks = ['sst' , 'lst' , 'sst' , 'lst']
rocks.count('sst')
# listname.reverse() - Reverses the list.
porosities = [0.50,0.20,0.15,0.30,0.05,0.10]
porosities.reverse()
porosities
[0.1, 0.05, 0.3, 0.15, 0.2, 0.5]
# listname.sort() - Sorts the list.
perms = [50,100,150,200,20,70,500]
perms.sort()
perms
[20, 50, 70, 100, 150, 200, 500]
List Indexing and Slicing -
#Indexing - Accessing the values stored in a list based on index #.
#Indexing starts from 0 (and not 1. In MATLAB it starts with 1)
poro = [0.10,0.20,0.30,0.40,0.50]
print(poro[0])
print(poro[1])
print(poro[-1])
print(poro[-2])
0.1
0.2
0.5
0.4
#Slicing. Just like the name says.
#It helps cut a slice (sub-part) off a List.
superset = [1,2,3,4,5,6,7,8,9]
#Slicing syntax - listname[start:stop:step] start is included. stop is not
subset = superset[0:5]
print(subset)
subset2 = superset[:5]
print(subset2) #Skipping a part also works for first and end indices.
subset3 = superset[:-1] #starting to ending -1
print(subset3)
subset4 = superset[:] #everything
print(subset4)
[1, 2, 3, 4, 5]
[1, 2, 3, 4, 5]
[1, 2, 3, 4, 5, 6, 7, 8]
[1, 2, 3, 4, 5, 6, 7, 8, 9]
Question. How do you reverse a list without using a .reverse() method?
#Ans-
reverse_set = superset[-1::-1]
#Start at -1 th index, end at 0th, take negative steps.
print(reverse_set)
[9, 8, 7, 6, 5, 4, 3, 2, 1]
The same operations can be applied to strings.
name = 'reservoir_simulation'
print(name[0])
print(name[-1])
print(name[0:5])
print(name[::2])
print(name[-1::-1])
r
n
reser
rsrorsmlto
noitalumis_riovreser
DICTIONARIES
Absolute gem of a DataStructure.
1. Helps Store Data with labels.
2. Syntax => { key : value }
3. Has no order.
4. Can be directly converted to DataFrames (tables).
rock_properties = {'poro' : 0.25, 'perm' : 150 , 'lith' : 'sst'}
print(rock_properties)
rock_properties['poro']
{'poro': 0.25, 'perm': 150, 'lith': 'sst'}
0.25
#Just an example of what do i mean when i say they
#can be converted to tables.
import pandas as pd
rock_data = {'phi' : [0.10,0.20,0.30,0.50], 'perm' : [50,100,150,200]}
data_table = pd.DataFrame(rock_data)
data_table
#We'll cover More of Pandas later.
ITERATIONS
The tool with which we can utilize the power of computers.
Loops/Iterations are very important for Reservoir Simulation as well.
1. For Loops
2. While Loops
Iterations are always performed on Iterables (ordered-collections).
Examples of iterables - lists, strings etc.
#We can perform repetition of a command a 1000 times in 1 second.
my_numbers = [1,2,3,4,5,6,7,8,9]
for num in my_numbers:
print(num)
#example 2:
for i in range(len(my_numbers)):
print('*')
Practical Applications of a loop:
To iterate over all values of a list to update it.
While Loop:
# while condition1:
# runs if condition 1 satisfied
#update statement
t = 1
while t<10:
print(f'Current value of t = {t}')
t = t+1
#Note that if you don't give the
#update statement, You can run an infinite Loop.
Current value of t = 1
Current value of t = 2
Current value of t = 3
Current value of t = 4
Current value of t = 5
Current value of t = 6
Current value of t = 7
Current value of t = 8
Current value of t = 9
Hands on Practice with Functions
#Functions are very important.
#1. Make the code clean.
#2. Helps Prevent repetitive commands.
kh
q = delP
141.2∗μ∗B∗ln(re/rw)
#So here the term except delP is important.
#And we can use it again and again. So we can create a function of it.
#function definition.
def slope():
k = int(input('Enter K: '))
h = int(input('Enter h: '))
mu = int(input('Enter viscocity: '))
B = float(input('Enter FVF: '))
lnrerw = 0.5
m = k*h/(141.2*mu*B*lnrerw)
return m
#function call to get the value function returns.
#Note that this example also teaches how to take user input.
delP = 500 #psia
m = slope()
q = m*delP
print(f'The flowrate at {delP} psi drawdown is {q} STB/D')
###########################################################
Mini-Project 1: Finding the square root of a number using python.
Background - Numerical Methods - Newton Raphson's
Formula:
if we have a function f(x) = 0 and we want to nd the root,
f (xn )
x(n + 1) = xn − ′
f (xn )
#Solution:
# x = root(N)
# x^2 - N = 0
# f(x) --> x2 - N
n = float(input('Enter the Number to be square rooted: '))
x = int(input('Enter your initial Guess of the root: '))
while abs(x**2 - n) >= 0.0000001:
x = x - (x**2 - n)/(2*x)
print(f'The final root is: {abs(x)}')
Similarly we can nd the roots of any transcendental equation f(x)
= 0 using this method.
Home Task - Try to nd the Klinkenberg (corrected) Permeability using this
method.
Refer My Github. This is one of my most favourite mini projects.
Link for Problem Description and Solution : https://github.com/Divyanshu-ISM/Oil-and-Gas-data-
analysis/blob/master/KlinkenbergEffect.ipynb
###########################################################
Welcome to Day 2. Here's the solution to the Klinkenberg Home Task
6.9kL^(0.64) + pmkL - pm*kg = 0 ...this is the equation.
k(i+1) = k(i) - f(ki)/f'(ki)
where, f'(ki) = 4.416 ki^(-0.36) + pm
The iterative procedure is repeated until convergence is achieved when f(ki) approaches zero or when no
changes in the calculated values of ki are observed.
Problem Statement- The permeability of a core plug is measured by air. Only one measurement is made at
a mean pressure of 2.152 psi. The air permeability is 46.6 md. Estimate the absolute permeability of the
core sample. Compare the result with the actual absolute permeability of 23.66 md.
k = input("\nEnter your first guess of absolute permeability kl: ")
k = float(k)
#f(k= kl) = 6.9kL^(0.64) + pmkL - pm*kg = 0
# f'(k) = 4.416 ki^(-0.36) + pm
pm = input("\nEnter Mean Pr. Pm: ")
pm = float(pm)
kg = input("\nEnter K(air) = kg: ")
kg = float(kg)
while abs(6.9*(k**0.64) + pm*k - pm*kg) > 0.01:
k = k - (6.9*(k**0.64) + pm*k - pm*kg)/(4.416*(k**(-0.36)) + pm)
print(f"The final value of Perm K is : {k}")
################################################################
PART 2 Of the Session
Data Analysis with NumPy and Pandas
A. NumPy
Python ----------> A Scienti c Calculator
NumPy stands for Numerical Python. It brings a lot of Casio-991Ms utilities to Python.
It is built on top of C++ and C++ is quick as ash. So, NumPy operations help in computational
e ciency.
NumPy is famous for its object - NumPy arrays. Which helps store collection (just like list) of
numbers in form of an object that can be treated and manipulated just like a number.
Why NumPy? Take a look at this disadvantage of lists.
num1 = [1,2,3,4,5]
num2 = [5,6,7,8,9]
# print(num1*num2)
print(num1 + num2)
#So, a dot product is not possible unless you use for loops.
#And, addition term by term doesn't happen this easy. It rather concatenates.
#So we need for loops for manipulating data with lists. and that's bad.
sum = []
for i in range(len(num1)):
s = num1[i] + num2[i]
sum.append(s)
print(sum)
# #Let's use NumPy now.
We can directly convert a list to a NumPy array.
#Step 1. Import numpy with its alias name 'np'
import numpy as np
#Lets first Directly convert the list to NumPy arrays -> use np.array(listname)
arr1 = np.array(num1)
arr2 = np.array(num2)
#And Let's now solve the List Disadvantages using NumPy
print(arr1)
print(arr2)
print(arr1*arr2)
print(arr1 + arr2)
#SO, This is how easy it gets with NumPy arrays.
#Let's now check more features out.
1. np.linspace( start, stop, no.ofvalues)
saturations = np.linspace(0,1,100)
#Both start and stop values are included as the first and last values of the array.
#Creates saturation array with 100 values: starting from 0-100
# print(saturations)
#Applications? krw = sw^3 kro = (1-sw)^3
# saturations**2
sw = np.linspace(0,1,50)
krw = sw**3
kro = (1-sw)**3
#BRilliant. It worked.
#Further when we will plot these Rel-Perm arrays, you'll get more confident.
2. np.arange(start,stop,interval) --> [start,stop)
#Will be useful for Reservoir simulation. Space-Time Arrays.
#Suppose you want to build an array for days. from t = 0th day to t= 500th day.
#Time step = 20 days.
t = np.arange(0,520,20)
3. NumPy Universal Functions (uFuncs)
Reference link - https://www.w3schools.com/python/numpy_ufunc.asp
This is where we get the advantages of a scienti c calculator in python.
Also do check out the math library.
a = np.array([1,2,3,4,5])
b = np.array([4,5,6,7,8])
#1.
np.add(a,b)
#2. Power
arr1 = np.array([10, 20, 30, 40, 50, 60])
arr2 = np.array([3, 5, 6, 8, 2, 33])
newarr = np.power(arr1, arr2)
print(newarr)
#3. Logarithms - base e,10,custom
#3.A)-Log base 2
arr = np.arange(1, 10)
print(arr)
print(np.log2(arr))
print(np.exp(arr))
# Use the log10() function to perform log at the base 10.
# Use the log() function to perform log at the base e.
#4. Trigonometry.
print(np.sin(np.pi/2))
print(np.arcsin(1.0)) #which must be pi/2
print(np.pi/2)
####################################################################
CONCLUSION-
Use Lists when your aim is to just store.
Use NumPy arrays when your aim is Quick Computations.
Double-click (or enter) to edit
Pandas - The Ms.Excel of Python
This library helps us import | create | work with data in the form of tables.
The tables are called DataFrames.
1. We can directly convert a Dictionary into a DataFrame.
2. We can import excel-sheets or CSV les (most popular) into DF.
3. We can manipulate and use these tables in a very user-friendly way.
Case 1: Dictionary to Pandas DataFrame.
#Step 1: Import Pandas with an alias 'pd'
import pandas as pd
#Step 2: Create your dictionary
cambay_rocks = {'phi': [0.2,0.40,0.30,0.25,0.270],
'perm': [100,20,150,130,145],
'lith': ['sst','shale','sst','sst','sst']}
#Step 3: Create your Table.
rock_table = pd.DataFrame(cambay_rocks)
#Step 4: Print your table.
# rock_table
# rock_table.tail(n=3)
Case 2: Importing Data From a CSV (Comma Seperated Values)
File.
A csv le can be imagined as a table with the lines being replaced by commas.
df_csv_to_dataframe = pd.read_csv('/content/sample_data/california_housing_train.csv')
#Similarly excel file can be read by-
#df = pd.read_excel('\path\filename.csv')
# df_csv_to_dataframe.head()
Description
Note that the Pandas DF has two parts-
1. The vertical Columns.
2. The Horizontal Rows.
rock_table
The Columns individually are labelled arrays.
And these 'arrays-with-labels', are called a 'Series'.
Series, vertically stacked next to each other forms a DF.
#Let's Manipulate data using Pandas.
df.head()
df_csv_to_dataframe.shape
df.info()
# df.describe()
A. Accessing Columns
#First lets list down all the columns
df_csv_to_dataframe.columns
rock_table.columns
#For example you just want to access phi.
rock_table['phi']
rock_table[['phi']]
rock_table[['phi','perm']]
rock_table
Accessing Rows
df.iloc[::] ..i loc means index wise locating rows.
df.loc[r1:r2 , c1:c2] .. loc (starts with l) means label wise. PREFERABLE.
rock_table.loc[0:2,'phi':'perm']
import pandas as pd
df_temp = pd.read_csv('/content/sample_data/california_housing_train.csv')
df_temp.head(n=3)
#Question 1: Try to Access 5-->10 row. and All columns.
df_temp.loc[5:10,:] #Note that Both 5 and 10 are included.
#Question 2: Try to access 20th to 30th row and first five columns.
df_temp.loc[20:30,:5] #This will throw an error since you cannot use column indices with loc.
df_temp.iloc[20:30,:5] #The same slicing as above works if we use iloc instead of loc.
#The same thing can be done by using loc but with column names.
df_temp.loc[20:30,'longitude':'total_bedrooms'] #The only difference is that loc includes it all. Bo
Reference for more on data manipulations-
https://www.geeksforgeeks.org/pandas-tutorial/
My Practice Repository-
https://github.com/Divyanshu-ISM/Oil-and-Gas-data-analysis/blob/master/GL_pandas_1.ipynb
##########################################################################
Visualization
Library - matplotlib.pyplot (alias = plt)
1. Line-Plots. :- plt.plot(x,y)
2. Scatter-Plots. :- plt.scatter(x,y)
Example 1:- Let's plot a parabola y = x^2.
#Step 1: Import the library(s).
import numpy as np
import matplotlib.pyplot as plt
%matplotlib inline
#Step 2: create numpy arrays for x and y.
x = np.linspace(-10,10)
y = x**2
#Step 3: Voila! Plot now.
plt.plot(x,y)
Customizations to the above plot.
Plotting steps in short :- F S P L L G
1. F- Figure ( gsize=(h,v))
2. S- Style (use default)
3. P- Plot (plt.plot(x,y,label=)
4. L- Labels for axes. (xlabel,ylabel..) Also, Titles.
5. L- Legend. (Optional)
6. G - Grid (Optional)
#0. Set the figure size.
plt.style.use('default')
plt.figure(figsize=(6,4)) #6X6 canvas.
# plt.style.use('default')
# 1. generate the plot. Add a label.
plt.plot(x,y,label='It is a parabola')
plt.plot(x,x,label='Y=X')
plt.plot(x,x**3,label='Y=X^3')
#2. Set x axis label
plt.xlabel('This is X-Axis')
#3. Set y axis label.
plt.ylabel('This is Y-Axis')
#4. Set the title.
plt.title('TITLE here.')
#5. set the grid.
plt.grid()
#6. display the label in a legend.
plt.legend(loc='best')
########################################################################
Oil and Gas Examples.
Example 1. draw relative permeability curves.
sw = np.linspace(0.001,1)
krw = sw**3
kro = (1-sw)**3
plt.plot(sw,krw)
plt.plot(sw,kro)
plt.title('This is Rel-Perm Curve')
plt.grid()
Example 2. (Home Task? Yes or No) Plot Fractional ow curve.
fw = 1/(1 + 1/M) M = (krw/mu_w)/(kro/mu_o)
Assume mu_o = 1000 cp (heavy oil) mu_w = 1 cp
mu_o = 1000
mu_w = 1
M1 = (krw/mu_w)/(kro/mu_o)
fw = 1/(1 + (1/M1))
plt.plot(sw,fw,label = 'Thin water-Thick oil')
M2 = (krw/100)/(kro/mu_o)
fw2 = 1/(1+(1/M2))
plt.plot(sw,fw2,label='Thick water-Thick oil')
plt.legend(loc='best')
##########################################
Example 2- Nodal Analysis.
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
%matplotlib inline
#STEP 1:
#First Let's Get the IPR dataset. (Source : PetroWiki)
ipr = pd.DataFrame({'Pwf':[4000,3500,3000,2500,2000,1500,1000,500,14.7],
'Q': [0,1999,3094,3902,4512,4963,5275,5458,5519]})
ipr
#STEP 2
#Now TPR Data
q = np.arange(1000,6500,500)
p_190 = [1334,1400,1487,1592,1712,1843,1984,2132,2287,2446,2609]
p_2375 = [1298,1320,1351,1390,1435,1487,1545,1609,1677,1749,1824]
p_2875 = [1286,1294,1305,1319,1336,1356,1378,1403,1431,1461,1493]
tpr = pd.DataFrame({'q':q, 'Pwf(1.90" tbg)':p_190, 'Pwf(2.375"tbg)': p_2375,'Pwf(2.875"tbg)':p_2875}
tpr.head(3)
Plotting steps in short :- F S P L L G
1. F- Figure ( gsize=(h,v))
2. S- Style (use default)
3. P- Plot (plt.plot(x,y,label=)
4. L- Labels for axes. (xlabel,ylabel..) Also, Titles.
5. L- Legend. (Optional)
6. G - Grid (Optional)
l = [1,2,3]
a = np.array(l)
d = {'xyz':a}
d1 = {'vv': l}
print(d)
print(d1)
#STEP 3: PLOT
#Let's Plot them now.
plt.figure(figsize=(8,4))
plt.style.use('default')
plt.plot(ipr['Q'], ipr['Pwf'], label = 'IPR', linewidth=5)
plt.plot(tpr['q'],tpr['Pwf(1.90" tbg)'],label='TPR for 1.9" tubing', linewidth=3)
plt.plot(tpr['q'],tpr['Pwf(2.375"tbg)'],label='TPR for 2.375" tubing', linewidth=3)
plt.plot(tpr['q'],tpr['Pwf(2.875"tbg)'],label='TPR for 2.875" tubing', linewidth=3)
plt.xlabel('FlowRate(Mscf/D)')
plt.ylabel('Flowing BHP(Psia)')
plt.title('Nodal Analysis for a Gas Well')
plt.grid()
plt.legend(loc='best')
###########################################################################################
###########################################################################################
PART 3: Reservoir Simulation Foundations (From Scratch).
Keywords: Heterogeneity | PDEs | Discretization | Finite Differences
Why Reservoir Simulation?
Okay. Imagine you are in a world which has no World-Map. Now, you want to travel somewhere for
vacations. To some remote country.
1. You would travel to antartica without having a clue it is freaky cold.
2. You'll not pack any cold wear because you had no info/input data.
How-ever Now in a Parallel Universe, Imagine you do have a Map. And the Input data like Weather Report
for locations.
1. You would probably travel to somewhere more pleasant.
2. If you expect winter, you'd also pack sweaters.
BRILLIANT! Now you understand the intuition behind why do we need a
Reservoir Simulation/Model (the world map).
1. It helps us decide between a set of development options.
2. You can pick a metric, say Recovery E ciency. Then Run several realizations and calculate this RF.
3. The one which gives the most/highest RF. Is. The. Choice you go ahead with!
SIMPLE Reservoir Models you already know about.
1. Volumetric Model - Assumes a cement/sand box kind of reservoir with constant properties.
Ahϕ(1−Sw )
N =
Bo
2. Material Balance (Tank Model) - Zero Dimensional.
Note.
In M-BAL, The mass balance involves no foreign addition or ejection of mass.
In Res. Simulation, the mass balance can have a well term as well.
What's Lacking? - Reality! Practicality! Flow Model! Heterogeneity!
In comes the need for Reservoir Simulation.
1. You divide the Entire Reservoir into discrete Grid-Blocks.
Sand Box (Cube) --> Rubix cube
2. Each Grid block can have different properties unique to them-selves.
3. Hence a discontinuous-homogenous model is what we can call it. In other words, we have captured
Heterogeneity.
4. Finer the grid/ More the number of grid blocks, closer to reality we get.
Mathematical Base-
The Entire Game is based on the Heat-Equation. Which is nothing but a PDE.
Intuition Behind a PDE
Take an IRON-ROD. (1D reservoir). Now Heat it from one end using a candle.
Two things happen.
The Heat travels from Candle-End to the Other End. (Space
dependence). The Rod gets hotter at every point with time.
(Time Dependence).
And this exact Space-Time Dependence is given by a Heat-PDE.
1. PDE- (Heat case)
2
∂ T 1 ∂T
2
=
∂x η ∂t
2. PDE- (Pressure case)
2
∂ P 1 ∂P
2
=
∂x η ∂t
####################################################
#Next up
#1. Space Time Discretization of the above PDE.
#2. Explain how you can code it.
#3. Convert the whole scene into Reservoir Scenes and talk about that.
Space Discretization: (LHS)
2
∂ P P [i+1]−2P [i]+P [i−1]
2
= 2
∂x △x
Time Discretization: (RHS)
T t
1 ∂P P −P
=
η ∂t η△t
Where, T is the next time step (t+1) and t is the current.
Equating both we get the heart of the Reservoir Simulation (Flow
Equation)
You can see that it is an algaebric equation.
WorkFlow (Ertekin et al.)
Example 1. Understanding Simulation by a Heat Flow example.
Problem statement : T(w) = 40 C | T(e) = 20 C : Left to right heat ow with L lengthed rod and
n nodes
Solution: https://github.com/Divyanshu-ISM/Oil-and-Gas-data-
analysis/blob/master/ResSimultn_HeatDiff_1D.ipynb
Example 2. Pressure diffusion- A 1D Reservoir Simulation.
import numpy as np
import math
import matplotlib.pyplot as plt
%matplotlib inline
import pylab as pl
from IPython import display
import time
# at x = 0 well is there q = KA/u * (-Pi+1 + P0)/dx
# P1 + qudx/kA
k_md = 50
k_m2 = k_md*0.987E-15
phi = 0.25
ct_psi = 1E-03
ct_pa = ct_psi*14.7/101325
Mu_cp=3
Mu_Pas=Mu_cp*10**(-3)
diffusivity=k_m2/phi/Mu_Pas/ct_pa
q = 5 #bbls/day
q_m3persec = q/6.29/(24*60*60)
A = 5000 #m2
alpha = q_m3persec*Mu_Pas/(k_m2*A)
L = 10 #m
n = 100
dx = L/n
x = np.arange(dx/2,10, dx)
dt = 2
t = np.arange(0,181,dt)
beta1 = diffusivity*dt/dx**2
m = int(input('Enter The order you want to increase diffusivity with: '))
beta2 = m*beta1
P1 = np.ones(len(x))
P2 = np.ones(len(x))
Pi = 205
pw = 175 #pa
P1 = np.ones(len(x))*Pi
P2 = np.ones(len(x))*Pi
ax1 = plt.figure(figsize=(12,5))
plt.grid()
plt.style.use('fivethirtyeight')
# P1 = np.zeros(len(x))
for j in range(1,len(t)):
plt.clf()
for i in range(1,n-1):
P1[i] = P1[i] + beta1*(P1[i+1] - 2*P1[i] + P1[i-1])
P2[i] = P2[i] + beta2*(P2[i+1] - 2*P2[i] + P2[i-1])
P1[0] = 175
P1[n-1] = 205
P2[0] = 175
P2[n-1] = 205
plt.plot(x,P1,label='beta1 (lower diffusivity)')
plt.plot(x,P2,label='beta2 (higher diffusivity)')
plt.xlabel('distance from the well, metres')
plt.ylabel('Pressure in the reservoir, Pa')
plt.legend(loc='best')
display.clear_output(wait=True)
display.display(pl.gcf())
time.sleep(1.0)
#############################################
Conclusions about Python Based RE Simulation-
1. Our Goal is not to 'beat' any Commercial Simulator.
2. The aim is to get our brains well equipped with the work ow.
3. With python, you'll understand more about the practical work ow rather than worrying
about calculations.
####################################################################################################
####################################################################################################
#########################################################################################
Machine Learning (For Kindergarten)
A note on Machine Learning to make the term less scary.
We learnt a bit of machinelearning theory right back in 5th grade, when we were taught how joining A(1,1)
and B(2,2) through a line (AB) on a plot would give us a 'y=x' line.
And the next amazing thing we would do was, nd y for any new x other than A and B.
This, exactly is what we try to do in various forms in various #machine #learning #algorithms.
1. We have the #training data -P(x,y): {A(1,1) , B(2,2)}
2. We draw the line AB : y=x (training/learning/ tting step). 'y=x' is our #model.
3. Hence, for a new x=3 (#test data), we can predict, y = 3. (#predictions)
Hence in a nutshell, Machine Learning is composed of 3 steps-
1. Arranging/getting the #data.
2. Identifying the #trends or #structures in the #data.
3. Using these #trends to correctly predict necessary results.
Now, depending on the type/format or structure of data (step 1) our ML-algorithms maybe of two types.
1. Supervised Learning - The training data has correct #labels(y) to the
#features(X) and hence the model trains on them. Ex- #Regression
(Curve tting).
2. Unsupervised Learning - The training data is a set of #features(X) only.
Ex- Clustering data into groups based on the scatter plot.
1. SUPERVISED-Prediction.
#Example 1:
df_example = pd.DataFrame({'phi':[0.1,0.15,0.2,0.25,0.3,0.35],
'pore radius': [5,10,15,20,25,30],
'k': [80,100,120,150,180,'?']})
df_example
#regression
phi pore radius k
0 0.10 5 80
1 0.15 10 100
2 0.20 15 120
3 0.25 20 150
4 0.30 25 180
5 0.35 30 ?
So, in the above data set, we can perform a prediction Task.
We can call K as our target variable (Label). phi and pore radius can be called features.
Algorithm : Regression. Model Type:
K = a(phi) + b(pore radius) + c
2. Supervised Classi cation.
rock_labels = pd.DataFrame({'k_md':[100,150,200,5,90,6,80,4],
'phi': [0.25,0.27,0.22,0.44,0.26,0.38,0.21,0.34],
'RT(ohm-m)': [100,110,120,10,89,15,80,20],
'lith':['sst','sst','sst','shale','sst','shale','sst','???']})
rock_labels
#KNN | K-Means
k_md phi RT(ohm-m) lith
0 100 0.25 100 sst
1 150 0.27 110 sst
2 200 0.22 120 sst
3 5 0.44 10 shale
4 90 0.26 89 sst
5 6 0.38 15 shale
6 80 0.21 80 sst
7 4 0.34 20 ???
Libraries used for Machine learning and Deep learning:
SK-Learn
Tensor ow
Pytorch
# Row 7 is called the Test Data Point. And rest all are the Training Data-Points.
#So based on the mapping the computer finds in the training, the test data predictions are made.
#And hence, we get to know the Final Prediction.
STEPS in a Machine Learning Project-
1. Import Data.
2. Check for missing values and abnormal values.
3. Accordingly process the data and make it usable.
4. Perform EDA - Exploratory Data Analysis. Check for which features are not important
and which can be excluded.
5. Record Visual stories of your data, to be presented.
6. now pick a ML algrithm suitable to your case.
7. Split the Data into Training, Validation and Test Data.
8. Fit the ML-model into the training data.
9. Perform predictions and validate, and make modi cations using the validation data.
10. Finally test the Accuracy and other performance metrics on your Test Data.
Oil and Gas Applications-
1. Image Segmentation (Thin Section Images) to classify rock minerals, and pores.
2. EOR Screening.
3. Torque Prediction using various drilling features.
4. Rock-Facies Classi cation.
5. Production Forecasting (without Decline Curve Analysis).
In conclusion, what i believe is, We can perform a ML based prediction using features that we think are
proportional to the result.
We then get the data of these features and corresponding result, and try to train a model that develops a
relationship.
In this way, we can synthesize relationships that the textbooks do not contain.
######################################END#######################################
Youtube Sources (Brilliant)-
1. SentDex
2. Krish Naik
3. Mathematics - Gilbert Strang (He is the Lord!)
4. Mathematics - Khan Academy.
5. Machine Learning Algorithms - StatQuest (Josh Starmer).
Python - corey schaffer
GOOD LUCK!