Introduction to Scipy: Interpolation and Integration

In this lecture, we will look at two other common sub-packages of Scipy: scipy.interpolate and scipy.integrate.

Interpolation

The simplest interpolation routine in scipy.interpolate is interp1d:

In [ ]:
from scipy.interpolate import interp1d

If we create a fake dataset:

In [ ]:
import numpy as np
x = np.array([0., 1., 3., 4.])
y = np.array([0., 4., 3., 2.])

we can interpolate linearly by first creating an interpolating function:

In [ ]:
f = interp1d(x, y)

and we can then interpolate to any value of x within the original bounds:

In [ ]:
f(0.5)
In [ ]:
f(3.3)

It is also possible to interpolate to several values at the same time:

In [ ]:
f(np.array([0.5, 1.5, 2.5, 3.5]))

If the interpolating function is called outside the original range, an error is raised:

In [ ]:
f(-1.)

You can change this behavior by telling interp1d to not give an error in this case, but to use a set value:

In [ ]:
f = interp1d(x, y, bounds_error=False, fill_value=-10.)
In [ ]:
f(-1.0)
In [ ]:
f(np.array([-1., 1., 3., 6.]))

By default, interp1d uses linear interpolation, but it is also possible to use e.g. cubic interpolation:

In [ ]:
f = interp1d(x, y, kind='cubic')
f(0.5)

For more information, see the documentation for interp1d. There are also other interpolation functions available (for example for spline interpolation), which you can read up about at scipy.interpolate.

Integration

The available integration functions are listed at scipy.integrate. You will notice there are two kinds of functions - those that integrate actual Python functions, and those that integrate numerical functions defined by Numpy arrays.

First, we can take a look at one of the functions that can integrate actual Python functions. If we define a function:

In [ ]:
def simple_function(x):
    return 3. * x**2 + 2. * x + 1.

we can integrate it between limits using:

In [ ]:
from scipy.integrate import quad
print(quad(simple_function, 1., 2.))

As described in the documentation for quad, the first value returned is the integral, and the second is the error on the integral. If we had solved the integral analytically, we would expect 11, so the result is correct.

We can also define functions as Numpy arrays:

In [ ]:
x = np.linspace(1., 2., 1000)
y = 3. * x**2 + 2. * x + 1.

And in this case we can use for example the simps function to integrate using Simpson's rule:

In [ ]:
from scipy.integrate import simps
print(simps(y, x=x))

This can be very useful in cases where one wants to integral actual data that cannot be represented as a simple function or when the function is only available numerically.

Note that there is an issue on the scipy.integrate page - there should also be a menton of the trapz function which works similarly to simps but does trapezium integration:

In [ ]:
from scipy.integrate import trapz
print(trapz(y, x=x))

Exercise

Write a function that takes x, and the parameters for a Gaussian (amplitude, displacement, width) and returns the value of the Gaussian at x:

In [ ]:
# your solution here

Use quad to compute the integral and compare to what you would expect.

In [ ]:
# your solution here

Now create two arrays x and y that contain the Gaussian for fixed values x, and try and compute the integral using simps.

In [ ]:
# your solution here

Compare this to what you found with quad and analytically.