5 min read

On Poisson process

This is my study note for Math 605, UW-Madison. See more details at here.

Definitions

Almost every entry level statistical course introduce the Poisson distribution formula (we call this as the first formulation): f(k;μ)=Pr(X=μ)=μkeμk! where X is a discrete random variable, μ is the mean measure. When you need to calculate the probability of observing k events, you can just put k back to the formula. One important property of Poisson distribution is that its mean equals with its variance, i.e. μ=E(X)=Var(X).

Now let’s consider continuous process. Suppose that we start at time 0 to count events (earthquakes, car accidents, number of death, etc.). For each time t, we obtain a number N(t), which is the total number of events that has occurred up to time t. We then make the following modeling assumptions on the process N(t):

  1. For some λ>0, the probability of exactly one event occurring in a given time interval of length h is equal to λh+o(h). That is, for any t0, P{N(t+h)N(t)=1}=λh+o(h),ash0
  2. The probability that 2 or more events occur in an interval of length h is o(h): P{N(t+h)N(t)2}=o(h),ash0
  3. The random variables N(t1)N(s1) and N(t2)N(s2) are independent for any choice of s1t1s2t2. This is usually termed an independent interval assumption.

We then sat that N(t) is a homogeneous Poisson process with intensity, propensity, or rate λ.

Proposition 1: Let N(t) be a Poisson process satisfying the three assumptions above. Then for any ts0 and k{0,1,2,...}, we can prove that P{N(t)N(s)}=eλ(ts)(λ(ts))kk! If we choose s=0, then we get P{N(t)N(s)}=e(λt)(λt)kk!

Letting S1 denote the time of the first increase in the process (i.e. the first event occurred), then according to Proposition 1, P{S1>t}=P{N(t)=0}=eλt Therefore, the distribution of the first event time is exponentially distributed with a parameter of λ. Because of the independent increments assumption (assumption 3), we can see that the distribution of S2S1 is also exponentially distributed with a parameter of λ. Therefore, we see that *N(t)* is simply the counting process of a renewal process with inter-event times determined by *exponential random variables*(we call this as the second formulation).

We now move to the third formulation of a one dimensional Poisson process. We say the N is a Poisson process with intensity λ if for any AR0 and k0, we have that P{N(A)=k}=eλ|A|(λ|A|)kk! where |A| is the Lebesgue measure of A and if N(A1),...,N(Ak) are independent random variables whenever A1,...,Ak are disjoint subsets of state space E.

Definition 1. Let N be a point process with state space ERd, and let μ be a measure on Rd. We say that N is a Poisson process with mean measure μ, or a Poisson random measure, if the following two conditions hold:

  1. For AE, P{N(A)=k}={eμ(A)(μ(A))kk!ifμ(A)<0ifμ(A)=
  2. If A1,...,Ak are disjoint subsets of state space E, then N(A1),...,N(Ak) are independent random variables.

Note that the mean measure of a Poisson process, μ(A), completely determines the process. One choice of the mean measure would be a multiple of Lebesgue measure, which gives length in R1, area in R2, volume in R3, etc. That is, if μ((a,b])=λ(ba), for a,bR, then μ is said to be Lebesgue measure with rate, or intensity, λ. If λ=1, then the measure is said to be unit-rate. For another example, a Poisson process with Lebesgue measure in R2 satisfies μ(A)=Area(A). When the mean measure is a multiple of Lebesgue measure, we call the process homogeneous (λ>1).

If is a non-decreasing, absolutely continuous function, and over an open interval (a,b), then mean measure μ for a Poisson process is μ((a,b))=(b)(a) If has density λ (i.e. it is differentiable), then μ((a,b))=(b)(a)=abλ(s)ds As a result, for any AR, P{N(A)=k}=eabλ(s)ds(abλ(s)ds)kk!

The three formulations above are all equivalent.

Some definitions: >1. Renewal process: it is used to model occurrences of events happening at random time, where gaps between points (inter-event time) are i.i.d. random variables. >1. Point process: it is used to model a random distribution of points in space. It is a renewal process which distributes points so gaps are i.i.d. exponential random variables. e.g. locations of diseased deer in a given region (space); the breakdown times of certain part of a car (time). The simplest and most ubiquitous example of a point process is the Poisson point process. >1. Lebesgue measure: length, area, volume, etc.

Transformations of Poisson processes

  • If the position of the points of a homogeneous process of rate λ>0 are multiplied by λ, then the resulting point process is also Poisson, and it is, in fact, a homogeneous process of rate 1.
  • Likewise, we could start with a unit-rate process and divide the position of each point by λ to get a homogeneous process with rate λ.
  • Also, move the points around via an one-to-one function, or transformation, resulted in another Poisson process.

Simulating non-homogeneous Poisson process

Set t0=0,n=1.

  1. Let En be an exponential random variable with parameter one, which is independent from all other random variables already generated.
  2. Find the smallest μ0 for which 0μλ(s)ds=E1++En
  3. Set n+1 to n.
  4. Return to step 1 or break.

###Example Let N be a non-homogeneous Poisson process with local intensity λ(t)=t2. Write a code that simulates this process until 500 jumps have taken place.

library(ggplot2)
theme_set(theme_bw())
N=500 # number of jumps
time=vector() # to hold times of jumps
E_n=rexp(N) # N exp random values
for (i in 1:N){
  time[i]=(3*sum(E_n[1:i]))^(1/3)
}
ggplot(data=NULL)+geom_step(aes(x=c(0,time), y=c(0:500)))+
  geom_line(aes(x=c(0,time), y=c(0,time)^3/3), color="blue")