This is my study note for Math 605, UW-Madison. See more details at here.
Definitions
Almost every entry level statistical course introduce the Poisson distribution formula (we call this as the first formulation): where is a discrete random variable, is the mean measure. When you need to calculate the probability of observing k events, you can just put k back to the formula. One important property of Poisson distribution is that its mean equals with its variance, i.e. .
Now let’s consider continuous process. Suppose that we start at time 0 to count events (earthquakes, car accidents, number of death, etc.). For each time t, we obtain a number N(t), which is the total number of events that has occurred up to time t. We then make the following modeling assumptions on the process N(t):
- For some , the probability of exactly one event occurring in a given time interval of length h is equal to . That is, for any ,
- The probability that 2 or more events occur in an interval of length h is o(h):
- The random variables and are independent for any choice of . This is usually termed an independent interval assumption.
We then sat that N(t) is a homogeneous Poisson process with intensity, propensity, or rate .
Proposition 1: Let N(t) be a Poisson process satisfying the three assumptions above. Then for any and , we can prove that If we choose , then we get
Letting denote the time of the first increase in the process (i.e. the first event occurred), then according to Proposition 1, Therefore, the distribution of the first event time is exponentially distributed with a parameter of . Because of the independent increments assumption (assumption 3), we can see that the distribution of is also exponentially distributed with a parameter of . Therefore, we see that *N(t)* is simply the counting process of a renewal process with inter-event times determined by *exponential random variables*(we call this as the second formulation).
We now move to the third formulation of a one dimensional Poisson process. We say the N is a Poisson process with intensity if for any and , we have that where is the Lebesgue measure of A and if are independent random variables whenever are disjoint subsets of state space .
Definition 1. Let N be a point process with state space , and let be a measure on . We say that N is a Poisson process with mean measure , or a Poisson random measure, if the following two conditions hold:
- For ,
- If are disjoint subsets of state space , then are independent random variables.
Note that the mean measure of a Poisson process, , completely determines the process. One choice of the mean measure would be a multiple of Lebesgue measure, which gives length in , area in , volume in , etc. That is, if , for , then is said to be Lebesgue measure with rate, or intensity, . If , then the measure is said to be unit-rate. For another example, a Poisson process with Lebesgue measure in satisfies . When the mean measure is a multiple of Lebesgue measure, we call the process homogeneous ().
If is a non-decreasing, absolutely continuous function, and over an open interval (a,b), then mean measure for a Poisson process is If has density (i.e. it is differentiable), then As a result, for any ,
The three formulations above are all equivalent.
Some definitions: >1. Renewal process: it is used to model occurrences of events happening at random time, where gaps between points (inter-event time) are i.i.d. random variables. >1. Point process: it is used to model a random distribution of points in space. It is a renewal process which distributes points so gaps are i.i.d. exponential random variables. e.g. locations of diseased deer in a given region (space); the breakdown times of certain part of a car (time). The simplest and most ubiquitous example of a point process is the Poisson point process. >1. Lebesgue measure: length, area, volume, etc.
Transformations of Poisson processes
- If the position of the points of a homogeneous process of rate are multiplied by , then the resulting point process is also Poisson, and it is, in fact, a homogeneous process of rate 1.
- Likewise, we could start with a unit-rate process and divide the position of each point by to get a homogeneous process with rate .
- Also, move the points around via an one-to-one function, or transformation, resulted in another Poisson process.
Simulating non-homogeneous Poisson process
Set .
- Let be an exponential random variable with parameter one, which is independent from all other random variables already generated.
- Find the smallest for which
- Set n+1 to n.
- Return to step 1 or break.
###Example Let N be a non-homogeneous Poisson process with local intensity . Write a code that simulates this process until 500 jumps have taken place.
library(ggplot2)
theme_set(theme_bw())
N=500 # number of jumps
time=vector() # to hold times of jumps
E_n=rexp(N) # N exp random values
for (i in 1:N){
time[i]=(3*sum(E_n[1:i]))^(1/3)
}
ggplot(data=NULL)+geom_step(aes(x=c(0,time), y=c(0:500)))+
geom_line(aes(x=c(0,time), y=c(0,time)^3/3), color="blue")