Next: Kernel Basis Functions
Up: svkde
Previous: Non-Parametric Density Estimation
In the previous section we decomposed the CDF into regions or windows
and estimated the PDF for each window separately. There are several ways to choose the placement (i.e. the center) of the windows; for example they can be disjoint and cover the entire domain as is the case with Frequency Histogram Estimation (Figure 2); the benefit of using such a method is that the structure of the windows are independant of the number of random samples (although the estimation procedure is not). Alternatively, the approach we will consider henceforth will associate a window with each random sample
so that we have a sequence of windows
centered at
. Since the number of windows (and their volume) is now dependent on the sample size we will re-write the result (5) as
 |
(6) |
where
is the volume of the region
and
is the number of samples that fall inside it; as more samples are generated the regions can either be refined (as is the case with the Parzen method) or we can reverse this logic and gradually increase the size of the regions starting with an inconspicuously small
(which is the case with the nearest-neighbor estimation method).
Let us further generalize [DHS01] the definition of the region
so that we can estimate multi-variate densities; assume the windows
are d-dimensional hypercubes with edges of length
; the volume is then simply
.
Figure:
Let the hypercube window
have dimension
; then
is simply a count of the random samples that fall in the square with sides of length
and centered at
; in this case
.
|
Now that the window has changed we need to revise our definition of
which was previously defined using a simple indicator function
; generalising this from counting random samples within an interval to counting within a hypercube can be done by first using a window function
of the form:
which defines the boundary for a unit-hypercube centered at the origin and then defining
as follows:
So
if and only if
or
, in other words if
is less than half the length of an edge of the hypercube away from
in all dimensions
. So
defines a (d+1)-dimensional hypercube of volume
(since the (d+1)th dimension has edges of length
), centered at
which counts the number of random samples that fall within the d-dimensional hypercube window
. Finally substituting this result into 6 gives:
The resulting estimated density is `jagged' since the window (basis) functions in the above linear combination are hypercubes (Figure 3) with abrupt edges; reducing the width
as more samples are generated will smooth out the estimate. Infact if
then the estimated density function is proved [Fuk72] to be asymptotically unbiased;
.
Subsections
Next: Kernel Basis Functions
Up: svkde
Previous: Non-Parametric Density Estimation
Rohan Shiloh SHAH
2006-12-12