...
At the grating, we want the incident wavefront to have a transverse spatial coherence equal to or larger than the grating width because this maximizes the number of lines that are illuminated with spatially coherent light. This gives us the maximum spectral resolution since it makes the peaks sharper.
Lets try to quantify what we mean by "sharper" here.
Suppose we have a grating with some grating period
We also have a monochromatic light source with wavelength
, at normal incidence, uniformly illuminating the grating with intensity .This means that there is an "incident electric field" on the grating, which we denote
. Let x=0 be the center of the grating and have the grating extend infinitely outwards to the left and right.
We define the mutual intensity J as follows
| LaTeX Unit |
|---|
| body | J(x_1,x_2)=\langle E^*(x_1)E(x_2)\rangle |
|---|
|
We've seen this before, its just the interference term from earlier, also called the mutual coherence function or correlation function. When this value is high, we have high potential for interference and when it is close to 0 we have essentially no interference.
In the case where x1=x2=x, ie the diagonal, it just reduces to the intensity at x
OK now we take a step back and think from some first principles.
What we observe with a diffraction grating in the far field is essentially the spatial fourier transform of the incident electric field at the grating.
| LaTeX Math Block |
|---|
|
E(\theta)=C\int_{-\infty}^{+\infty}E_{out}e^{i\frac{2\pi}{\lambda}x\sin\theta}dx |
Where C is a constant prefactor which has the appropriate units. The term in the exponential is essentially the phase difference between a ray at x=0 and a ray at x=x. This is a limiting case of the Fresnel-kirchhof formula.
Anyways, E_out needs to be determined. The grating of width 12.7mm. The total distance from the grating to the slit is about 95.4mm. Suppose the slit is a circular aperture. What is the minimum slit size needed to is essentially a series of opaque and non opaque strips, so we can describe its transmission as
| LaTeX Math Block |
|---|
|
t(x) = \begin{cases}
1 & \text{if } \mod(\lfloor x/\Lambda\rfloor,2)\neq0 \\
0 & \text{otherwise}
\end{cases} |
So lets say E_out(x) = E_in(x)t(x)
| LaTeX Math Block |
|---|
|
E(\theta)=C\int_{-\infty}^{+\infty}E_{in}(x)t(x)e^{i\frac{2\pi}{\lambda}x\sin\theta}dx |
The intensity is therefore
| LaTeX Math Block |
|---|
|
I(\theta)=\langle E^*(\theta)E(\theta)\rangle = \langle C^2\int_{-\infty}^{+\infty}E^*_{in}(x_1)t^*(x_1)e^{-i\frac{2\pi}{\lambda}x_1\sin\theta}dx_1 \int_{-\infty}^{+\infty}E_{in}(x_2)t(x_2)e^{i\frac{2\pi}{\lambda}x_2\sin\theta}dx_2\rangle
|
OK lets simplify this. First of all, move all the things that are not time dependent out, and combine the two integrals as they integrate independent quantities.
| LaTeX Math Block |
|---|
|
I(\theta)= C^2\int_{-\infty}^{+\infty}\int_{-\infty}^{+\infty} t^*(x_1)t(x_2) \langle E^*_{in}(x_1)E_{in}(x_2)\rangle e^{i\frac{2\pi}{\lambda}(x_2-x_1)\sin\theta} dx_1dx_2 |
OK now notice that the only time dependent term is the E field part, so actually we can simplify the entire thing down using the mutual intensity we found earlier
| LaTeX Math Block |
|---|
|
I(\theta)= C^2\cdot \int_{-\infty}^{+\infty}\int_{-\infty}^{+\infty}t^*(x_1)t(x_2)\cdot J(x_1,x_2)e^{i\frac{2\pi}{\lambda}(x_2-x_1)\sin\theta} dx_1dx_2 |
OK now lets figure out more details about the mutual coherence J.
First of all, as we've seen before, J is related to the complex degree of coherence as follows
| LaTeX Math Block |
|---|
|
\mu(x_1,x_2)=\frac{J(x_1,x_2)}{\sqrt{I(x_1)I(x_2)}}=\frac{J(x_1,x_2)}{I_0} |
The second step follows from the fact that we assume uniform incident intensity across the grating I_0
Now all we need to do is specify the complex degree of coherence between two points x_1 and x_2 on the grating!
For this, we can utilize whats called the "Gaussian-Schell Model". We know obviously that when x_1=x_2=x, then the complex degree of coherence should be 1 since a point is perfectly coherent to itself (if you know the value of E at x=0, then you know obviously the value of E at x=0)
For typical spatially incoherent sources, there is some typical "transverse spatial coherence width" where points closer than this are roughly spatially coherent and points further away are spatially incoherent.
From this, we create the following model for the complex degree of coherence!
| LaTeX Math Block |
|---|
|
\mu(x_1,x_2)=e^{-\frac{(x_2-x_1)^2}{2l_c^2}} |
This follows a gaussian distribution and will be our model of spatial coherence across the grating surface.
We now solve for J(x1,x2)!
Anyways, we can now plug it into the equation for I
| LaTeX Math Block |
|---|
|
I(\theta)= C^2\cdot \int_{-\infty}^{+\infty}\int_{-\infty}^{+\infty}t^*(x_1)t(x_2)\cdot I_0e^{-\frac{(x_2-x_1)^2}{2l_c^2}} e^{i\frac{2\pi}{\lambda}(x_2-x_1)\sin\theta} dx_1dx_2 |
quite a hairy integral I'll try to simplify it a bit down now
| LaTeX Math Block |
|---|
|
I(\theta)= C^2I_0\cdot \int_{-\infty}^{+\infty}\int_{-\infty}^{+\infty}t^*(x_1)t(x_2)\cdot e^{-\frac{\Delta x^2}{2l_c^2}} e^{i\frac{2\pi}{\lambda}\Delta x \sin\theta} dx_1dx_2 |
I'm sure theres a way to solve this explicitly, but I'm not quite that good at math and instead lets just look at the limiting cases and then do numerical simulation in between.
First case is for perfectly spatially coherent light. This means the transverse spatial coherence width is infinite, l_c=infinity, so then J(x1,x2)=I_0
| LaTeX Math Block |
|---|
|
I(\theta)= C^2I_0\cdot \int_{-\infty}^{+\infty}\int_{-\infty}^{+\infty}t^*(x_1)t(x_2)\cdot e^{i\frac{2\pi}{\lambda}\Delta x \sin\theta} dx_1dx_2 |
| LaTeX Math Block |
|---|
|
I(\theta)= C^2I_0\cdot |\int_{\text{slits}}e^{i\frac{2\pi}{\lambda}x \sin\theta} dx|^2 |
blah blah blah some math later (fill in later)
We get the typical formula for perfect spatial coherent diffraction against N slits
We also check the fully incoherent limit, and that pops out that we should get uniform illumination in the far field.
- Is this weird? I don't think so. Diffraction relies on the fact that we have consistent spatial phase relationships between different points, such that we are able to get clear patterns of constructive and destructive interference. If we have perfectly incoherent light, then the E field at one point is entirely random and unpredictable from the E field at another point. This means that upon time average, there are no special spatial relationships between any points which causes us to get zero observable constructive or destructive interference upon time average.
- Despite this, we observe visible diffracted rainbows off common diffraction gratings like CDs using all sorts of lighting from the sun or lightbulbs etc. This isn't necessarily a contradiction because in reality I dont think there are many obvious 100% spatially incoherent sources. From the van-cittert zernike theorem we know that essentially any arbitrary light source will become spatially coherent at enough distance. the sun for example has some spatial coherence at the earths surface. lightbulbs etc when viewed from sufficiently far away have spatial coherence.
OK I had claude solve this integral explictly so not only do we now know the limiting cases, we also analytically know the general solution.
I made a script that shows the resulting angular diffraction pattern for various incident transverse spatial coherence lengths and its shown below.