This post basically came about in trying to answer to Tom Tango’s question about making career and seasonal regressions add up,
Or in other words what true talent level do you estimate based on two correlated samples?
As a starting point and as a reminder, if you have one observation, , and you assume true talent is Gaussian with mean and standard deviation , and statistical fluctuations are Gaussian with mean 0 and standard deviation , then the posterior distribution for true talent, , is,
You can use this to compute the mean of the posterior distribution for true talent, but because it is Gaussian, the mean will be equal to the mode. You can determine the mode by solving,
which is solved by
and if , with the number of plate appearances, then
which says, estimate true talent by regressing to the mean () by plate appearances.
The way all of this gets modified when you have two correlated observations — say true talent t1 in season 1 and true talent t2 in season 2 — is that your prior for the true talent distribution includes a correlation between t1 and t2. Specifically,
where t is the vector (t1, t2), C is the covariance matrix (and || denotes the determinant),
Multiplying this all out, and also including the probability distributions to observe performances x1 and x2, given true talents t1 and t2 and standard deviations of statistical fluctuations and , gives
I am writing it this way since I am going to find the mode as in the case with one observation, and since the exponential is always greater than 0, means that , and all I really need to know is the argument of the exponential.
that function g is,
to find the mode I take the partial derivatives with respect to t1 and t2 and set them both equal to 0, then solve those 2 simultaneous equations for t1 and t2. The equations can be written in a matrix form,
the solution for t1 is,
and for t2,