Random variables#
Introduction#
The foundations of probability are built on sets, yet data is more naturally stored and more easily computed on if it is represented numerically.
Random variables match each outcome in our sample space to a value on the number line.
In addition to computational advantages, random variables help us extract from our data the most important characteristics, and they serve as building blocks which we can use to create powerful models. Random variables are also a language we can use to communicate our modeling efforts to other mathematicians, statisticians, and data scientists.
Suppose we hypothesize that the frequency of social media posts on some popular outlet is related to influenza-like illness (ILI), a syndromic diagnosis suggesting a patient may have influenza. A patient is diagnosed with ILI if their measured temperature is at or above 38°C and they have symptoms resembling the flu. Because influenza is most active in winter and spring, we collect a random sample of social media posts each day from September to May, and in addition we collect the proportion of patients admitted to the hospital who are diagnosed with influenza-like illness at the US national level.
The above hypothesis, data collection, and future inference involve numerous details. However, we will see shortly that we can simplify our hypothesis by using random variables.
Maps from the sample space to the number line#
Given a sample space \(\mathcal{G}\), a \textbf{random variable} (e.g., \(X\)) is a function from each element in \(\mathcal{G}\)—from each outcome—to a value on the real number line. The real number line contains all numbers: integer and decimal, from negative to positive infinity.
\textbf{Example:} Suppose our sample space contains two elements \(\mathcal{G} = \{ a,b \}\). We may decide to define a random variable \(X\) that maps the outcome \(a\) to the value \(-1\) and the outcome \(b\) to the value \(1\). In other words, \(X(a)=-1\) and \(X(b)=1\). We could just as well define a random variable \(Y\) on the same sample space such that \(Y(a)=0\) and \(Y(b)=1\).
\textbf{Example:} Suppose our sample space contains all integers from 0 to 1000, \(\mathcal{G} = \{0,1,2,3,\cdots,1000 \}\). We may be most interested in whether an integer is even or odd, and so we can define a random variable \(Y(y)=0\) when \(y\), our outcome, is an odd integer and \(Y(y)=1\) when \(y\) is even. This is an example of how a random variable can distill a sample space with many outcomes down to a random variable with only two values.
\textbf{Example:} Suppose we decide to study the relationship between the cumulative total number of cigarettes smoked by a person from the date that they started smoking and the presence of lung cancer. We define our sample space to be \(\mathcal{G} = \{ (x,y) \;|\; x \in \mathbb{Z}, y \in \{0,1\} \}\). We define two random variables: a random variable \(X\) that maps the outcome \((x,y)\) to the value in the first position, \(x\), and a random variable \(Y\) that maps the outcome \((x,y)\) to the value in the second position, \(y\). Though our outcomes are linked, we can use random variables to think about two separate quantities—cigarettes smoked and lung cancer—and how they interact.
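Since a random variable is nothing more than a function on outcomes, it can be sketched directly in code. The minimal Python sketch below mirrors the examples above; the names `X`, `Y`, and `even_odd` are our own and not part of the text.

```python
# Random variables written as plain Python mappings from outcomes to numbers.

# Sample space G = {a, b} and the two random variables from the first example.
X = {"a": -1, "b": 1}   # X(a) = -1, X(b) = 1
Y = {"a": 0, "b": 1}    # Y(a) = 0,  Y(b) = 1

# The even/odd random variable from the second example.
def even_odd(outcome):
    """Map an integer outcome to 1 if it is even and 0 if it is odd."""
    return 1 if outcome % 2 == 0 else 0

print(X["a"], Y["b"], even_odd(7), even_odd(8))   # -1 1 0 1
```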
A new sample space#
When we build a random variable \(X\) that maps outcomes to values on the number line we create a new sample space, which we will call the support of \(X\), or \(supp(X)\). Define a sample space \(\mathcal{G}\) with outcomes \(o_{i}\). Then the \textbf{support of X} is

\[
supp(X) = \{ X(o_{i}) \;|\; o_{i} \in \mathcal{G} \}.
\]
Our new sample space is the set of all the potential values that our random variable \(X\) can produce. This is a sample space linked to \(\mathcal{G}\), but in practice after we develop a random variable we often no longer reference \(\mathcal{G}\).
\textbf{Example:} In our above example where \(\mathcal{G} = \{a,b\}\), the random variable \(X\) has support \(supp(X) = \{-1,1\}\) and the random variable \(Y\) has support \(supp(Y)=\{0,1\}\). Let's look at another example, the one above where \(\mathcal{G}\) is the set of all integers from 0 to 1000. Even though the sample space is quite large, the random variable that maps the integers to 0 when they are odd and 1 when they are even has a small support, \(supp(Y) = \{0,1\}\).
How to assign probabilities to a random variable#
Random variables themselves do not require that we include the probability of each of their values. Random variables are a function from outcomes to the real numbers—nothing more. That said, in practice we build random variables expecting that the probabilities we assign to outcomes in our sample space will correspond to probabilities assigned to values of our random variable.
We assign to the value \(x\), which belongs to the support of the random variable \(X\), the sum of the probabilities of all the outcomes that \(X\) maps to \(x\):

\[
P(X=x) = P(\{o_{1}\}) + P(\{o_{2}\}) + \cdots + P(\{o_{n}\})
\]

where each outcome \(o_{1},o_{2},\cdots,o_{n}\) is mapped by \(X\) to the value \(x\). In other words, \(X(o_{i})=x\) for each of \(o_{1},o_{2},\cdots,o_{n}\).
\textbf{Example:} Define a sample space \(\mathcal{S} = \{a,b,c,d,e\}\) and a random variable \(X\) that maps the outcomes to the following values
| Outcome | P(outcome) | X(outcome) |
|---|---|---|
| a | 0.1 | 0 |
| b | 0.25 | 1 |
| c | 0.15 | 1 |
| d | 0.3 | 2 |
| e | 0.2 | 0 |
We assign the probability that \(X=0\) as the sum of the probabilities assigned to outcome \(a\) and outcome \(e\), or

\[
P(X=0) = P(\{a\}) + P(\{e\}) = 0.1 + 0.2 = 0.30.
\]

We can run the same procedure for all the elements in the support of \(X\),

\[\begin{align*}
P(X=1) &= P(\{b\}) + P(\{c\}) = 0.25 + 0.15 = 0.40\\
P(X=2) &= P(\{d\}) = 0.30
\end{align*}\]
and organize our work in a table
| X | P(X=x) |
|---|---|
| 0 | 0.30 |
| 1 | 0.40 |
| 2 | 0.30 |
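The summing procedure above is easy to automate. Here is a short Python sketch that recovers the table by summing outcome probabilities; the names `P_outcome` and `P_X` are assumed, not from the text.

```python
from collections import defaultdict

# Probabilities of outcomes in S = {a, b, c, d, e} and the map X from the table above.
P_outcome = {"a": 0.10, "b": 0.25, "c": 0.15, "d": 0.30, "e": 0.20}
X = {"a": 0, "b": 1, "c": 1, "d": 2, "e": 0}

# P(X = x) is the sum of P(outcome) over every outcome that X maps to x.
P_X = defaultdict(float)
for outcome, prob in P_outcome.items():
    P_X[X[outcome]] += prob

print(dict(P_X))   # {0: 0.30..., 1: 0.40, 2: 0.30}
```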
A \textbf{probability distribution} for a random variable \(X\) is a set of tuples where the first position in each tuple is a value in the support of \(X\) and the second position in the tuple is the corresponding probability assigned to that value.
\textbf{Example:} A probability distribution for the random variable \(X\) above is \(\{(0,0.30),(1,0.40),(2,0.30)\}\).
\textbf{Example:} Imagine we run an experiment that collects data on marathon runners. We decide to collect the number of elapsed minutes until they finish the race. Our sample space is defined as all positive integers \(\mathcal{S} = \{1,2,3,\cdots\}\). We may decide to build a random variable \(X\) that maps outcomes of 60 or fewer minutes to the value 1, outcomes from 61 to 120 to the value 2, and outcomes greater than 120 to the value 3. One potential probability distribution for \(X\) is \(\{(1,0.10),(2,0.50),(3,0.40)\}\). For this probability distribution, \(P(X=1) = 0.10\), \(P(X=2) = 0.50\), and \(P(X=3) = 0.40\).
Figure: A sample space \(\mathcal{S}\) with elements \(\{a,b,c,d,e\}\) and a random variable \(X\) that maps each element in \(\mathcal{S}\) to one of the values 0, 1, or 2. Probabilities corresponding to outcomes in the sample space, and how they map to probabilities for each value of \(X\), are shown in blue.
Probability mass function#
There are several supportive tools that we can use to help us better understand the random variables we create. The first is the probability mass function, or p.m.f. The \textbf{probability mass function} is a \underline{function} that maps values in the support of a random variable \(X\) to their corresponding probabilities. Inputs are values of \(X\); outputs are probabilities.
The probability mass function is a convenient way to organize a probability distribution and it allows us to transfer all the information we know about functions to random variables.
\textbf{Example:} Define a random variable \(Y\) with support \(\{-1,0,1\}\), probability distribution \(\{(-1,0.2),(0,0.5),(1,0.3)\}\), and probability mass function

\[\begin{align*}
f(y) = \begin{cases} 0.2 & \text{ when } y = -1\\ 0.5 & \text{ when } y = 0\\ 0.3 & \text{ when } y = 1 \end{cases}
\end{align*}\]
This function, our probability mass function, is an example of a \textbf{piecewise} function.
We can ask our p.m.f. to return the probability for a given value

\[
f(0) = 0.5, \qquad f(1) = 0.3,
\]
and we can visualize our probability mass function using, for example, a barplot.

Figure: A barplot for visualizing the probability mass function of our random variable \(Y\). The support of \(Y\) is plotted on the horizontal axis and the height of each bar corresponds to the probability assigned to that value in the support.
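As a rough illustration, the p.m.f. of \(Y\) can be written as a piecewise Python function and plotted as a barplot; the name `pmf_Y` is assumed, and matplotlib is one possible plotting choice.

```python
import matplotlib.pyplot as plt

def pmf_Y(y):
    """Return P(Y = y) for the random variable Y with support {-1, 0, 1}."""
    if y == -1:
        return 0.2
    elif y == 0:
        return 0.5
    elif y == 1:
        return 0.3
    return 0.0   # values outside the support receive probability 0

support = [-1, 0, 1]
plt.bar(support, [pmf_Y(y) for y in support])
plt.xlabel("y")
plt.ylabel("P(Y = y)")
plt.show()
```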
\textbf{Distributed as \(f\):} Because we can use the probability mass function to describe the probability distribution of a random variable, we will often write

\[
Y \sim f.
\]

The above formula is read “the random variable \(Y\) is distributed as \(f\)”, and what we mean when a random variable is distributed as \(f\) is that the support of \(Y\) is the same as the domain of the function \(f\) and that the probability of a value \(y\) is equal to \(f(y)\), or

\[
P(Y=y) = f(y).
\]
The probability mass function is a convenient method for assigning probabilities to random variables and visualizing the distribution of a random variable.
Cumulative mass function#
The \textbf{cumulative mass function} is a \underline{function} that maps values in the support of a random variable \(X\) to the probability that the random variable is less than or equal to this value, or \(P(X \leq x)\).
We use a capital \(F\) to denote a cumulative mass function (c.m.f.). The c.m.f. corresponding to a random variable \(X\) has a domain equal to the support of \(X\) and produces values between 0 and 1 (the set of values a function can produce is called the function’s \textbf{image}).
The c.m.f. too can be visualized and we could also use the c.m.f. to describe the probability distribution of a random variable. This is because we can use the c.m.f. to derive the p.m.f.
Figure: A barplot for visualizing the cumulative mass function of our random variable \(Y\). The support of \(Y\) is plotted on the horizontal axis, and the height of each bar at a value \(v\) corresponds to the probability assigned to values less than or equal to \(v\), that is, \(P(Y \leq v)\).
\textbf{Example:} For the random variable \(Y\) defined above, with probability distribution \(\{(-1,0.2),(0,0.5),(1,0.3)\}\), the c.m.f. is

\[\begin{align*}
F(y) = \begin{cases} 0.2 & \text{ when } y = -1\\ 0.7 & \text{ when } y = 0\\ 1.0 & \text{ when } y = 1 \end{cases}
\end{align*}\]
We can use the c.m.f. \(F(x)\) to compute any p.m.f. \(f(x)\) too, by noticing that for a support \(\{x_{0},x_{1},x_{2},\cdots,x_{n}\}\) where the values in this set are ordered from smallest to largest,

\[\begin{align*}
f(x_{0}) &= F(x_{0})\\
f(x_{i}) &= F(x_{i}) - F(x_{i-1}) \quad \text{ for } i = 1,2,\cdots,n.
\end{align*}\]
Because the p.m.f. and c.m.f. equivalently describe the probability distribution of a random variable, we can write \(X\sim F\) or \(X \sim f\).
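A brief sketch of this relationship in Python, using the assumed names `pmf`, `cmf`, and `recovered`: the c.m.f. is a running sum of the p.m.f., and differencing consecutive c.m.f. values recovers the p.m.f.

```python
from itertools import accumulate

support = [-1, 0, 1]
pmf = [0.2, 0.5, 0.3]              # f(-1), f(0), f(1)

# The c.m.f. is a running sum of the p.m.f. over the ordered support.
cmf = list(accumulate(pmf))        # F(-1)=0.2, F(0)=0.7, F(1)=1.0

# Recover the p.m.f.: f(x_0) = F(x_0) and f(x_i) = F(x_i) - F(x_{i-1}).
recovered = [cmf[0]] + [cmf[i] - cmf[i - 1] for i in range(1, len(cmf))]

print(cmf)         # [0.2, 0.7, 1.0]
print(recovered)   # [0.2, 0.5, 0.3] up to floating point error
```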
Two or more random variables#
A single random variable is a useful way to describe a sample space and the probabilities assigned to values on the number line corresponding to outcomes.
But more complicated scientific questions often require several random variables and rules for how they interact. We will explore how to generate multiple random variables on the same sample space. Then we will develop an approach to define probabilities for combinations of random variables, a joint probability mass function and joint cumulative mass function.
Suppose we design a sample space \(\mathcal{S}\) and build two random variables on this space, \(X\) and \(Y\). Because \(X\) and \(Y\) are random variables, we can think of the support of \(X\) and the support of \(Y\) as two new sample spaces. Let's assume that the outcomes in \(supp(X) = \{x_{1},x_{2},x_{3},\cdots,x_{N}\}\) and that the outcomes in \(supp(Y) = \{y_{1},y_{2},\cdots,y_{M}\}\). Then we can define a new set of outcomes in the space \(supp(X) \times supp(Y) = \{(x_{i}, y_{j})\}\) for all combinations of \(i\) from \(1\) to \(N\) and \(j\) from \(1\) to \(M\).
Each outcome in \(\mathcal{S}\) is mapped to a tuple \((x_{i},y_{j})\) in this new space, where the outcome is mapped by \(X\) to the value \(x_{i}\) and by \(Y\) to the value \(y_{j}\). If we assign the probability of \((x_{i},y_{j})\) to be the sum of the probabilities of all outcomes in \(\mathcal{S}\) that \(X\) maps to \(x_{i}\) and \(Y\) maps to \(y_{j}\), then we call this a \textbf{joint probability distribution}.
\textbf{Example:} A pharmaceutical company launches a clinical trial to study adverse events from a novel medicine. The trial enrolls patients and randomizes them to receive the novel drug or optimal medical therapy (OMT). The trial expects three potential adverse events which we call \(\text{ae}_{1}\), \(\text{ae}_{2}\), and \(\text{ae}_{3}\). We may choose to define a sample space \(\mathcal{S} = \{(\text{nov},\text{ae}_{1}),(\text{nov},\text{ae}_{2}),(\text{nov},\text{ae}_{3}),(\text{omt},\text{ae}_{1}),(\text{omt},\text{ae}_{2}),(\text{omt},\text{ae}_{3})\}\) and further build a random variable \(X\) that maps outcomes to the value 0 when an adverse event was experienced by a control (OMT) patient and 1 when an adverse event was experienced by a novel drug patient, and a random variable \(Y\) that maps adverse event 1 to the value 1, adverse event 2 to the value 2, and adverse event 3 to the value 3.
| Event | X |
|---|---|
| \((\text{nov},\text{ae}_{1})\) | 1 |
| \((\text{nov},\text{ae}_{2})\) | 1 |
| \((\text{nov},\text{ae}_{3})\) | 1 |
| \((\text{omt},\text{ae}_{1})\) | 0 |
| \((\text{omt},\text{ae}_{2})\) | 0 |
| \((\text{omt},\text{ae}_{3})\) | 0 |
| Event | Y |
|---|---|
| \((\text{nov},\text{ae}_{1})\) | 1 |
| \((\text{nov},\text{ae}_{2})\) | 2 |
| \((\text{nov},\text{ae}_{3})\) | 3 |
| \((\text{omt},\text{ae}_{1})\) | 1 |
| \((\text{omt},\text{ae}_{2})\) | 2 |
| \((\text{omt},\text{ae}_{3})\) | 3 |
A joint probability distribution would assign probabilities to all possible pairs of values for \(X\) and \(Y\).
| X | Y | prob |
|---|---|---|
| 1 | 1 | 0.10 |
| 1 | 2 | 0.05 |
| 1 | 3 | 0.23 |
| 0 | 1 | 0.05 |
| 0 | 2 | 0.30 |
| 0 | 3 | 0.27 |
We write \(P(X=x, Y=y)\) to denote the joint probability that the random variable \(X\) equals the value \(x\) and the random variable \(Y\) equals the value \(y\). With the above example in mind, \(P(X=0,Y=2) = 0.30\), and this probability could be described as the probability that an OMT patient experiences adverse event 2.
Joint probability distributions are often visualized as a table with one random variable’s values representing the rows and the second random variable’s values representing the columns
|  | Y = 1 | Y = 2 | Y = 3 |
|---|---|---|---|
| X = 0 | 0.05 | 0.30 | 0.27 |
| X = 1 | 0.10 | 0.05 | 0.23 |
A joint distribution can be used to compute the probability that the random variable \(X\) equals the value \(x\) for any value of \(Y\), written \(P(X=x)\), and also the probability that the random variable \(Y\) equals the value \(y\) for any value of \(X\), written \(P(Y=y)\). These probabilities are called \textbf{marginal probabilities}.
The marginal probability that \(X\) equals \(0\), or \(P(X=0)\), is computed by summing the joint probabilities where \(X=0\). We can use the same procedure to compute the marginal probability that \(X=1\):

\[\begin{align*}
P(X=0) &= P(X=0,Y=1) + P(X=0,Y=2) + P(X=0,Y=3) = 0.05 + 0.30 + 0.27 = 0.62\\
P(X=1) &= P(X=1,Y=1) + P(X=1,Y=2) + P(X=1,Y=3) = 0.10 + 0.05 + 0.23 = 0.38
\end{align*}\]
The same procedure can be done for the random variable \(Y\) to find its marginal probabilities:

\[\begin{align*}
P(Y=1) &= 0.05 + 0.10 = 0.15\\
P(Y=2) &= 0.30 + 0.05 = 0.35\\
P(Y=3) &= 0.27 + 0.23 = 0.50
\end{align*}\]
Joint distributions need not be restricted to only two random variables. We can build joint distributions of any number of random variables and can compute marginal probabilities in the same way that we do for a joint distribution of two random variables.
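A small Python sketch of marginalization, with the joint distribution stored in an assumed dictionary `joint` keyed by \((x, y)\) pairs.

```python
from collections import defaultdict

# Joint distribution from the table above, keyed by (x, y) pairs.
joint = {(1, 1): 0.10, (1, 2): 0.05, (1, 3): 0.23,
         (0, 1): 0.05, (0, 2): 0.30, (0, 3): 0.27}

marginal_X = defaultdict(float)
marginal_Y = defaultdict(float)
for (x, y), p in joint.items():
    marginal_X[x] += p   # P(X = x): sum the joint probabilities over all y
    marginal_Y[y] += p   # P(Y = y): sum the joint probabilities over all x

print(dict(marginal_X))   # {1: 0.38, 0: 0.62}
print(dict(marginal_Y))   # {1: 0.15, 2: 0.35, 3: 0.50}
```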
The rules, laws, and theorems of probability that we learned in chapter one carry over to random variables. This is because we can consider a new sample space of values that our random variable can take. Suppose we build two random variables: \(X\), which maps outcomes to the values \(-1,0,1\), and \(Y\), which maps outcomes to the values \(1,2,3\). We can define a new sample space \(\mathcal{S} = \{(x,y) \;|\; x \in supp(X) \text{ and } y \in supp(Y) \}\). With our new sample space, we can now discuss statements like \(P(X = x | Y = y)\).
\textbf{Example:} If we continue with our pharmaceutical example, we can build a new sample space \(\mathcal{S} = \{(0,1),(0,2),(0,3),(1,1),(1,2),(1,3)\}\) and compute, for example, \(P(X=1 | Y=2) = \frac{P(X=1,Y=2)}{P(Y=2)} = \frac{0.05}{0.35} \approx 0.14\).
We can define the conditional probability of the value of one random variable given another as

\[
P(X=x | Y=y) = \frac{P(X=x, Y=y)}{P(Y=y)},
\]

and so define statistical independence between two random variables, \(A\) and \(B\), as

\[
P(A=a | B=b) = P(A=a) \text{ for every } a \in supp(A) \text{ and } b \in supp(B).
\]
We can also translate the law of total probability from events to random variables. For events \([Y=y_{1}], [Y=y_{2}], \cdots, [Y=y_{n}]\) that partition the sample space, the event that the random variable \(X=x\) satisfies

\[
P(X=x) = P(X=x | Y=y_{1})P(Y=y_{1}) + P(X=x | Y=y_{2})P(Y=y_{2}) + \cdots + P(X=x | Y=y_{n})P(Y=y_{n}).
\]
By thinking of the values of a random variable as outcomes in a new sample space, we can apply our past intuition about, and the mechanics of, events and sets to a collection of random variables.
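The conditional probability and the law of total probability can be checked numerically from the same joint table; the helper names `P_Y` and `P_X_given_Y` below are ours.

```python
# Conditional probability and the law of total probability, computed from the
# joint distribution of the pharmaceutical example.
joint = {(1, 1): 0.10, (1, 2): 0.05, (1, 3): 0.23,
         (0, 1): 0.05, (0, 2): 0.30, (0, 3): 0.27}

def P_Y(y):
    """Marginal probability P(Y = y)."""
    return sum(p for (_, y_), p in joint.items() if y_ == y)

def P_X_given_Y(x, y):
    """Conditional probability P(X = x | Y = y)."""
    return joint[(x, y)] / P_Y(y)

print(round(P_X_given_Y(1, 2), 3))   # 0.143, i.e. 0.05 / 0.35

# Law of total probability: P(X = 1) = sum over y of P(X=1 | Y=y) * P(Y=y).
print(sum(P_X_given_Y(1, y) * P_Y(y) for y in (1, 2, 3)))   # 0.38
```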
Functions of a random variable#
There are times that we may wish to summarize the behavior of a random variable. One common way to describe how probability is distributed among values in the support of a random variable is by computing some function of that random variable.
Multiplying and adding a constant to a random variable#
Suppose we have defined a random variable \(X\) and wish to study a new random variable \(Y = g(X)\) that is a function of \(X\). The values and the probabilities assigned to \(X\), along with the function \(g\), will determine the values in the support of \(Y\) and the probabilities assigned to these values.
Translation#
Define a random variable \(X\) with p.m.f. \(f_{X}\) and a new random variable \(Y = X+c\), where \(c\) is a constant. Then the support for \(Y\) is \(supp(Y) = \{ x+c \;|\; x \in supp(X) \}\) and the p.m.f. of \(Y\), \(f_{Y}(y)\), is

\[
f_{Y}(y) = f_{X}(y - c).
\]

In other words, the probability distribution on \(Y\) is \(\mathbb{P} = \{ (y,p) \;|\; y=x+c \text{ and } P(X=x) = p \}\). This type of transformation of the random variable \(X\) to \(Y\) is called a \textbf{translation}.
\textbf{Example:} Suppose we define a random variable \(Z\) such that \(supp(Z) = \{0,1,2,3,4,5\}\) with p.m.f. \(f_{Z}\). We can define a new random variable \(Y = Z-4\). This random variable will have \(supp(Y) =\{-4,-3,-2,-1,0,1\}\) and p.m.f. \(f_{Y}(y) = f_{Z}(y+4)\).
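A tiny Python sketch of translation; the probabilities in `pmf_Z` are hypothetical since the example does not list \(f_{Z}\), but the shifting of the support is the point.

```python
# Translation: shifting a random variable moves its support but keeps each
# probability attached to the shifted value. The probabilities below are
# hypothetical; the text does not specify f_Z.
pmf_Z = {0: 0.1, 1: 0.2, 2: 0.3, 3: 0.2, 4: 0.1, 5: 0.1}

c = -4
pmf_Y = {z + c: p for z, p in pmf_Z.items()}   # Y = Z - 4

print(pmf_Y)   # {-4: 0.1, -3: 0.2, -2: 0.3, -1: 0.2, 0: 0.1, 1: 0.1}
```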
Scaling#
Define a random variable \(X\) with p.m.f. \(f_{X}\) and a new random variable \(Y = c \cdot X\), where \(c\) is a nonzero constant. Then the support for \(Y\) is \(supp(Y) = \{ c \cdot x \;|\; x \in supp(X) \}\) and the p.m.f. of \(Y\), \(f_{Y}(y)\), is

\[
f_{Y}(y) = f_{X}(y / c).
\]
When we multiply or divide a random variable by a constant to produce a new random variable this is called \textbf{scaling}.
\textbf{Example:} Suppose we define a random variable \(Z\) such that \(supp(Z) = \{0,1,2,3,4,5\}\) with p.m.f. \(f_{Z}\). We can define a new random variable \(Y = 4 \cdot Z\). This random variable will have \(supp(Y) =\{0,4,8,12,16,20\}\) and p.m.f. \(f_{Y}(y) = f_{Z}(y/4)\).
General transformation of a discrete random variable#
We explored translation and scaling as specific ways to use one random variable to generate another. However, there are many functions we can apply to a random variable \(X\) to create a new random variable \(Y\), and we need a more general method to compute the probability mass function for \(Y\).
Build a random variable \(X\) and define \(Y\) to be a new random variable such that \(Y = g(X)\). Then the support for \(Y\) is \(supp(Y) = \{y \;|\; y = g(x) \text{ and } x \in supp(X) \}\), and

\[
P(Y=y) = P(X=x_{1}) + P(X=x_{2}) + \cdots + P(X=x_{n})
\]
where \(x_{1},x_{2}, \cdots x_{n} \in g^{-1}(y)\). If \(Y\) is a function of \(X\) then the probability that the random variable \(Y\) will assign to the value \(y\) is the sum of the probabilities the random variable \(X\) assigns to the set of values \(x_{1},x_{2},\cdots,x_{n}\) that are mapped to \(y\) by the function \(g\).
\textbf{Example:} Let \(X\) be the random variable with support \(supp(X) = \{-2,-1,0,1,2\}\) and p.m.f. \(f_{X}\), where \(f_{X}(-1)=0.05\), \(f_{X}(0)=0.50\), \(f_{X}(1)=0.34\), and \(f_{X}(-2)+f_{X}(2)=0.11\). Build a new random variable \(Y = X^{2}\). Here the function \(g\) is \(g(x) = x^{2}\). The support of \(Y\) is \(supp(Y) = \{0,1,4\}\).

To compute \(P(Y=0)\) we need to sum up the probabilities of all values in \(supp(X)\) that \(g\) maps to \(0\). In our case the only value mapped to \(0\) is the value \(0\), and so \(P(Y=0) = P(X=0) = 0.50\). To compute \(P(Y=1)\) we need to sum the probabilities of all values that \(g\) maps to the value \(1\). In this case two values in \(supp(X)\) map to \(1\), the values \(-1\) and \(1\), and so \(P(Y=1) = P(X=-1) + P(X=1) = 0.05+0.34=0.39\). Finally, \(P(Y=4) = P(X=-2) + P(X=2) = 0.11\) (why?). The p.m.f. for \(Y\) is

\[\begin{align*}
f_{Y}(y) = \begin{cases} 0.50 & \text{ when } y = 0\\ 0.39 & \text{ when } y = 1\\ 0.11 & \text{ when } y = 4 \end{cases}
\end{align*}\]
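A compact Python sketch of the general recipe; because the example only gives the sum \(P(X=-2)+P(X=2)=0.11\), the individual values used for those two entries below are hypothetical.

```python
from collections import defaultdict

# General transformation Y = g(X): sum P(X = x) over every x that g maps to y.
# The split of 0.11 between x = -2 and x = 2 is hypothetical; only their sum is
# given in the example above.
pmf_X = {-2: 0.06, -1: 0.05, 0: 0.50, 1: 0.34, 2: 0.05}

def g(x):
    return x ** 2

pmf_Y = defaultdict(float)
for x, p in pmf_X.items():
    pmf_Y[g(x)] += p   # accumulate probability on the image value g(x)

print(dict(pmf_Y))   # {4: 0.11, 1: 0.39, 0: 0.5}
```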
Expectation#
Suppose we build a random variable \(X\) with a corresponding probability mass function \(f_{X}\).
The \textbf{expected value} of a random variable \(X\) is computed as

\[
\mathbb{E}(X) = x_{1} P(X=x_{1}) + x_{2} P(X=x_{2}) + \cdots + x_{n} P(X=x_{n})
\]
where \(x_{1},x_{2},\cdots,x_{n}\) are all values in the \(supp(X)\).
An intuitive definition of the expected value is that \(\mathbb{E}(X)\) is a weighted average of all values in the support of \(X\) where the weight for \(x_{i}\) is the probability of \(x_{i}\). The expected value of \(X\) will be close to values in \(supp(X)\) with high probability.
\textbf{Example:} Build a random variable \(Y\) with support \(supp(Y) = \{-1,0,1\}\) and \(f_{Y} = \{(-1,0.2),(0,0.5),(1,0.3)\}\). The expected value of \(Y\) is \(\mathbb{E}(Y) = 0.2 (-1) + 0.5 (0) + 0.3 (1) = 0.1\).
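This weighted sum is a one-liner in Python; `pmf_Y` is an assumed name for the distribution above.

```python
# The expected value as a probability-weighted sum over the support of Y.
pmf_Y = {-1: 0.2, 0: 0.5, 1: 0.3}

E_Y = sum(y * p for y, p in pmf_Y.items())
print(E_Y)   # 0.1 (up to floating point error)
```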
Properties of the expectation#
The expectation is a linear function, that is, \(\mathbb{E}(aY + b) = a\mathbb{E}(Y) + b \). We can show this by defining a random variable \(Z = aY+b\) and asking for its expected value:

\[\begin{align*}
\mathbb{E}(Z) &= z_{1} P(Z=z_{1}) + z_{2} P(Z=z_{2}) + \cdots + z_{n} P(Z=z_{n})\\
              &= (a y_{1} + b) P(Y=y_{1}) + (a y_{2} + b) P(Y=y_{2}) + \cdots + (a y_{n} + b) P(Y=y_{n})\\
              &= a \left[ y_{1} P(Y=y_{1}) + \cdots + y_{n} P(Y=y_{n}) \right] + b \left[ P(Y=y_{1}) + \cdots + P(Y=y_{n}) \right]\\
              &= a \mathbb{E}(Y) + b.
\end{align*}\]

The second step deserves some attention. The values \(z_{i}\) are equal to \(ay_{i} + b\); they are mapped from the values \(y_{i}\), and so the probability that \(Z\) equals \(z_{i}\) is equivalent to the probability that \(Y\) equals \(y_{i}\).
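A quick numerical check of linearity, using the distribution of \(Y\) from the earlier example and arbitrary constants \(a\) and \(b\).

```python
# Numerical check of linearity: E(aY + b) equals a * E(Y) + b.
pmf_Y = {-1: 0.2, 0: 0.5, 1: 0.3}
a, b = 3, 2

E_Y = sum(y * p for y, p in pmf_Y.items())
E_Z = sum((a * y + b) * p for y, p in pmf_Y.items())   # Z = aY + b shares Y's probabilities

print(E_Z, a * E_Y + b)   # both are 2.3
```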
Second moment and variance#
Define a random variable \(Y\) with \(supp(Y) = \{y_{1},y_{2},\cdots,y_{n}\}\); then the \textbf{variance} is the following function of \(Y\):

\[
V(Y) = \left(y_{1} - \mathbb{E}(Y)\right)^{2} P(Y=y_{1}) + \left(y_{2} - \mathbb{E}(Y)\right)^{2} P(Y=y_{2}) + \cdots + \left(y_{n} - \mathbb{E}(Y)\right)^{2} P(Y=y_{n}).
\]

The variance can be thought of as the squared distance of each value in the support of the random variable \(Y\) from the expected value, weighted by the probability of each value. In other words, the variance measures how far, in squared distance, we expect the random variable to fall from its expected value.
\textbf{Example:} Define a random variable \(Z\) with probability mass function \(f_{Z}\). To compute \(V(Z)\) we first compute the expected value of \(Z\), \(\mathbb{E}(Z)\), as the probability-weighted sum of the values in \(supp(Z)\). Now we can compute the variance by weighting each squared deviation \(\left(z_{i} - \mathbb{E}(Z)\right)^{2}\) by \(P(Z=z_{i})\) and summing.
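A short Python sketch of the variance computation, shown for the earlier random variable \(Y\) because the p.m.f. of \(Z\) is not reproduced here.

```python
# Variance as the probability-weighted average of squared deviations from the
# expected value, computed for the earlier random variable Y.
pmf_Y = {-1: 0.2, 0: 0.5, 1: 0.3}

E_Y = sum(y * p for y, p in pmf_Y.items())                 # 0.1
V_Y = sum((y - E_Y) ** 2 * p for y, p in pmf_Y.items())    # 0.49

print(E_Y, V_Y)
```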
Just in time summation#
Summing a sequence of values is performed so frequently in mathematics and statistics that we have developed a special notation that simplifies sums.
Given a sequence of values, \(x_{1}, x_{2}, x_{3}, \cdots, x_{n}\), that we wish to sum, define the following operator to represent that sum:

\[
\sum_{i=1}^{n} x_{i} = x_{1} + x_{2} + x_{3} + \cdots + x_{n}.
\]
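In code, this operator corresponds to an ordinary loop or to Python's built-in `sum`; the values in `x` below are arbitrary.

```python
# The summation operator written out term by term, next to Python's sum().
x = [3, 1, 4, 1, 5]   # x_1, x_2, ..., x_5

explicit = x[0] + x[1] + x[2] + x[3] + x[4]
compact = sum(x)      # corresponds to \sum_{i=1}^{5} x_i

print(explicit, compact)   # 14 14
```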
How the expected value earned its name#
Markov inequality#
Suppose we define a random variable \(X\) with a support that contains only non-negative numbers. Then for any constant \(a > 0\),

\[
P(X \geq a) \leq \frac{\mathbb{E}(X)}{a}.
\]
Chebyshev’s inequality#
Covariance#
Correlation#
Exercises#
Suppose \(\mathcal{S} = \{ a,b,c \}\) and \(P(\{a\})=0.2\), \(P(\{b\})=0.3\), \(P(\{c\})=0.5\). Define a random variable \(X\) such that \(X(a)=1\), \(X(b)=1\), and \(X(c)=0\). Define a second random variable \(Y\) such that \(Y(a)=0\), \(Y(b) = 1\), \(Y(c)=2\).
Compute \(P(X=1)\)
Compute \(P(X=0)\)
What is \(supp(X)\) ?
What new sample space does \(X\) generate?
Compute \(P(Y=1)\)
Compute \(P(Y=0)\)
What is \(supp(Y)\) ?
What new sample space does \(Y\) generate?
Let \(\mathcal{S} = \{1,2,3,4,5,6,7,8,9,10,11\}\), the set of positive integers from 1 to 11. Further define a random variable \(K\) with the following probability mass function
\[\begin{align*}
f(k) &= \left( \frac{1}{2} \right)^{k} \quad \text{when } k \leq 10\\
f(11) &= 0.009
\end{align*}\]

Is the p.m.f. \(f\) a valid probability distribution? Why or why not?
What value of \(K\) is assigned the highest probability?
Please define the cumulative mass function for the random variable \(K\)
Define a random variable \(Y\) with \(supp(Y) = \{-3,-2,-1,0,1,2,3\}\) and cumulative mass function
\[\begin{align*}
F(y) = \begin{cases} 0.10 & \text{ when } y = -3\\ 0.24 & \text{ when } y = -2\\ 0.36 & \text{ when } y = -1\\ 0.50 & \text{ when } y = 0\\ 0.67 & \text{ when } y = 1\\ 0.78 & \text{ when } y = 2\\ 1.00 & \text{ when } y = 3 \end{cases}
\end{align*}\]

What is \(P(Y \leq -1)\)
What is \(P(Y=-1)\)
What is \(P(Y= 1)\)
Please define the p.m.f for the random variable \(Y\).
Graph the c.m.f
Graph the p.m.f
Define the following joint distribution of random variables \(Q\) and \(R\) that are mapped from the sample space \(\mathcal{G}\)
| Q | R | prob |
|---|---|---|
| 2 | 1 | 0.01 |
| 2 | 2 | 0.075 |
| 2 | 3 | 0.13 |
| 1 | 1 | 0.1 |
| 1 | 2 | 0.05 |
| 1 | 3 | 0.17 |
| 0 | 1 | 0.05 |
| 0 | 2 | 0.30 |
| 0 | 3 | 0.115 |
What is the implied support of \(Q\)?
What is the implied support of \(R\)?
Compute P(Q=1,R=2)
Compute the marginal probabilities for \(Q\)
Compute the marginal probabilities for \(R\)
Compute \(P(R=1 | Q=0)\)
The random variable \(Q\) is called \textbf{statistically independent} from \(R\) if for every value \(q \in supp(Q)\) and for every value \(r \in supp(R)\) the following is true: \(P(Q=q | R=r) = P(Q=q)\). Is \(Q\) statistically independent from \(R\)? Why or why not?
For two events \(A\) and \(B\) that are statistically independent, we found that \(P(A \cap B) = P(A)P(B)\). Please derive an equivalent expression for two random variables by applying the definition of conditional probability and statistical independence to random variables.
Define the following joint distribution of random variables \(Q\) and \(R\) that are mapped from the sample space \(\mathcal{G}\)
| Q | R | prob |
|---|---|---|
| 2 | 1 | 0.01 |
| 2 | 2 | 0.075 |
| 2 | 3 | 0.13 |
| 1 | 1 | 0.1 |
| 1 | 2 | 0.05 |
| 1 | 3 | 0.17 |
| 0 | 1 | 0.05 |
| 0 | 2 | 0.30 |
| 0 | 3 | 0.115 |
Please compute \(\mathbb{E}(Q)\)
Please compute \(\mathbb{E}(R)\)
Please compute \(V(Q)\)
Please compute \(V(R)\)
Kimberly Palaguachi-Lopez (Class of 2024): Consider the following joint distribution of random variables \(Z\) and \(L\) that are mapped from the sample space \(\mathcal{G}\)
| Z | L | prob |
|---|---|---|
| 1 | 0 | 0.05 |
| 2 | 0 | 0.15 |
| 3 | 0 | 0.19 |
| 2 | 1 | 0.02 |
| 1 | 1 | 0.12 |
| 3 | 1 | 0.04 |
| 2 | 0.5 | 0.10 |
| 3 | 0.5 | 0.13 |
| 1 | 0.5 | 0.20 |
Please compute the \(supp(Z)\)?
Please compute the \(supp(L)\)?
Solve for \(P(Z=1,L=1)\)
Solve for the respective marginal probabilities of \(Z\)
Solve for the respective marginal probabilities of \(L\)
Is the probability distribution for the random variable Z a valid probability distribution? Why or why not? Is the probability distribution for L a valid probability distribution? Why or why not?
Suppose \(X\) is a random variable with support \(supp(X) = \{-2,-1,0,1,2,3\}\) and \(Y = |X|\). Further assume the c.m.f of \(X\) is
\[\begin{align}
F_{X}(x) = \begin{cases} 0.05 & \text{ if } x=-2\\ 0.15 & \text{ if } x=-1\\ 0.35 & \text{ if } x=0\\ 0.65 & \text{ if } x=1\\ 0.95 & \text{ if } x=2\\ 1.00 & \text{ if } x=3 \end{cases}
\end{align}\]

Define the support of \(Y\)
Build the p.m.f of \(Y\)
Build the c.m.f of \(Y\)
Compute \(\sum_{i=-5}^{i=5} i^{2}/2\)
Show that \(V(X)\) simplifies to \(\mathbb{E}(X^{2}) - \left[\mathbb{E}(X)\right]^{2}\). Hint: Write down the definition of variance using the expectation, then expand the squared terms and simplify.
Define the random variable \(B\) with p.m.f.
\[\begin{align}
f_{B}(b) = \begin{cases} 0.52 &\text{ if } b=0\\ 0.12 &\text{ if } b=1\\ 0.34 &\text{ if } b=2 \end{cases}
\end{align}\]

Compute \(\mathbb{E}(B)\)
Compute \(V(B)\)
Use Chebyshev’s inequality to make a statement about \(P(B \geq b)\)