Kurskod: TAMS11 Provkod: TENB 21 March 2015, 14:00-18:00 Examiner: Xiangfeng Yang (Tel: 070 2234765). Please answer in ENGLISH if you can. a. You are allowed to use: • a calculator; • formel -och tabellsamling i matematisk statistik (from MAI); • TAMS 11: Notations and Formulae (by Xiangfeng Yang), • a dictionary. b. Scores rating: 8-11 points giving rate 3; 11.5-14.5 points giving rate 4; 15-18 points giving rate 5. English Version (no Swedish Version) 1 (3 points) At a certain gas station, 40% of the customers use regular gas, 35% use plus gas, and 25% use premium. Of those customers using regular gas, only 30% fill their tanks. Of those customers using plus gas, 60% fill their tanks, whereas of those using premium, 50% fill their tanks. (1.1). (1p) What is the probability that the next customer will request plus gas and fill the tank ? (1.2). (1p) What is the probability that the next customer fills the tank ? (1.3). (1p) If one knows that the next customer fills the tank, what is the probability that regular gas is requested ? Solution. What we can get from the problem is the following: P (regular) = 40%, P (plus) = 35%, P (f ill|regular) = 30%, P (premium) = 25%, P (f ill|plus) = 60%, P (f ill|premium) = 50%. (1.1). P (plus and f ill) = P (plus) × P (f ill|plus) = 35% × 60% = 21%. (1.2). P (f ill) = P (f ill|regular) × P (regular) + P (f ill|plus) × P (plus) + P (f ill|premium) × P (premium) = 30% × 40% + 60% × 35% + 50% × 25% = 45.5%. (1.3). P (regular and f ill) P (f ill) P (f ill|regular) × P (regular) 30% × 40% = = = 12/45.5 = 26.37%. P (f ill) 45.5% P (regular|f ill) = 2 (3 points) Annie and Alvie have agreed to meet between 5:00 pm and 6:00 pm for dinner at a local health-food restaurant. Let X = Annie’s arrival time and Y = Alvie’s arrival time. Suppose that the joint probability density function (joint pdf) for the two-dimension random variable (X, Y ) is f (x, y) = 1, if 5 ≤ x ≤ 6 and 5 ≤ y ≤ 6. (2.1). (1p) Find the marginal pdf fX (x) of X and the marginal pdf fY (y) of Y. (2.2). (1p) What is the probability that they both arrive between 5:15 pm and 5:45 pm ? (2.3). (1p) If the first one to arrive will wait only 10 minutes before leaving to eat elsewhere, what is the probability that they have dinner at the local health-food restaurant ? (Hint: the event of interest is A = {(x, y) : |x − y| ≤ 16 }.) Page 1/5 Solution. (2.1) Z ∞ fX (x) = Z 6 1 dy = 1, 5 ≤ x ≤ 6. f (x, y)dy = −∞ 5 Similarly, Z ∞ Z −∞ 6 1 dx = 1, 5 ≤ y ≤ 6. f (x, y)dx = fY (y) = 5 (2.2) P (both 5 : 15 − 5 : 45) = P (5.25 ≤ X ≤ 5.75 and 5.25 ≤ Y ≤ 5.75) Z 5.75 Z 5.75 f (x, y)dxdy = 0.25. = 5.25 5.25 (2.3) The probability is Z Z P (A) = f (x, y)dxdy = 11/36 = 0.3056. (draw the graph, then you will see clearly). A 3 (3 points) A binary communication channel transmits a sequence of “bits” (0s and 1s). Suppose that for any particular bit transmitted, there is a 10% chance of a transmission error (a 0 becoming a 1, or a 1 becoming a 0). Assume that bit errors occur independently of one another. Now we consider transmitting 1000 bits, What is the probability that at most 125 transmission errors occur ? Solution. We have two different methods to solve the problem. The first method is the usual one: CLT. Let Xi record the possible error of the i-th bit, so Xi p(x) 0 0.9 1 0.1 So we can easily get µ = E(Xi ) = 0.1 and σ 2 = V (Xi ) = 0.09. Thus the probability is P (at most 125 transmission errors occur) = P (X1 + X2 + . . . + X1000 ≤ 125) 125 X1 + X2 + . . . + X1000 ¯ ≤ 0.125) ≤ ) = P (X = P( 1000 1000 ¯ −µ X 0.125 − 0.1 √ = P( √ ≤ √ ) = P (N (0, 1) < 2.64) = 0.9959. σ/ n 0.09/ 1000 The second method is ‘Normal approximation to a Binomial random variable’. If we use X = total number of errors, then X ∼ Bin(1000, 0.1). Thus P (at most 125 transmission errors occur) = P (X ≤ 125) = P (N (1000 · 0.1, 1000 · 0.1 · 0.9) ≤ 125) = P (N (100, 90) ≤ 125) 125 − 100 √ = P (N (0, 1) ≤ ) = P (N (0, 1) < 2.64) = 0.9959. 90 Page 2/5 4 (3 points) Let X denote the proportion of allotted time that a randomly selected student spends working on a certain test. Suppose that the pdf of X is f (x) = (θ + 1) · xθ , 0 ≤ x ≤ 1, where θ > −1 is an unknown parameter. A random sample of 3 students yields data: {0.71, 0.92, 0.83}. (4.1). (1p) Find a point estimate θˆM M of θ using Method of Moments. (4.2). (2p) Find a point estimate θˆM L of θ using Maximum-Likelihood method. Solution. (4.1). For Method of Moments, the first equation is E(X) = x ¯. The mean E(X) can be calculated as Z 1 E(X) = x(θ + 1) · xθ dx = (θ + 1)/(θ + 2). 0 By solving E(X) = x ¯, we have θ = (2¯ x − 1)/(1 − x ¯) which yields θˆM M = (2¯ x − 1)/(1 − x ¯). From the data, 0.71+0.92+0.83 = 0.82, thus x ¯= 3 θˆM M = (2 · 0.82 − 1)/(1 − 0.82) = 0.64/0.18 = 3.56. (4.2). For the Maximum-Likelihood method, we write the likelihood function as L(θ) = f (x1 ) · f (x2 ) . . . f (xn ) = (θ + 1)n · (x1 . . . xn )θ . Maximizing L(θ) is equivalent to maximize ln L(θ) where ln L(θ) = n ln(θ + 1) + θ ln(x1 . . . xn ). By d ln L(θ) dθ = 0, we have n θ+1 + ln(x1 . . . xn ) = 0, therefore θˆM L = − (The second derivative 5 d2 ln L(θ) dθ 2 n 3 3 −1=− −1=− − 1 = 3.9. ln(x1 . . . xn ) ln(0.542156) −0.6122 < 0 which yields that θˆM L is indeed a maximal point) (3 points) One has measured the same physical quantity five independent times with Method A and four independent times with Method B, and the results are: Method A: Method B: x ¯ = 1.0635 y¯ = 1.0515 s2x = 3.7 · 10−6 s2y = 5.6 · 10−6 We assume that the sample for Method A is from N (µx , σ 2 ), and the sample for Method B is from from N (µy , σ 2 ). (5.1). (1p) Construct a (two-sided) 95% confidence interval of µx . (5.2). (1p) Do you think µx 6= µy ? Answer this using a (two-sided) 95% confidence interval of µx − µy . (5.3). (1p) Construct an upper 95% confidence interval (one-sided) for σ in the form (0, b). Solution. (5.1) Since σ is unknown, a 95% confidence interval of µx would be sx Iµx = (¯ x − tα/2 (n − 1) · √ , n sx x ¯ + tα/2 (n − 1) · √ ) n √ √ 3.7 · 10−6 3.7 · 10−6 √ √ = (1.0635 − t0.025 (5 − 1) · , 1.0635 + t0.025 (5 − 1) · ) 5 5 = (1.0635 − 2.78 · 0.86 · 10−3 , 1.0635 + 2.78 · 0.86 · 10−3 ) = (1.0635 − 0.00239, 1.0635 + 0.00239) = (1.06111, Page 3/5 1.06589). (5.2) A 95% confidence interval of µx − µy is r Iµx −µy = ((¯ x − y¯) − tα/2 (n1 + n2 − 2) · s · 1 1 + , n1 n2 r (¯ x − y¯) + tα/2 (n1 + n2 − 2) · s · 1 1 + ) n1 n2 where (¯ x − y¯) = 1.0635 − 1.0515 = 0.012; tα/2 (n1 + n2 − 2) = t0.025 (5 + 4 − 2) = 2.36; s r √ p (n1 − 1)s2x + (n2 − 1)s2y (5 − 1) · 3.7 · 10−6 + (4 − 1) · 5.6 · 10−6 2 = = 31.6 · 10−6 /7 = 2.125 · 10−3 s= s = n1 + n2 − 2 5+4−2 r r 1 1 1 1 = + = + = 0.671. n1 n2 5 4 Thus Iµx −µy = (0.012 − 0.003365, 0.012 + 0.003365) = (0.008635, 0.015365). Since 0 ∈ / Iµx −µy , we think µx 6= µy . (5.3) Since σ can be from Method A or Method B, we may solve this problem in three different ways and all are correct! 1st way: σ from Method A. One-sided confidence interval for σ 2 is Iσ2 = (0, (n1 − 1) · s2x ) = (0, χ21−α (n1 − 1) (5 − 1) · 3.7 · 10−6 ) = (0, χ21−0.05 (5 − 1) 14.8 · 10−6 ) = (0, 0.71 20.845 · 10−6 ). Thus one-sided confidence interval for σ is √ Iσ = (0, 20.845 · 10−6 ) = (0, 0.0045656). 2nd way: σ from Method B. One-sided confidence interval for σ 2 is Iσ2 = (0, (n2 − 1) · s2y ) = (0, χ21−α (n2 − 1) (4 − 1) · 5.6 · 10−6 ) = (0, χ21−0.05 (4 − 1) 16.8 · 10−6 ) = (0, 0.35 48 · 10−6 ). Thus one-sided confidence interval for σ is √ Iσ = (0, 48 · 10−6 ) = (0, 0.0069282). 3rd way: σ from both Method A and Method B (didn’t talk about this in the lectures, but include here just for your reference). One-sided confidence interval for σ 2 is Iσ2 = (0, (n1 + n2 − 2) · s2 ) = (0, χ21−α (n1 + n2 − 2) (5 + 4 − 2) · 4.514 · 10−6 ) = (0, χ21−0.05 (5 + 4 − 2) 31.6 · 10−6 ) = (0, 2.17 14.56 · 10−6 ). Thus one-sided confidence interval for σ is √ Iσ = (0, 6 14.56 · 10−6 ) = (0, 0.003816). (3 points) The PCB-concentration is measured for 10 fish in a lake, and the results are: 11.5, 10.8, 11.6, 9.4, 12.4, 11.4, 12.2, 11.0, 10.6, 10.8 We assume that these observations are from a normal population N (µ, 0.81). Previous measurements show that the average PCB-concentration for the fish is 10.8, but it is suspected that the concentration now becomes higher in the lake. (6.1). (1p) Test the following hypotheses with a significance level α = 0.05 : H0 : µ = 10.8 versus Ha : µ > 10.8. (6.2). (2p) For the test in (6.1), what is the power when the actual µ = 11.0 ? Page 4/5 Solution. (6.1) Since the population variance is known σ 2 = 0.81, according to Ha the rejection region C = (zα , +∞) = (z0.05 , +∞) = (1.65, +∞). The test statistic is TS = 11.17 − 10.8 x ¯ − µ0 √ = √ √ = 1.3. σ/ n 0.81/ 10 Since T S ∈ / C, we don’t reject H0 . (6.2) The power is h(11) = P (reject H0 when H0 is wrong and µ = 11) ¯ − µ0 X √ > 1.65 when µ = 11) = P( σ/ n ¯ −µ ¯ −µ ¯ − µ0 X X X √ to √ since √ ∼ N (0, 1)) (need to change σ/ n σ/ n σ/ n ¯ − µ µ − µ0 X √ > 1.65 when µ = 11) = P( √ + σ/ n σ/ n 11 − 10.8 √ > 1.65) = P (N (0, 1) + √ 0.81/ 10 = P (N (0, 1) > 1.65 − 0.7027) = P (N (0, 1) > 0.95) = 1 − 0.8289 = 0.1711. Page 5/5
© Copyright 2025