Ace - AI Tutor
Ask Our Educators
Textbooks
My Library
Flashcards
Scribe - AI Notes
Notes & Exams
Download App
krystal warren

krystal w.

Divider

Questions asked

BEST MATCH

Discrete Question 12, 4.3.27 Part 1 of 3 HW Score: 82.69%, 10.75 of 13 points Points: 0 of 1 Save An automobile manufacturer finds that 1 in every 2500 automobiles produced has a particular manufacturing defect. (a) Use a binomial distribution to find the probability of finding 4 cars with the defect in a random sample of 4500 cars. (b) The Poisson distribution can be used to approximate the binomial distribution for large values of n and small values of p. Repeat (a) using a Poisson distribution and compare the results. (a) The probability using the binomial distribution is (Round to five decimal places as needed.)

View Answer
divider
BEST MATCH

Suppose the field inside a large piece of dielectric is E0, so that the electric displacement is D0=

View Answer
divider
BEST MATCH

2. k-Means Clustering (12 points) Consider n data points $x_1, \dots, x_n \in \mathbb{R}^d$ and $X \in \mathbb{R}^{n \times d}$ with these data points as its rows. Consider any clustering $C = \{C_1, C_2, \dots, C_k\}$ of these data points. I.e., each $C_j$ is a set (a cluster) of points and each data point $x_i$ is assigned to one of these sets. Let $\mu_j = \frac{1}{|C_j|} \sum_{x \in C_j} x$ be the centroid (i.e., the mean) of cluster $C_j$. The k-means clustering objective is to minimize $\text{cost}(C, X) = \sum_{j=1}^k \sum_{x \in C_j} ||x - \mu_j||_2^2$. 1. (2 points) Prove that $\text{cost}(C, X) = \sum_{j=1}^k \frac{1}{2|C_j|} \sum_{x \in C_j} \sum_{y \in C_j} ||x - y||_2^2$. Hint: Prove that both expressions the cost can be written as $\sum_{j=1}^k \left[ \left( \sum_{x \in C_j} ||x||_2^2 \right) - |C_j| \cdot ||\mu_j||_2^2 \right]$. This will require some vector algebra. It will be helpful to use that for any vector $z$, $||z||_2^2 = \sum_{i=1}^d (z^{(i)})^2 = (z, z)$, as well as to use the linearity of inner product. 2. (2 points) Suppose that $\Pi \in \mathbb{R}^{m \times d}$ is a random projection matrix with each entry chosen independently as $N(0, 1/m)$. For each $x_i$ in the dataset, let $\tilde{x}_i = \Pi x_i$ and let $\tilde{X} \in \mathbb{R}^{n \times m}$ contain the compressed data points as its rows. By part (1), for $m = O(\frac{d}{\epsilon^2})$, with high probability, we have, for all clusterings $C$, $(1 - \epsilon) \cdot \text{cost}(C, X) \le \text{cost}(C, \tilde{X}) \le (1 + \epsilon) \cdot \text{cost}(C, X)$. Assuming that the above bound holds, prove that if we compute the optimal clustering on the compressed data, $\tilde{C} = \arg \min_C \text{cost}(C, \tilde{X})$, then it is near-optimal for the original data, i.e., that: $\text{cost}(\tilde{C}, X) \le \frac{1 + \epsilon}{1 - \epsilon} \min_C \text{cost}(C, X)$. The next few questions focus on the connection between k-means clustering and low-rank matrix approximation/PCA. 3. (2 points) Let $X_C$ be the $n \times d$ matrix whose $i^{th}$ row is equal to $\mu_j$ if $x_i$ is assigned to cluster $C_j$ in $C$. Verify that the k-means cost function can be written as $\text{cost}(C, X) = ||X - X_C||_F^2$. 4. (2 points) Use (3) to prove that for any clustering $C$, $\text{cost}(C, X) \ge \min_{\text{rank}(B) \le k} ||X - B||_F^2$. Hint: What is the rank of $X_C$? 5. (2 points) Show that we can write $X_C = VV^T X$ where $V \in \mathbb{R}^{n \times k}$ has orthonormal columns. 6. (2 points) Explain in a few sentences what parts (3)-(4) mean. How is k-means clustering similar to PCA? How is it different?

View Answer
divider
BEST MATCH

Some secretory proteins and some intergral ,embrane proteins must pass through a membrane. A ribonucleoprotein

View Answer
divider
BEST MATCH

A compound has the formula: Al$_2$X$_3$. Which of the following atoms could be X? chlorine sulfur phosphorus hydrogen

View Answer
divider
BEST MATCH

considering the action of the fibularis longus which of the following attachments will serve as an insertion? head of fibula medial cuneiform

View Answer
divider
BEST MATCH

How do the vertebral columns of hagfishes and lamprey differ from those of other vertebrates?

View Answer
divider
BEST MATCH

(ii) A mixture with x(1)CCl4 = 0.4 was distilled under equilibrium conditions until 0.4 mole fraction of this liquid was vapourized. Calculate: a) the final mole fraction of CCl4 in the liquid and vapour phases, b) the final equilibrium temperature.

View Answer
divider
BEST MATCH

Find an equation of the tangent line to the graph of $y = x^{13x}$ at the point where $x = e$. The equation of the tangent line to the graph of $y = x^{13x}$ at the point where $x = e$ is

View Answer
divider
BEST MATCH

Generator Output per Unit = 3.6 MW Number of Generator Units = 12 2 out of the 12 generator units are reserve units Hours of the Day: 12:00 MN - 1:00 AM 1:00 AM - 2:00 AM 2:00 AM - 3:00 AM 3:00 AM - 4:00 AM 4:00 AM - 5:00 AM 5:00 AM - 6:00 AM 6:00 AM - 7:00 AM 7:00 AM - 8:00 AM 8:00 AM - 9:00 AM 9:00 AM - 10:00 AM 10:00 AM - 11:00 AM 11:00 AM - 12:00 NOON 12:00 NOON - 1:00 PM 1:00 PM - 2:00 PM 2:00 PM - 3:00 PM 3:00 PM - 4:00 PM 4:00 PM - 5:00 PM 5:00 PM - 6:00 PM 6:00 PM - 7:00 PM 7:00 PM - 8:00 PM 8:00 PM - 9:00 PM 9:00 PM - 10:00 PM 10:00 PM - 11:00 PM 11:00 PM - 12:00 PM Monday to Saturday (MW): 23.73 23.58 23.1 23.04 23.7 24.58 28.47 27.09 26.8 26.93 27.29 27.5 27.09 26.49 26.09 26.49 28.23 29.09 29.6 30.39 30.4 28.98 27.49 25.2 Sunday (MW): 29.67 29.37 29.17 28.92 29.21 30.02 31.8 31.58 31.72 31.91 32.07 32.49 32.28 31.66 31.44 30.8 31.61 31.82 32.56 34.8 35.09 33.82 32.87 30.42 Make a power plant scheduling for weekdays and for Sunday based on the given loads. You can only use the reserve engines if necessary.

View Answer
divider