stats Archives - Page 17 of 728

세 개의 벡터 , 및 가 주어지면 와 , , , 와 사이 상관 관계 가 모두 음수 일 수 있습니까? 즉, 이것이 가능합니까? $a$

a

$a$ $b$

b

$b$ $c$

c

$c$ $a$

a

$a$ $b$

b

$b$ $a$

a

$a$ $c$

c

$c$ $b$

b

$b$ $c$

c

$c$

\begin{aligned} corr (a, b) < 0 \\ corr (a, c) < 0 \\ corr (b, c) < 0 \end{aligned}

$\begin{align} \text{corr}(a,b) < 0\\ \text{corr}(a,c) < 0 \\ \text{corr}(b,c) < 0\\ \end{align}$

답변

벡터의 크기가 3 이상이면 가능합니다. 예를 들어

\begin{aligned} a & = (- 1, 1, 1) \\ b & = (1, - 9, - 3) \\ c & = (2, 3, - 1) \end{aligned}

$\begin{align} a &= (-1, 1, 1)\\ b &= (1, -9, -3)\\ c &= (2, 3, -1)\\ \end{align}$

상관 관계는

cor (a, b) = - 0.80... cor (a, c) = - 0.27... cor (b, c) = - 0.34...

$\begin{equation} \text{cor}(a,b) = -0.80...\\ \text{cor}(a,c) = -0.27...\\ \text{cor}(b,c) = -0.34... \end{equation}$

크기가 2 인 벡터의 경우 이것이 불가능하다는 것을 증명할 수 있습니다 :

\begin{aligned} cor (a, b) & < 0 \\ 2 (\sum_{i} a_{i} b_{i}) - (\sum_{i} a_{i}) (\sum_{i} b_{i}) & < 0 \\ 2 (a_{1} b_{1} + a_{2} b_{2}) - (a_{1} + a_{2}) (b_{1} b_{2}) & < 0 \\ 2 (a_{1} b_{1} + a_{2} b_{2}) - (a_{1} + a_{2}) (b_{1} b_{2}) & < 0 \\ 2 (a_{1} b_{1} + a_{2} b_{2}) - a_{1} b_{1} + a_{1} b_{2} + a_{2} b_{1} + a_{2} b_{2} & < 0 \\ a_{1} b_{1} + a_{2} b_{2} - a_{1} b_{2} + a_{2} b_{1} & < 0 \\ a_{1} (b_{1} - b_{2}) + a_{2} (b_{2} - b_{1}) & < 0 \\ (a_{1} - a_{2}) (b_{1} - b_{2}) & < 0 \end{aligned}

$\begin{align} \text{cor}(a,b) &< 0\\[5pt] 2\Big(\sum_i a_i b_i\Big) - \Big(\sum_i a_i\Big)\Big(\sum_i b_i\Big) &< 0\\[5pt] 2(a_1 b_1 + a_2 b_2) - (a_1 + a_2)(b_1 b_2) &< 0\\[5pt] 2(a_1 b_1 + a_2 b_2) - (a_1 + a_2)(b_1 b_2) &< 0\\[5pt] 2(a_1 b_1 + a_2 b_2) - a_1 b_1 + a_1 b_2 + a_2 b_1 + a_2 b_2 &< 0\\[5pt] a_1 b_1 + a_2 b_2 - a_1 b_2 + a_2 b_1 &< 0\\[5pt] a_1 (b_1-b_2) + a_2 (b_2-b_1) &< 0\\[5pt] (a_1-a_2)(b_1-b_2) &< 0 \end{align}$

The formula makes sense: if $a_{1}$

a_{1}

$a_1$ is larger than $a_{2}$

a_{2}

$a_2$ , $b_{1}$

b_{1}

$b_1$ has to be larger than $b_{1}$

b_{1}

$b_1$ to make the correlation negative.

Similarly for correlations between (a,c) and (b,c) we get

(a_{1} - a_{2}) (c_{1} - c_{2}) < 0 (b_{1} - b_{2}) (c_{1} - c_{2}) < 0

$\begin{equation} (a_1-a_2)(c_1-c_2) < 0\\ (b_1-b_2)(c_1-c_2) < 0\\ \end{equation}$

Clearly, all of these three formulas can not hold in the same time.

답변

Yes, they can.

Suppose you have a multivariate normal distribution $X \in R^{3}, X \sim N (0, Σ)$

X \in R^{3}, X \sim N (0, Σ)

$X\in R^3, X\sim N(0,\Sigma)$ .
The only restriction on $Σ$

Σ

$\Sigma$ is that it has to be positive semi-definite.

So take the following example $Σ = (\begin{matrix} 1 & - 0.2 & - 0.2 \\ - 0.2 & 1 & - 0.2 \\ - 0.2 & - 0.2 & 1 \end{matrix})$

Σ = (\begin{matrix} 1 & - 0.2 & - 0.2 \\ - 0.2 & 1 & - 0.2 \\ - 0.2 & - 0.2 & 1 \end{matrix})

$\Sigma = \begin{pmatrix} 1 & -0.2 & -0.2 \\ -0.2 & 1 & -0.2 \\ -0.2 & -0.2 & 1 \end{pmatrix}$

Its eigenvalues are all positive (1.2, 1.2, 0.6), and you can create vectors with negative correlation.

답변

let's start with a correlation matrix for 3 variables

$Σ = (\begin{matrix} 1 & p & q \\ p & 1 & r \\ q & r & 1 \end{matrix})$

Σ = (\begin{matrix} 1 & p & q \\ p & 1 & r \\ q & r & 1 \end{matrix})

$\Sigma = \begin{pmatrix} 1 & p & q \\ p & 1 & r \\ q & r & 1 \end{pmatrix}$

non-negative definiteness creates constraints for pairwise correlations $p, q, r$

p, q, r

$p,q,r$ which can be written as

p q r \geq \frac{p^{2} + q^{2} + r^{2} - 1}{2}

$pqr \ge \frac{p^2+q^2+r^2-1}2$

For example, if $p = q = - 1$

p = q = - 1

$p=q=-1$ , the values of $r$

r

$r$ is restricted by $2 r \geq r^{2} + 1$

2 r \geq r^{2} + 1

$2r \ge r^2+1$ , which forces $r = 1$

r = 1

$r=1$ . On the other hand if $p = q = - \frac{1}{2}$

p = q = - \frac{1}{2}

$p=q=-\frac12$ , $r$

r

$r$ can be within $\frac{2 \pm \sqrt{3}}{4}$

\frac{2 \pm \sqrt{3}}{4}

$\frac{2 \pm \sqrt{3}}4$ range.

Answering the interesting follow up question by @amoeba: "what is the lowest possible correlation that all three pairs can simultaneously have?"

Let $p = q = r = x < 0$

p = q = r = x < 0

$p=q=r=x < 0$ , Find the smallest root of $2 x^{3} - 3 x^{2} + 1$

2 x^{3} - 3 x^{2} + 1

$2x^3-3x^2+1$ , which will give you $- \frac{1}{2}$

- \frac{1}{2}

$-\frac12$ . Perhaps not surprising for some.

A stronger argument can be made if one of the correlations, say $r = - 1$

r = - 1

$r=-1$ . From the same equation $- 2 p q \geq p^{2} + q^{2}$

- 2 p q \geq p^{2} + q^{2}

$-2pq \ge p^2+q^2$ , we can deduce that $p = - q$

p = - q

$p=-q$ . Therefore if two correlations are $- 1$

- 1

$-1$ , third one should be $1$

1

$1$ .

답변

A simple R function to explore this:

f <- function(n,trials = 10000){
  count <- 0
  for(i in 1:trials){
    a <- runif(n)
    b <- runif(n)
    c <- runif(n)
    if(cor(a,b) < 0 & cor(a,c) < 0 & cor(b,c) < 0){
      count <- count + 1
    }
  }
  count/trials
}

의 함수는 0 n에서 f(n)시작하여 0이 아닌 n = 3값 (일반 값은 약 0.06)으로 증가한 다음 약 0.11로 증가한 n = 15후 안정화됩니다.

따라서 세 가지 상관 관계를 모두 음수로 만들 수있을뿐만 아니라 (적어도 균일 한 분포의 경우)별로 드문 일은 아닙니다.

How IT

언제든지 물어보세요.

카테고리 보관물: stats

3 개의 벡터가 모두 음의 쌍별 상관 관계를 가질 수 있습니까? c 사이 의 상관 관계 가 모두

답변

답변

답변

답변

답변