# Questions tagged [multivariate-analysis]

Analyses in which more than one variable is analyzed at once, where these variables are either the dependent (response) variables or the only variables in the analysis. This contrasts with "multiple" or "multivariable" analysis, which implies more than one predictor (independent) variable.

1,996
questions

**5**

votes

**2**answers

36 views

### Why does component-wise median not make sense in higher dimensions?

I would like to compute the median of a higher-dimensional point set by computing the component-wise median for each individual dimension. The point that consists of the medians of each individual ...
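
One frequently cited concern with this construction can be illustrated with a minimal sketch. The three points below are an assumed toy example, not data from the question: every point lies on the plane x + y + z = 1, yet the component-wise median does not.

```python
from statistics import median

# Hypothetical toy data: three points that all lie on the plane x + y + z = 1.
points = [(1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (0.0, 0.0, 1.0)]


def componentwise_median(pts):
    """Return the point whose k-th coordinate is the median of all k-th coordinates."""
    return tuple(median(coords) for coords in zip(*pts))


cw_median = componentwise_median(points)
print(cw_median)       # the origin
print(sum(cw_median))  # not 1, so the result is off the plane containing every point
```

In this sketch the result falls outside the convex hull of the data, which is one reason alternatives such as the geometric median are often suggested.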

**1**

vote

**1**answer

22 views

### How should I use interaction variables to compare two logistic regressions?

I am working on creating a predictive model using logistic regressions. I am hoping to compare two different populations, using the same set of variables but different data sets with different sample ...

**0**

votes

**0**answers

9 views

### How can I find the difference in features between two groups?

Assuming I have 2 data sets with 200 features each, and I can tell there is a difference between the groups in behavior. I want to identify the difference between these two groups (what features are ...

**0**

votes

**0**answers

20 views

### Multivariate One-Tailed Tests

I know when performing hypothesis tests like the z-test or the t-test for one variable, the one-tailed test (when fitting, usually in the case of verifying claims of superiority) provides stricter ...

**0**

votes

**0**answers

30 views

### How to learn dependency of variables from data?

I have a data set $X$ that consists of $m$ vectors $\vec{x}$ of $n$ real-valued components. Each vector component lies within a corresponding predefined interval of valid values, which is the same for ...

**0**

votes

**0**answers

7 views

### What is the best way to convert a graded scale (A to G) to a numeric scale to be used in a composite index?

I'm creating a composite index and one of my indicators ranks countries in terms of grades (A, B, C, D, E, F, G). The grades come from a purely qualitative (but thorough) analysis which does not ...

**0**

votes

**0**answers

5 views

### How to identify the dissimilar points between multiple time series having almost similar patterns?

I have multiple time series that are quite similar to each other in terms of pattern. I clustered all of them to group similar time series under one cluster.
This is what the cluster looks like:
Now I'd ...

**0**

votes

**0**answers

9 views

### Likelihood ratio test and linear mixed effects regression

I have a data set which includes sex, age, and 5 polygenic scores as independent variables, with 16 dependent variables. I have constructed univariate linear mixed effects regression models and ...

**1**

vote

**1**answer

30 views

### oddstream error for get_pc_space() [closed]

I am trying out the 'oddstream' package for anomaly detection in a collection of time series data streams, following the example here - http://github.com/pridiltal/oddstream
I am now trying the same ...

**0**

votes

**1**answer

17 views

### What is the purpose of using cbind in R for multivariate regression?

I am trying to see if A1_IS (internalised stigma) predicts A1_CSI (couple satisfaction at time 1) and A2_CSI (couple satisfaction at time 2). So I think this is multivariate regression, right?
My R code:...

**0**

votes

**0**answers

8 views

### Can a predictor variable from one regression be the outcome variable in a multivariate-like regression?

I was hoping to get advice on what I believe might qualify as a multivariate regression analysis. The first equation is a rather simple linear multiple regression. However, one of the independent ...

**0**

votes

**0**answers

15 views

### Multivariate versus multivariable regression [closed]

I am doing research to predict if alcoholism is an independent risk factor for bleeding after surgery. I will perform a univariate analysis with a chi squared test to know if other factors, such as ...

**0**

votes

**1**answer

15 views

### Cointegration of order 2

Can we use the Johansen test of cointegration when we have 7 variables, of which 6 are I(2) and one is I(0)?

**0**

votes

**0**answers

10 views

### Pairwise Kendall's tau & upper tail dependence high dimension copula in R

In R's copula package, one can use the functions tau() and lambda() to get Kendall's tau and the upper and lower tail dependence ...

**1**

vote

**0**answers

22 views

### Which Standard Deviation formula & Standard Error formula to use to conclude an A/B test

Story:
Our site has a button whose existing color is green, while we think the color of the button should be blue. Around 100k people visit our site daily, and the CTR of the green button is 3%. So, ...
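
As a hedged illustration of the kind of calculation involved, the sketch below computes the binomial standard error of a click-through rate and a pooled two-proportion z statistic. The 3% CTR and 100k visitors come from the excerpt; the 50/50 traffic split and the 3.3% CTR for the blue button are assumptions made purely for illustration, and the function names are hypothetical.

```python
import math


def proportion_standard_error(p, n):
    """Standard error of a sample proportion under the binomial model."""
    return math.sqrt(p * (1.0 - p) / n)


def two_proportion_z(p_a, n_a, p_b, n_b):
    """z statistic for H0: both variants share one CTR, using the pooled proportion."""
    pooled = (p_a * n_a + p_b * n_b) / (n_a + n_b)
    se = math.sqrt(pooled * (1.0 - pooled) * (1.0 / n_a + 1.0 / n_b))
    return (p_b - p_a) / se


# CTR and daily traffic taken from the excerpt.
se_green = proportion_standard_error(0.03, 100_000)

# Assumed: traffic split evenly, and a hypothetical observed blue CTR of 3.3%.
z = two_proportion_z(0.03, 50_000, 0.033, 50_000)
print(se_green, z)
```

Whether a pooled z-test is the right tool depends on the experimental design, which the truncated excerpt does not fully describe.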

**2**

votes

**1**answer

62 views

**0**

votes

**1**answer

39 views

### Conditional multivariate distribution

Suppose Y = X + error, where the error follows a (multivariate) standard normal distribution. Can we find the conditional distribution P(Y|X) if we don't know the distribution of X? If yes, how to start the procedure? Is ...

**1**

vote

**0**answers

24 views

### Expressibility of VAR(1) models

Am I correct in understanding that vector autoregressive (VAR) models of order one can capture seemingly more general modeling frameworks such as VAR(p) models, for orders $p > 1$, and ARMA models?
...
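
The standard companion-form construction behind this idea can be sketched for the simplest case. The example below is a minimal sketch that assumes a scalar AR(2) process rather than a full vector model, with made-up coefficients and shocks: stacking $(y_t, y_{t-1})$ into one state vector turns the second-order recursion into a first-order one.

```python
def simulate_ar2(a1, a2, shocks, y0=0.0, y1=0.0):
    """Simulate y_t = a1*y_{t-1} + a2*y_{t-2} + e_t directly."""
    prev2, prev1 = y0, y1
    path = []
    for e in shocks:
        y = a1 * prev1 + a2 * prev2 + e
        path.append(y)
        prev2, prev1 = prev1, y
    return path


def simulate_companion(a1, a2, shocks, y0=0.0, y1=0.0):
    """Simulate the same process as z_t = F z_{t-1} + u_t with z_t = (y_t, y_{t-1})."""
    F = [[a1, a2], [1.0, 0.0]]  # companion matrix
    z = [y1, y0]                # state holds the two most recent values
    path = []
    for e in shocks:
        u = [e, 0.0]            # the shock enters the first equation only
        z = [F[0][0] * z[0] + F[0][1] * z[1] + u[0],
             F[1][0] * z[0] + F[1][1] * z[1] + u[1]]
        path.append(z[0])
    return path


# Hypothetical coefficients and shocks, chosen only for illustration.
shocks = [0.5, -1.0, 0.25, 0.0, 2.0, -0.75]
direct = simulate_ar2(0.5, 0.3, shocks)
companion = simulate_companion(0.5, 0.3, shocks)
print(all(abs(d - c) < 1e-12 for d, c in zip(direct, companion)))
```

The same stacking applies to VAR(p) with matrix-valued blocks; the ARMA case mentioned in the question needs a different state definition and is not covered by this sketch.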

**1**

vote

**1**answer

63 views

### Is there any difference between these two terms, Population and Probability Distribution?

I am trying to learn more about statistics and probability theory, but I am having trouble understanding some of the terms that I feel have the same or similar semantics, just different names. For ...

**0**

votes

**0**answers

16 views

### Build an index with weighted scores with Principal Component Analysis?

For my undergrad thesis I wanted to build a multidimensional index of financial inclusion that would include variables of access and use of financial services in the states of Mexico. I have a set of ...

**1**

vote

**0**answers

13 views

### Convergence rate of test-statistic to chi-square distribution

I know that to test whether $\Sigma=\Sigma_0$ against $\Sigma\ne\Sigma_0$ for an $n\times p$ data matrix, the test statistic is $np(a-1-\log g)$ where $a$ and $g$ are the AM and GM of the eigen ...

**0**

votes

**0**answers

19 views

### What is the necessary relationship between a confounding variable and related predictor?

My understanding of a confounder variable is that,
say response variable is y, predictor is x and confounder variable is z.
Running a simple linear regression y~x might show a spurious relationship.
...

**1**

vote

**0**answers

28 views

### What is Pillai's trace actually telling me?

I've read the site, the MANOVA section in a stats for n00bs book and Googled but I can't find this answer.
I believe I'm conducting a one-way MANOVA. I have one IV which consists of two groups. I ...

**0**

votes

**1**answer

24 views

### Best way to find correlation between categorical response variables and continuous explanatory variables

Sorry if this is a really simple question, but I'm very new to multivariable statistics and I'm trying to find a best method to deal with my ecological dataset.
I recorded the environmental ...

**0**

votes

**0**answers

13 views

### Suitable similarity measure for zero-rich preference data

I am performing some statistical analysis on preference data (-10 for strong dislike, 0 for neutral, 10 for strong like, let's assume all floats in [-10,10] are allowed) for 30 different food items ...

**0**

votes

**0**answers

23 views

### Is the coefficient of determination defined for higher dimensional dependent variables?

The coefficient of determination has a clear definition for scalar dependent variables. See for instance the definition here: http://en.wikipedia.org/wiki/Coefficient_of_determination
Can this ...

**1**

vote

**1**answer

16 views

### How is conditioning on a variable different than specifying nested random effects

How is conditioning on a variable similar to or different from using a mixed effect model?
I am a biologist. I'm analyzing my data. I have body sites nested within individuals. We're interested in ...

**0**

votes

**0**answers

16 views

### Which multivariate statistical method to use for qualitative AND quantitative data?

I'm just starting to learn about different types of multivariate statistics, but I'm having trouble figuring out which method is appropriate for my data.
My data is a mixture of quantitative and ...

**0**

votes

**0**answers

12 views

### What statistical test can be used to test if multiple samples of data originate from the same underlying distributions?

I have a data-set with several independent variables,
and my dependent variable can take only positive values (it looks somewhat like a gamma distribution).
For example - let's say the data is similar ...

**0**

votes

**1**answer

80 views

### Time series for each customer to predict time to leave?

I am a beginner in the domain of forecasting and I was wondering if such a problem could be solved with time series analysis:
given customer historical data of taxi pickups, along with the weather
...

**0**

votes

**0**answers

17 views

### Multiple Linear Regression from scratch?

I am using multiple linear regression for my Python project to predict prices of used cars. I want to do this from scratch and not rely on any libraries to do this for me. I cannot find any material ...
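
One library-free way to approach this is to solve the normal equations $(X^\top X)\beta = X^\top y$ by Gaussian elimination. The sketch below is a minimal illustration under that assumption; the function name `fit_ols`, the two predictors (age and mileage), and the toy numbers are all hypothetical and not taken from the question.

```python
def fit_ols(X, y):
    """Least-squares coefficients (intercept first) from the normal equations,
    solved by Gaussian elimination with partial pivoting."""
    rows = [[1.0] + [float(v) for v in r] for r in X]  # prepend intercept column
    k = len(rows[0])
    # Augmented matrix [X'X | X'y].
    A = [[sum(r[i] * r[j] for r in rows) for j in range(k)]
         + [sum(r[i] * yi for r, yi in zip(rows, y))]
         for i in range(k)]
    # Forward elimination.
    for col in range(k):
        pivot = max(range(col, k), key=lambda idx: abs(A[idx][col]))
        if abs(A[pivot][col]) < 1e-12:
            raise ValueError("X'X is singular; predictors are collinear")
        A[col], A[pivot] = A[pivot], A[col]
        for r in range(col + 1, k):
            factor = A[r][col] / A[col][col]
            for c in range(col, k + 1):
                A[r][c] -= factor * A[col][c]
    # Back substitution.
    beta = [0.0] * k
    for i in reversed(range(k)):
        s = sum(A[i][j] * beta[j] for j in range(i + 1, k))
        beta[i] = (A[i][k] - s) / A[i][i]
    return beta


# Hypothetical toy data: (age in years, mileage in 1000 km) -> price in 1000s,
# generated from an exact linear rule so the fit can be checked by eye.
X = [(1, 10), (2, 25), (3, 30), (4, 60), (5, 55)]
y = [20.0 - 1.5 * age - 0.05 * km for age, km in X]
beta = fit_ols(X, y)
print(beta)  # should be close to the generating coefficients 20, -1.5, -0.05
```

Forming $X^\top X$ explicitly is simple but can be numerically fragile with many or highly correlated predictors; QR-based solvers are the usual remedy in practice.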

**0**

votes

**0**answers

21 views

### Dealing with non-positive-definite covariance matrix (possible numeric issue)

I'm generating random numbers from a multivariate skew normal distribution. Here is my code:
...

**0**

votes

**0**answers

20 views

### How can I correct or interpret a negative but insignificant error correction term?

The variables that are used for cointegration are I(1). I got the error correction term as negative, but it is not significant. So how should I proceed with the results?

**2**

votes

**0**answers

21 views

### Does classical multidimensional scaling reconstruct data up to a rotation when there's no noise and the embedding dimension equals the original data dimension?

In the setup of classical multidimensional scaling (MDS), let $D:=[d_{ij}]$ be an $n \times n$ distance matrix, i.e. $d_{ii}=0$ and $d_{ij}=d_{ji} > 0$ for all $i \ne j$, $i, j = 1 \dots n$. Assume that: ...

**1**

vote

**0**answers

19 views

### How to organize data and batches for many sets of time series data for a recurrent neural network? [closed]

For example, I have data from 2010-01-01 to 2020-01-01, but I have many sets of similar data with the same features that I want to train on. I would like clarification: if every dataset has the same length (say 10 ...

**3**

votes

**1**answer

43 views

### What information is needed to estimate the Joint PDF of two rv?

I've been reading multiple posts on the site about estimating Joint PDF and I understood why you can't estimate it from marginals. I also understand the use of Copulas. But curiously a lot of users ...

**0**

votes

**0**answers

60 views

### I need help choosing the appropriate model for structured time series

My hypothesis was that individuals born within 20 different populations could be differentiated based on the concentration of 7 substances that reflect the environment of those populations. The ...

**1**

vote

**0**answers

34 views

### Combining estimates from multivariate multiple regression using MICE in R?

A similar question was posted last year, but it didn't get any love, so!:
I'm trying to calculate a pooled estimate after fitting a multivariate regression in multiply imputed data (having used MICE ...

**0**

votes

**0**answers

13 views

### Best model for investigating which independent variables might affect the dependent variable the most

I have a dataset with almost 70000 rows and 8 columns (variables). The variables are district codes, drg-points (Diagnosis-related group), treatment days, cost for each visit and a few other dummy ...

**1**

vote

**1**answer

20 views

### How to measure smoothness of inputs over outputs?

I know similar questions have been asked for time series data. But my question is a little bit different.
Consider that we have input dataset $X \in R^{N \times M}$, where $M$ is the dimension of ...

**0**

votes

**0**answers

21 views

### Forecasting a multivariate time series with few observations

I am trying to forecast the number of confirmed cases for several days (1, 3, a week) of a virus with the following data:
...

**1**

vote

**1**answer

27 views

### Understanding Multidimensional Mutual Information

Given random variables $\vec{x}, \vec{y} \in \mathbb{R}^n$, and the mutual information, defined as
$I(\vec{x} : \vec{y}) = H(\vec{x}) + H(\vec{y}) - H(\vec{x}, \vec{y})$
is it true that
$I(\vec{x}: ...

**0**

votes

**0**answers

33 views

### Stepwise Regression

I have some data that I want to fit a log-linear model to (using R). The data in this dataset are categorical and y is the frequency. I use the glm function with family=poisson. I first fit the ...

**1**

vote

**0**answers

21 views

### How to deal with IFs or thresholds in a regression problem

I'm trying to do a regression on a continuous variable, but the conditions are as such:
$Y = B_0 + B_1X_1 + B_2X_2$
with something like
$Y = \operatorname{IF}((1.2 * X_1) > (2 * X_2 + 0.5 * X_3 ...

**0**

votes

**1**answer

36 views

### Distribution of ratio of $\bar X' \bar X$ and $\operatorname{tr}(S)$

$X_{n\times p}\sim N_p(\mu,\Sigma)$ and $S$ is the sample variance-covariance matrix.
I was able to find out that $$(\bar X-\mu)^T (\bar X-\mu)/\operatorname{tr}(S)\sim{1\over n-1}F_{p,np-p}$$
since ...

**1**

vote

**1**answer

51 views

### log-linear modelling: transforming y variable

I am conducting a study on graphical log-linear modelling and my aim is to fit a log-linear model to data.
I am using RStudio to carry out the analysis and I am using the glm function.
When first ...

**1**

vote

**0**answers

20 views

### Looking for advice on conceptualizing and modeling this type of data

In a hypothetical experiment, let's assume that there are 30 lakes and I suspect that there is a difference in their chemistry. To test this, I randomly sample 10 rocks from each lake and measure the ...

**0**

votes

**0**answers

16 views

### Jacobian transformation

Let $J_1$ be the Jacobian for transforming $(x_1,x_2,...,x_n) \to (y_1,y_2,...,y_n)$ and let $J_2$ be the Jacobian for transforming $(y_1,y_2,...,y_n) \to (x_1,x_2,...,x_n)$; then prove that $...

**1**

vote

**1**answer

36 views

### Inverse moment of Multivariate Normal Norm

Let $x \sim N(\mu, \Sigma)$ with $\mu \in \mathbb{R}^P$; $\Sigma \in \mathbb{R}^{P\times P}$, positive definite. Denote by $||x||_2$ the vector two norm, that is, $||x||_2 = \sqrt{x^\top x}$.
What ...

**0**

votes

**0**answers

20 views

### Conditional probability involving black-box functions

Let X be a continuous random variable subject to a given probability density function. Let $f: \mathbb{R} \rightarrow \mathbb{R}$ and $g: \mathbb{R} \rightarrow \mathbb{R}$ be two black-box functions, ...