English 中文(简体)
如何找到有 NA 列中最低值和最高值之间的差值
原标题:How to find the difference between the lowest and highest values of a column with NA
  • 时间:2024-05-12 00:32:35
  •  标签:
  • r

我如何计算最大值和最小值之间的差数, 按行计算, 当有充裕的 NA 时, 每一列的差数?

假设这就是我的数据

   a   b   c  d   e



A  1   NA  3  NA  NA



B  0   NA  6  NA  2



C  3   2   1  NA  3



C  NA  NA  3  NA  0

对于每一列,我想找到值最高的行与值最低的行之间的差异 -- -- 结果应该是这样的:

3 0 5 NA 3

我有数百个栏目需要分析, 所以我需要一个容易处理的代码(例如:我不需要手动输入栏名) 。

任何帮助都会感激不尽!

问题回答

使用 apply 的方法。 检查所有 < NA 值, 使用 ifelse 防止 - Inf 的出现, 来源是 < min / / < max

sapply(df, (x) ifelse(all(is.na(x)), NA, max(x, na.rm=T) - min(x, na.rm=T)))
 a  b  c  d  e 
 3  0  5 NA  3 

Data

df <- structure(list(a = c(1L, 0L, 3L, NA), b = c(NA, NA, 2L, NA), 
    c = c(3L, 6L, 1L, 3L), d = c(NA, NA, NA, NA), e = c(NA, 2L, 
    3L, 0L)), class = "data.frame", row.names = c("A", "B", "C", "D"))
a <- matrix(c(1,   NA,  3,  NA,  NA, 0,   NA,  6,  NA , 2, 3 ,  2 ,  1 , NA,  3, NA , NA,  3  ,NA,  0), nrow=4, ncol=5, byrow=T)
apply(a, 1, function(x) max(x, na.rm=T)-min(x, na.rm=T)) # difference by row
apply(a, 2, function(x) max(x, na.rm=T)-min(x, na.rm=T)) # difference by column

您可以看到 - 参见 - in case where where a difference 无法计算差异。 要改变此行为, 请使用以下代码 :

apply(a, 1, function(x) ifelse(max(x, na.rm=T)-min(x, na.rm=T) == -Inf, NA, max(x, na.rm=T)-min(x, na.rm=T))) # difference by row
apply(a, 2, function(x) ifelse(max(x, na.rm=T)-min(x, na.rm=T) == -Inf, NA, max(x, na.rm=T)-min(x, na.rm=T))) # difference by column




相关问题
How to plot fitted model over observed time series

This is a really really simple question to which I seem to be entirely unable to get a solution. I would like to do a scatter plot of an observed time series in R, and over this I want to plot the ...

REvolution for R

since the latest Ubuntu release (karmic koala), I noticed that the internal R package advertises on start-up the REvolution package. It seems to be a library collection for high-performance matrix ...

R - capturing elements of R output into text files

I am trying to run an analysis by invoking R through the command line as follows: R --no-save < SampleProgram.R > SampleProgram.opt For example, consider the simple R program below: mydata =...

R statistical package: wrapping GOFrame objects

I m trying to generate GOFrame objects to generate a gene ontology mapping in R for unsupported organisms (see http://www.bioconductor.org/packages/release/bioc/vignettes/GOstats/inst/doc/...

Changing the order of dodged bars in ggplot2 barplot

I have a dataframe df.all and I m plotting it in a bar plot with ggplot2 using the code below. I d like to make it so that the order of the dodged bars is flipped. That is, so that the bars labeled "...

Strange error when using sparse matrices and glmnet

I m getting a weird error when training a glmnet regression. invalid class "dgCMatrix" object: length(Dimnames[[2]]) must match Dim[2] It only happens occasionally, and perhaps only under larger ...

Generating non-duplicate combination pairs in R

Sorry for the non-descriptive title but I don t know whether there s a word for what I m trying to achieve. Let s assume that I have a list of names of different classes like c( 1 , 2 , 3 , 4 ) ...

Per panel smoothing in ggplot2

I m plotting a group of curves, using facet in ggplot2. I d like to have a smoother applied to plots where there are enough points to smooth, but not on plots with very few points. In particular I d ...