English 中文(简体)
如果没有现有栏目,则区域数据框架从现有第一栏或NA中设立新的栏目。
原标题:R data frame create new column from first existing column or NA if no existing columns

如无现有栏目,则以下职能在数据框架中的第一个现有栏目即

library(dplyr, warn.conflicts = FALSE)
#> Warning: package  dplyr  was built under R version 4.2.3

# `dplyr::coalesce` return fist non-missing, but these columns must exist

return_exist_var <- function(.var1, .var2, .fill = NA, .data) {
  # get the . from magrittr::%>%
  .data <- get(".", parent.frame())
  stopifnot(is.data.frame(.data))
  .var1 <- rlang::enexpr(.var1)
  .var2 <- rlang::enexpr(.var2)
  if (rlang::as_string(.var1) %in% names(.data)) {
    return(rlang::parse_expr(rlang::as_string(.var1)))
  } else if (rlang::as_string(.var2) %in% names(.data)) {
    return(rlang::parse_expr(rlang::as_string(.var2)))
  } else {
    return(.fill)
  }
}

df <- data.frame(
  ID = c("ID01", "ID01", "ID02", "ID02"),
  v1 = c("a", "b", "c", NA),
  v2 = c("A", "B", "C", "C")
)

return first existing column

df %>% dplyr::mutate(NEW = !!return_exist_var(v1))
#>     ID   v1 v2  NEW
#> 1 ID01    a  A    a
#> 2 ID01    b  B    b
#> 3 ID02    c  C    c
#> 4 ID02 <NA>  C <NA>

return first existing column in two column

df %>% dplyr::mutate(NEW = !!return_exist_var(v2, v1))
#>     ID   v1 v2 NEW
#> 1 ID01    a  A   A
#> 2 ID01    b  B   B
#> 3 ID02    c  C   C
#> 4 ID02 <NA>  C   C

return second existing column in two column

df %>% dplyr::mutate(NEW = !!return_exist_var(v1_x, v2))
#>     ID   v1 v2 NEW
#> 1 ID01    a  A   A
#> 2 ID01    b  B   B
#> 3 ID02    c  C   C
#> 4 ID02 <NA>  C   C

return NA if two column not eixst

df %>% dplyr::mutate(NEW = !!return_exist_var(v1_X, v2_X))
#>     ID   v1 v2 NEW
#> 1 ID01    a  A  NA
#> 2 ID01    b  B  NA
#> 3 ID02    c  C  NA
#> 4 ID02 <NA>  C  NA

work well on grouped data frame

df %>%
  dplyr::group_by(ID) %>%
  dplyr::mutate(NEW = !!return_exist_var(v1))
#> # A tibble: 4 × 4
#> # Groups:   ID [2]
#>   ID    v1    v2    NEW  
#>   <chr> <chr> <chr> <chr>
#> 1 ID01  a     A     a    
#> 2 ID01  b     B     b    
#> 3 ID02  c     C     c    
#> 4 ID02  <NA>  C     <NA>

Problem

如何更新职能,然后将职能称为“无<条码>!

df %>% dplyr:mutate(NEW=回报_exist_var(v1)

问题回答

下面是你职能的简化版本,其依据是使用<代码>coalesce(>>和any_of(>处理缺失的var。 由于any_of()要求具有病媒特性,因此,在使用前必须加以区别。

library(dplyr)

return_exist_var <- function(...) {
  var_list <-  match.call(expand.dots = FALSE)$...
  do.call(coalesce, c(NA, pick(any_of(sapply(var_list, deparse)))))
}

df %>%
  dplyr::mutate(NEW = return_exist_var(v1))

    ID   v1 v2  NEW
1 ID01    a  A    a
2 ID01    b  B    b
3 ID02    c  C    c
4 ID02 <NA>  C <NA>

df %>%
  dplyr::mutate(NEW = return_exist_var(v2, v1))

    ID   v1 v2 NEW
1 ID01    a  A   A
2 ID01    b  B   B
3 ID02    c  C   C
4 ID02 <NA>  C   C

df %>%
  dplyr::mutate(NEW = return_exist_var(v1_x, v2))

    ID   v1 v2 NEW
1 ID01    a  A   A
2 ID01    b  B   B
3 ID02    c  C   C
4 ID02 <NA>  C   C

df %>% 
  dplyr::mutate(NEW = return_exist_var(v1_X, v2_X))

    ID   v1 v2 NEW
1 ID01    a  A  NA
2 ID01    b  B  NA
3 ID02    c  C  NA
4 ID02 <NA>  C  NA

# Works if data is grouped
df %>%
  mutate(NEW = return_exist_var(v1), .by = ID)

    ID   v1 v2  NEW
1 ID01    a  A    a
2 ID01    b  B    b
3 ID02    c  C    c
4 ID02 <NA>  C <NA>




相关问题
How to plot fitted model over observed time series

This is a really really simple question to which I seem to be entirely unable to get a solution. I would like to do a scatter plot of an observed time series in R, and over this I want to plot the ...

REvolution for R

since the latest Ubuntu release (karmic koala), I noticed that the internal R package advertises on start-up the REvolution package. It seems to be a library collection for high-performance matrix ...

R - capturing elements of R output into text files

I am trying to run an analysis by invoking R through the command line as follows: R --no-save < SampleProgram.R > SampleProgram.opt For example, consider the simple R program below: mydata =...

R statistical package: wrapping GOFrame objects

I m trying to generate GOFrame objects to generate a gene ontology mapping in R for unsupported organisms (see http://www.bioconductor.org/packages/release/bioc/vignettes/GOstats/inst/doc/...

Changing the order of dodged bars in ggplot2 barplot

I have a dataframe df.all and I m plotting it in a bar plot with ggplot2 using the code below. I d like to make it so that the order of the dodged bars is flipped. That is, so that the bars labeled "...

Strange error when using sparse matrices and glmnet

I m getting a weird error when training a glmnet regression. invalid class "dgCMatrix" object: length(Dimnames[[2]]) must match Dim[2] It only happens occasionally, and perhaps only under larger ...

Generating non-duplicate combination pairs in R

Sorry for the non-descriptive title but I don t know whether there s a word for what I m trying to achieve. Let s assume that I have a list of names of different classes like c( 1 , 2 , 3 , 4 ) ...

Per panel smoothing in ggplot2

I m plotting a group of curves, using facet in ggplot2. I d like to have a smoother applied to plots where there are enough points to smooth, but not on plots with very few points. In particular I d ...

热门标签