R - Create columns based on the values in another data frame
  • Time: 2023-06-03 02:49:56
  • Tags:
  • r
  • rstudio

I have a data frame called df with two columns: user and artist.

head(df, 5)
                                  user               artist
1 3bd73256-3905-4f3a-97e2-8b341527f805      betty blowtorch
2 f2fb0ff0-5679-42ec-a55c-15109ce6e320            die Ärzte
3 b3ae82c2-e60b-4551-a76d-6620f1b456aa    melissa etheridge
4 3d6bbeb7-f90e-4d10-b440-e153c0d10b53            elvenking
5 bbd2ffd7-17f4-4506-8572-c1ea58c3f9a8 juliette & the licks

I want to create another data frame in which each distinct value of the artist column gets its own column, filled with 1 if that row's user ID has an entry for that artist and 0 otherwise. Here is an example of the result I am trying to get:

                                  user betty blowtorch die Ärzte melissa etheridge elvenking juliette & the licks
1 3bd73256-3905-4f3a-97e2-8b341527f805               1         0                 0         0                    0
2 3bd73256-3905-4f3a-97e2-8b341527f805               0         1                 0         0                    0
3 f2fb0ff0-5679-42ec-a55c-15109ce6e320               0         1                 0         0                    0
4 b3ae82c2-e60b-4551-a76d-6620f1b456aa               0         0                 1         0                    0
5 3d6bbeb7-f90e-4d10-b440-e153c0d10b53               0         0                 0         1                    0
6 bbd2ffd7-17f4-4506-8572-c1ea58c3f9a8               0         0                 0         0                    1
7 bbd2ffd7-17f4-4506-8572-c1ea58c3f9a8               0         0                 0         1                    0

If a user ID appears in df multiple times, I need a separate row for each occurrence, so that each row contains exactly one 1 (see the example above). Any suggestions?
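
For reference, the transformation described above could be sketched in base R roughly as follows. This is only a minimal sketch, not a definitive solution: the toy df uses shortened, made-up user IDs ("u1", "u2", "u3") in place of the real ones, and the names artists, indicator and result are introduced here purely for illustration. It keeps one output row per row of df, so a user who appears several times gets several rows, each with a single 1.

```r
# Toy data shaped like df in the question. The user IDs are shortened,
# hypothetical stand-ins for the real UUIDs.
df <- data.frame(
  user   = c("u1", "u1", "u2", "u3"),
  artist = c("betty blowtorch", "die Ärzte", "die Ärzte", "elvenking"),
  stringsAsFactors = FALSE
)

# Distinct artists, in order of first appearance; these become the columns.
artists <- unique(df$artist)

# outer() compares every row's artist with every distinct artist, giving a
# logical matrix with one row per row of df. Adding 0L converts it to 0/1.
indicator <- outer(df$artist, artists, FUN = "==") + 0L
colnames(indicator) <- artists

# check.names = FALSE keeps names such as "juliette & the licks" unchanged.
result <- data.frame(user = df$user, indicator, check.names = FALSE)
print(result)
```

Because the comparison is done row by row, no aggregation per user takes place. If one row per user were wanted instead, a contingency table via table(df$user, df$artist) would be the more natural starting point.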

Answers

No answers yet



