Create empty dataframe in R with same columns

Question

 names(U1)

[1] "username"     "review_count" "forum_posts"  "age"          "avg_interval"
[6] "avg_sim"      "class"

So how do I create an empty data frame U1.RN that has the same columns as U1?

Can I ask why you need a 0-row data frame? Depending on what you are going to do with it, it might be more efficient to do things a different way (e.g. I hope you aren't planning on filling this row by row in a loop?)
– Gavin Simpson, Commented Nov 24, 2010 at 16:40
"e.g. I hope you aren't planning on filling this row by row in a loop?" - yeah, :(. What is the R-y way to do the equivalent of [pseudocode] for(i in 1:6000) if (pred.U1.nb.c[i]=='unlabeled') U1.RN[j++,]<-U1[i,] [/pseudocode], where pred.U1.nb.c is a vector I got from predict()? I want to create a data frame by selecting those rows of U1 that predict() marked as 'unlabeled'. (... trying hard to be verbose and not confusing simultaneously)
– Tathagata, Commented Nov 24, 2010 at 16:58
In R, preallocate your storage! You know you want a 6000-row data frame ahead of the loop, so create one and fill it in row by row. Or, even quicker, create a matrix of the correct dimension, fill that row by row, and then convert it to a data frame, as matrices are much faster to work with. If you want more help (it looks like you might not even need a loop, just some simple indexing and subsetting/insertion), can you start a new Q and provide a proper, small example of what you really want to do? If you do, I promise to look at it and have a go at an answer.
– Gavin Simpson, Commented Nov 24, 2010 at 17:18
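The loop sketched in the comment above can probably be replaced by a single logical subset, which seems to be the "simple indexing" being hinted at. A minimal sketch, assuming pred.U1.nb.c holds one prediction per row of U1; the tiny U1 and the prediction values below are made up for illustration and are not from the question:

```r
# Toy stand-ins for the real objects (hypothetical data)
U1 <- data.frame(
  username     = c("ann", "bob", "cat", "dan"),
  review_count = c(3L, 10L, 1L, 7L),
  stringsAsFactors = FALSE
)
pred.U1.nb.c <- c("spam", "unlabeled", "unlabeled", "ham")

# Logical mask: TRUE for rows predicted as "unlabeled".
# The !is.na() guard keeps NA predictions from producing all-NA rows.
keep <- !is.na(pred.U1.nb.c) & pred.U1.nb.c == "unlabeled"

# Select the matching rows in one step; no loop and no empty frame needed
U1.RN <- U1[keep, ]

print(U1.RN)
```

If no prediction matches, the same expression yields a zero-row data frame with the columns of U1, so the empty case needs no special handling.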

Joshua Ulrich · Accepted Answer · 2010-11-24 16:25:55Z

93

You can do this:

U1.RN <- U1[0,]
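To see what zero-row indexing keeps, here is a small sketch on a made-up data frame (the columns are hypothetical stand-ins for those of U1). Column names, column types, and factor levels all carry over. The last two lines show one thing to watch for: with a single-column data frame, drop = FALSE is needed to get a data frame back.

```r
U1 <- data.frame(
  username = c("ann", "bob"),
  age      = c(31L, 45L),
  class    = factor(c("spam", "ham")),
  stringsAsFactors = FALSE
)

U1.RN <- U1[0, ]

nrow(U1.RN)           # 0
names(U1.RN)          # same names as U1
sapply(U1.RN, class)  # same column classes as U1
levels(U1.RN$class)   # factor levels are retained

# Single-column case: the default drop behaviour collapses to a vector
one_col <- data.frame(x = 1:3)
is.data.frame(one_col[0, ])                # FALSE
is.data.frame(one_col[0, , drop = FALSE])  # TRUE
```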


2 Comments

Tathagata Over a year ago

Perfect .....

> U1.RN<-U1[0,]
> names(U1.RN)
[1] "username"     "review_count" "forum_posts"  "age"          "avg_interval"
[6] "avg_sim"      "class"
> nrow(U1)
[1] 6000
> nrow(U1.RN)
[1] 0

Gavin Simpson Over a year ago

+1 neat!!!!!!!! (the extra ! were to get round the min character limit, oh, wait, ... ;-)

dpel · Answer · 2018-05-24 15:41:39Z

12

Along the lines of df[0,], you can also use a logical mask, which might make the code more readable:

 df[FALSE,]

edited May 24, 2018 at 15:41 by dpel

answered Feb 16, 2017 at 19:46 by Uwe Mayer


Gregor Thomas · Answer · 2017-02-16 20:13:12Z

8

Using dplyr, there are a few good options:

slice(U1, 0)
filter(U1, FALSE)
filter(U1, NA)

The slice approach is probably clearest.

edited Feb 16, 2017 at 20:13 by Gregor Thomas

answered Jan 16, 2017 at 5:05 by joemienko

2 Comments

Gregor Thomas Over a year ago

filter(FALSE) also works and is maybe a little clearer in intent. slice(0) might be even better.

joemienko Over a year ago

Agreed that slice(0) seems to be a winner on readability/clarity

clp · Answer · 2024-07-08 12:23:37Z

0

For completeness, this resets an existing data frame to zero rows.

U1.RN <- U1
attributes(U1.RN)$row.names <- c()
# <0 rows> (or 0-length row.names)

Note that rownames(U1.RN) <- NULL "deletes" the row names and then fills them in with the default sequence, so it does not remove any rows.
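One caveat is worth checking before relying on this. Removing the row.names attribute appears to change only what the data frame reports about itself, while the column vectors keep their original length. A minimal sketch on a made-up three-row data frame (df, stripped and emptied are hypothetical names), compared with zero-row indexing:

```r
df <- data.frame(a = 1:3)

# Approach from this answer: drop the row.names attribute
stripped <- df
attributes(stripped)$row.names <- c()

nrow(stripped)       # number of rows the object reports
length(stripped$a)   # length of the underlying column vector

# Zero-row indexing for comparison
emptied <- df[0, , drop = FALSE]

nrow(emptied)
length(emptied$a)
```

If length(stripped$a) still comes back as 3 while nrow(stripped) reports 0, the object is internally inconsistent, and zero-row indexing is the safer way to get a genuinely empty data frame.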

edited Jul 8, 2024 at 12:23

answered Jul 8, 2024 at 10:41
