Welcome to the data.table project!
Fast subset, fast grouping, fast assign, fast ordered joins and list columns in a short and flexible syntax, for faster development.
DT[X]
is a fast join for large data (DT
and X
both type data.table
).
DT[,sum(b*c),by=a]
is fast aggregation (a
, b
and c
are column names).
DT[i,b:=3.14]
is fast sub-assignment (to column b
) by reference.
DT[,p:=x/sum(x),by=group]
to add a new column by reference by group.
fread("big.csv")
reads 50MB in 3 seconds vs 30 for read.csv("big.csv")
vs 10 for read.csv("big.csv",colClasses,nrow,etc)
.
10+ times faster than tapply()
100+ times faster than ==
500+ times faster than DF[i,j]<-value
Latest stable release: 1.9.2 on CRAN , 27 Feb 2014
User reviews , data.table wiki and Stack Overflow data.table tag
THE LATEST VERSION HAS NOW MOVED TO GITHUB
Scroll down on the GitHub page for install instructions and latest news. Known bugs and feature requests are now on GitHub.
Or, if that fails, the last precompiled .zip for Windows copied to this page may suffice: v1.9.3 r1200 28-Feb-2014
Vignettes (latest revisions):
FAQs
10 minute quick start introduction
Reproducible timings
Presentations:
2014.10.30 data.table talk, Matt Dowle, SQL Relay, London
2014.10.16 data.table talk, Matt Dowle, SQL Relay, Cardiff
2014.09.15 SOLD OUT - Four hour tutorial and talk, Matt Dowle and Arun Srinivasan, EARL Conference Agenda , London
2014.07 data.table talk , Matt Dowle, R/Insurance London [Agenda ]
2014.07 data.table : fast and flexible data manipulation (talk) , Matt Dowle, useR! Los Angeles [Agenda ]
2014.06 Three hour tutorial , Matt Dowle, useR! Los Angeles
2014.05 One hour tutorial , Matt Dowle, R/Finance Chicago [Agenda ]
2013.12 Introduction and news from 1.8.11 , Matt Dowle and Arun Srinivasan, Köln R User Group
2013.05 Lightning introduction , Matt Dowle, R/Finance Chicago
2013.05 One hour advanced tutorial , Matt Dowle, R/Finance Chicago
2013.02 Data Tables: An introduction (and pitch) , Gene Leynes, ChicagoR. Online examples
2012.06 News from data.table 1.6, 1.7 and 1.8 , Matt Dowle, LondonR
2012.03 Advanced analytics with R and SAP HANA (slides 22-24), Jitender Aswani and Jens Doerpmund. Code snippets on their blog
2011.06 Data munging with R , Jim Holtman, CinDayR
2011.06 The data.table package , Fernando Figueiredo
2010.07 News from data.table 1.4 and 1.5 , Matt Dowle, LondonR
2010.01 Data munging with SQL and R , Joshua Reich
2009.07 Higher speed time series queries , Matt Dowle, LondonR
Browse datatable-help on Nabble ,
Gmane ,
HTML Archive or
RSS feed
Please subscribe to datatable-help before posting (there is an option to turn off mail delivery). More information here.