
Table 1 Works that were cited at least ten times, with count, year, and citation

From: A preliminary review of influential works in data-driven discovery

Count  Year  Citation
63     2008  MapReduce (Dean and Ghemawat 2008)
51     2009  Fourth paradigm (Hey et al. 2009)
43     2009  Elements of statistical learning (Hastie et al. 2009)
30     2001  Initial sequencing of the human genome (Lander et al. 2001)
24     1948  A mathematical theory of communication (Shannon 2001)
23     2000  Sloan Digital Sky Survey (York et al. 2000)
20     1990  BLAST (Altschul et al. 1990)
19     1996  Lasso (Tibshirani 1996)
19     2003  Latent Dirichlet allocation (Blei et al. 2003)
17     1977  EM algorithm (Dempster et al. 1977)
17     1995  Support vector networks (Cortes and Vapnik 1995)
15     2001  Random forests (Breiman 2001)
14     2006  Pattern recognition (Bishop et al. 2006)
14     1998  Anatomy of web search engine (Brin and Page 1998)
13     2007  Numerical recipes (Press 2007)
11     1979  Bootstrap methods (Efron 1979)
11     1953  Equation of state calculations (Metropolis et al. 1953)
11     1977  Exploratory data analysis (Tukey 1977)
11     1988  Probabilistic reasoning (Pearl 1988)
10     1999  PageRank (Page et al. 1999)
10     2013  Bayesian data analysis (Gelman et al. 2013)
10     2009  Unreasonable effectiveness of data (Halevy et al. 2009)