This week it seemed like a good idea to take a look at some of the most highly cited AI papers. Back during week 81 of this journey into writing on Substack, I looked at some of the most highly cited ML papers [1]. I was expecting a lot more overlap, but was pleasantly surprised by the differences. One of the papers really stood out based on its total number of citations, and it’s up first. Intellectually I can accept that a paper has more than 100,000 citations, but in practice that is an awful lot of references for an academic paper to accumulate, and it represents a degree of asynchronous interaction between researchers that helps bring the intellectual space called the academy to life.
Papers with over 100,000 citations:
Kingma, D. P., & Ba, J. (2014). Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980. https://arxiv.org/pdf/1412.6980.pdf
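To give a sense of what is being cited so heavily, the core of the Adam paper is a single update rule combining momentum-style first-moment and RMSProp-style second-moment estimates with bias correction. A minimal NumPy sketch of that update (the function name and the toy usage are my own, not from the paper):

```python
import numpy as np

def adam_step(theta, grad, m, v, t, lr=0.001, beta1=0.9, beta2=0.999, eps=1e-8):
    """One Adam update, per Kingma & Ba (2014)."""
    m = beta1 * m + (1 - beta1) * grad        # first-moment (mean) estimate
    v = beta2 * v + (1 - beta2) * grad ** 2   # second-moment (uncentered variance) estimate
    m_hat = m / (1 - beta1 ** t)              # bias correction for initialization at zero
    v_hat = v / (1 - beta2 ** t)
    theta = theta - lr * m_hat / (np.sqrt(v_hat) + eps)
    return theta, m, v

# Toy usage: minimize f(x) = x^2 starting from x = 1.0
theta, m, v = 1.0, 0.0, 0.0
for t in range(1, 1001):
    theta, m, v = adam_step(theta, 2 * theta, m, v, t)
```

The per-parameter scaling by the second-moment estimate is what makes Adam roughly step-size invariant across parameters, which is one plausible reason it became the default optimizer for so much subsequent work.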
Papers with over 50,000 citations:
Ren, S., He, K., Girshick, R., & Sun, J. (2015). Faster r-cnn: Towards real-time object detection with region proposal networks. Advances in neural information processing systems, 28. https://proceedings.neurips.cc/paper/2015/file/14bfa6bb14875e45bba028a21ed38046-Paper.pdf
Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., ... & Polosukhin, I. (2017). Attention is all you need. Advances in neural information processing systems, 30. https://proceedings.neurips.cc/paper/2017/file/3f5ee243547dee91fbd053c1c4a845aa-Paper.pdf
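The building block of the transformer paper is scaled dot-product attention, softmax(QKᵀ/√d_k)V. A minimal NumPy sketch of that single operation (a simplified single-head version; the real architecture stacks multi-head attention with projections):

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """Attention(Q, K, V) = softmax(Q K^T / sqrt(d_k)) V, per Vaswani et al. (2017)."""
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)                                   # scaled similarity
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))     # stable softmax
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ V                                                # weighted mix of values
```

Each query ends up with a convex combination of the value vectors, weighted by how closely it matches each key, which is why the mechanism is often described as a soft lookup table.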
Papers with over 20,000 citations:
Mnih, V., Kavukcuoglu, K., Silver, D., Rusu, A. A., Veness, J., Bellemare, M. G., ... & Hassabis, D. (2015). Human-level control through deep reinforcement learning. Nature, 518(7540), 529-533. https://daiwk.github.io/assets/dqn.pdf
Ioffe, S., & Szegedy, C. (2015, June). Batch normalization: Accelerating deep network training by reducing internal covariate shift. In International conference on machine learning (pp. 448-456). PMLR. http://proceedings.mlr.press/v37/ioffe15.pdf
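The batch normalization paper likewise reduces to a short transform: standardize each feature over the mini-batch, then rescale with learned parameters. A minimal NumPy sketch (inference-time running statistics are omitted for brevity):

```python
import numpy as np

def batch_norm(x, gamma, beta, eps=1e-5):
    """Batch-normalize features over the batch axis, per Ioffe & Szegedy (2015)."""
    mu = x.mean(axis=0)                    # per-feature batch mean
    var = x.var(axis=0)                    # per-feature batch variance
    x_hat = (x - mu) / np.sqrt(var + eps)  # standardized activations
    return gamma * x_hat + beta            # learned scale and shift
```

With gamma = 1 and beta = 0 the output has approximately zero mean and unit variance per feature, which is the stabilizing effect the paper credits for faster training.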
Papers with over 10,000 citations:
Silver, D., Huang, A., Maddison, C. J., Guez, A., Sifre, L., Van Den Driessche, G., ... & Hassabis, D. (2016). Mastering the game of Go with deep neural networks and tree search. Nature, 529(7587), 484-489.
Arjovsky, M., Chintala, S., & Bottou, L. (2017, July). Wasserstein generative adversarial networks. In International conference on machine learning (pp. 214-223). PMLR. http://proceedings.mlr.press/v70/arjovsky17a/arjovsky17a.pdf
Kipf, T. N., & Welling, M. (2016). Semi-supervised classification with graph convolutional networks. arXiv preprint arXiv:1609.02907. https://arxiv.org/pdf/1609.02907.pdf
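The GCN paper's contribution is also compact: a layer-wise propagation rule that mixes each node's features with its neighbors' over a symmetrically normalized adjacency matrix. A minimal NumPy sketch of one layer (the function name is my own; ReLU stands in for the paper's generic activation):

```python
import numpy as np

def gcn_layer(A, H, W):
    """One graph-convolution layer, H' = ReLU(D^-1/2 (A+I) D^-1/2 H W),
    following the propagation rule in Kipf & Welling (2016)."""
    A_hat = A + np.eye(A.shape[0])             # add self-loops
    D_inv_sqrt = np.diag(1.0 / np.sqrt(A_hat.sum(axis=1)))
    A_norm = D_inv_sqrt @ A_hat @ D_inv_sqrt   # symmetric normalization
    return np.maximum(A_norm @ H @ W, 0.0)     # linear transform + ReLU
```

Stacking a couple of these layers lets label information propagate a few hops across the graph, which is what enables the semi-supervised classification results in the paper.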
Without question, these citation levels indicate that the works are being read and actively referenced within the scholarly community. I’m sharing the citation numbers here to give you a sense of scale when considering the AI community and how many people are researching the things happening in this space. This is a crowded and vibrant corner of the academy, where a lot of time and effort is going into moving things along toward very real and deployable technology. Given the sheer volume of people working on these problems, it’s only a matter of time before somebody shouts “Eureka!” and we see practical deployments in production that influence our daily lives.
What would ChatGPT create?
If you were wondering what ChatGPT from OpenAI would have generated from the same prompt, then you are in luck. I generated that output over at https://chat.openai.com/chat by issuing the prompt.
Links and thoughts:
Top 5 Tweets of the week:
This week’s tweets came from @ant_pruitt, @ASPANational, @sarafischer, @ylecun, and @CaseyNewton. From @CaseyNewton:

> The thing is, all those risks Google is trying to avoid by moving slowly? OpenAI will soon have to answer for all of them. The provenance of its data set, issues related to copyright and plagiarism, the threat of misinformation and fraud — the scrutiny here is just beginning. A limited free beta will buy you a certain amount of patience and goodwill from regulators. Once you’re selling a product — and making many millions of dollars from it — that goodwill dissipates quickly.
>
> OpenAI has been a generally good actor here, taking trust and safety concerns seriously from the start and taking steps to limit the more obvious ways its technology can be misused. But it has also benefited significantly from the fact that it’s a relative unknown in consumer technology. Being a startup that operates mostly out of the spotlight has given it freedom to maneuver that its larger rivals lack.
>
> Offering ChatGPT for sale begins to change that equation.
Footnotes:
[1] Week 81 of The Lindahl Letter:
What’s next for The Lindahl Letter?
Week 108: Twitter as a company probably would not happen today
Week 109: Robots in the house
Week 110: Understanding knowledge graphs
Week 111: Natural language processing
Week 112: Autonomous vehicles
If you enjoyed this content, then please take a moment and share it with a friend. If you are new to The Lindahl Letter, then please consider subscribing. New editions arrive every Friday. Thank you and enjoy the year ahead.
Lindahl, N. (2023). The Lindahl letter: 104 Machine Learning Posts. Lulu Press, Inc. https://www.lulu.com/shop/nels-lindahl/the-lindahl-letter-104-machine-learning-posts/ebook/product-y244ep.html