Most popular data sources

Machine learning datasets maintained by our community to use for fun or practice. Download the ones you like.

24407 available data sources

Startup Database - AngelList - All 20 ...

angel.co
last ran at 2018-06-21

Companies

A list of 400 companies gathered from Angelist. Please note that if you do decide to copy the reci...

400
1
22
99

Vietlott

vietlott.vn
last ran at 2017-05-03

Sports and Betting

6
3
1
27

Instagram data source

instagram.com
last ran at 2018-07-29

People

39
3
11
23

MarketWatch: Stock Market News - Finan...

marketwatch.com
last ran at 2020-02-27

News

We monitor the news headline from this website which we then use to train our sentiment analysis e...

97
2,748
4
11

Financial Times - News Headlines

ft.com
last ran at 2020-02-27

News

We monitor the news headline from this website which we then use to train our sentiment analysis e...

76
2,745
4
9

Zillow properties with bad transaction...

zillow.com
last ran at 2017-06-06

Real Estate

We utilize this data source to train our price prediction model so as to identify undervalued real...

825,966
40
2
8

Companies | Crunchbase

crunchbase.com
last ran at 2018-06-04

50
1
2
8

Models - LegalPorno

legalporno.com
last ran at 2019-02-27

Adult

LegalPorno models

273
2
3
7

Reddit.com Breaking News - News Headli...

reddit.com
last ran at 2020-02-27

Entertainment, News

This data is used for performing sentiment analysis on publicly traded companies.

61
2,658
2
5

AR Crawl

app.anyguide.com
last ran at 2019-12-12

Travel

110
25
2
5

Top Things to Do in United States - Tr...

tripadvisor.com
last ran at 2019-02-23

Travel

Things to Do in United States, North America: See TripAdvisor's 18,173,021 traveler reviews and ph...

17,904
2
5

Craigslist - rooms for rent in San Fra...

sfbay.craigslist.org
last ran at 2020-02-27

Real Estate

Comps for determining fair value rental price

1,875
382
2
4

BBC News - news headlines

bbc.com
last ran at 2020-02-27

We monitor the news headline from this website which we then use to train our sentiment analysis e...

44
2,753
4

New York Stock Exchange [NYSE] - Daily...

eoddata.com
last ran at 2020-02-27

Finance

We plan to utilize this data source to detect sudden drop of more than 20% in stock prices within ...

3,105
655
4

Newsmax.com - Breaking news from aroun...

newsmax.com
last ran at 2020-02-27

News

We monitor the news headline from this website which we then use to train our sentiment analysis e...

28
2,737
1
3

Fox Business | Business News & Stock Q...

foxbusiness.com
last ran at 2020-02-27

News

We monitor the news headline from this website which we then use to train our sentiment analysis e...

181
2,736
1
3

Zomato: Restaurant data

zomato.com
last ran at 2020-02-11

Travel

27,209
7
1
3