Most popular data sources

Machine learning datasets maintained by our community to use for fun or practice. Download the ones you like.

25146 available data sources

Startup Database - AngelList - All 20 ...

angel.co
last ran at 2018-06-21

Companies

A list of 400 companies gathered from Angelist. Please note that if you do decide to copy the reci...

400
1
7
49

Vietlott

vietlott.vn
last ran at 2017-05-03

Sports and Betting

6
3
1
10

Instagram data source

instagram.com
last ran at 2018-07-29

People

39
3
4
8

MarketWatch: Stock Market News - Finan...

marketwatch.com
last ran at 2019-07-17

News

We monitor the news headline from this website which we then use to train our sentiment analysis e...

87
1,849
3
8

VentureBeat - Tech news that matters -...

venturebeat.com
last ran at 2019-07-17

News

We monitor the news headline from this website which we then use to train our sentiment analysis e...

40
921
1
5

Zillow properties with bad transaction...

zillow.com
last ran at 2017-06-06

Real Estate

We utilize this data source to train our price prediction model so as to identify undervalued real...

825,966
40
1
4

AR Crawl

app.anyguide.com
last ran at 2019-06-08

Travel

87
21
1
4

Financial Times - News Headlines

ft.com
last ran at 2019-07-17

News

We monitor the news headline from this website which we then use to train our sentiment analysis e...

52
1,846
2
3

Craigslist - rooms for rent in San Fra...

sfbay.craigslist.org
last ran at 2019-07-17

Real Estate

Comps for determining fair value rental price

1,833
157
2
3

Models - LegalPorno

legalporno.com
last ran at 2019-02-27

Adult

LegalPorno models

273
2
2
3

Fox Business | Business News & Stock Q...

foxbusiness.com
last ran at 2019-07-17

News

We monitor the news headline from this website which we then use to train our sentiment analysis e...

28
1,837
1
3

Science News | Daily news articles, bl...

sciencenews.org
last ran at 2019-07-17

News

We monitor the news headline from this website which we then use to train our sentiment analysis e...

47
1,814
1
3

NCES - school database complete

nces.ed.gov
last ran at 2019-05-04

Education

11,729
5
1
3

IMDb Top 250 - IMDb

imdb.com
last ran at 2019-01-14

Entertainment

250
1
1
3

New York Stock Exchange [NYSE] - Daily...

eoddata.com
last ran at 2019-07-17

Finance

We plan to utilize this data source to detect sudden drop of more than 20% in stock prices within ...

3,133
430
3

Top Things to Do in United States - Tr...

tripadvisor.com
last ran at 2019-02-23

Travel

Things to Do in United States, North America: See TripAdvisor's 18,173,021 traveler reviews and ph...

17,904
2
3

New York Post - News Headlines

nypost.com
last ran at 2019-07-17

News

We monitor the news headline from this website which we then use to train our sentiment analysis e...

79
1,816
2
2

Reddit.com Breaking News - News Headli...

reddit.com
last ran at 2019-07-17

News, Entertainment

This data is used for performing sentiment analysis on publicly traded companies.

40
1,759
2
2

US Proxy List - Free Proxy List

us-proxy.org
last ran at 2019-07-09

200
15,880
1
2

CBC.ca - News Headlines

cbc.ca
last ran at 2019-07-17

News

We monitor the news headline from this website which we then use to train our sentiment analysis e...

31
1,842
1
2

BBC News - news headlines

bbc.com
last ran at 2019-07-17

We monitor the news headline from this website which we then use to train our sentiment analysis e...

39
1,854
2

Zillow - Phoenix Arizon - Tolleson

zillow.com
last ran at 2019-04-02

Real Estate

We utilize this data source to train our price prediction model so as to identify undervalued real...

18
866
2