book
Checkout our new book! Hands on AI Trading with Python, QuantConnect, and AWS Learn More arrow

Quiver Quantitative

WallStreetBets

Introduction

The WallStreetBets dataset by Quiver Quantitative tracks daily mentions of different equities on Reddit’s popular WallStreetBets forum. The data covers 6,000 Equities, starts in August 2018, and is delivered on a daily frequency. The dataset is created by scraping the daily discussion threads on r/WallStreetBets and parsing the comments for ticker mentions.

This dataset depends on the US Equity Security Master dataset because the US Equity Security Master dataset contains information on splits, dividends, and symbol changes.

For more information about the WallStreetBets dataset, including CLI commands and pricing, see the dataset listing.

About the Provider

Quiver Quantitative was founded by two college students in February 2020 with the goal of bridging the information gap between Wall Street and non-professional investors. Quiver allows retail investors to tap into the power of big data and have access to actionable, easy to interpret data that hasn’t already been dissected by Wall Street.

Getting Started

The following snippet demonstrates how to request data from the WallStreetBets dataset:

Select Language:
self.aapl = self.add_equity("AAPL", Resolution.DAILY).symbol
self.dataset_symbol = self.add_data(QuiverWallStreetBets, self.aapl).symbol

self._universe = self.add_universe(QuiverWallStreetBetsUniverse, self.universe_selection)

Data Summary

The following table describes the dataset properties:

PropertyValue
Start DateAugust 2018
Asset Coverage6,000 US Equities
Data DensitySparse
ResolutionDaily
TimezoneUTC

Example Applications

The WallStreetBets dataset enables you to create strategies using the latest activity on the WallStreetBets daily discussion thread. Examples include the following strategies:

  • Trading any security that is being mentioned
  • Trading securities that are receiving more/less mentions than they were previously
  • Trading the security that is being mentioned the most/least for the day

For more example algorithms, see Examples.

Data Point Attributes

The WallStreetBets dataset provides QuiverWallStreetBets and QuiverWallStreetBetsUniverse objects.

QuiverWallStreetBets Attributes

QuiverWallStreetBets objects have the following attributes:

QuiverWallStreetBetsUniverse Attributes

QuiverWallStreetBetsUniverse objects have the following attributes:

Requesting Data

To add WallStreetBets data to your algorithm, call the add_data method. Save a reference to the dataset Symbol so you can access the data later in your algorithm.

Select Language:
class QuiverWallStreetBetsDataAlgorithm(QCAlgorithm):
    def initialize(self) -> None:
        self.set_start_date(2019, 1, 1)
        self.set_end_date(2020, 6, 1)
        self.set_cash(100000)

        self.aapl = self.add_equity("AAPL", Resolution.DAILY).symbol
        self.dataset_symbol = self.add_data(QuiverWallStreetBets, self.aapl).symbol

Accessing Data

To get the current WallStreetBets data, index the current Slice with the dataset Symbol. Slice objects deliver unique events to your algorithm as they happen, but the Slice may not contain data for your dataset at every time step. To avoid issues, check if the Slice contains the data you want before you index it.

Select Language:
def on_data(self, slice: Slice) -> None:
    if slice.contains_key(self.dataset_symbol):
        data_points = slice[self.dataset_symbol]
        for data_point in data_points:
            self.log(f"{self.dataset_symbol} mentions at {slice.time}: {data_point.mentions}")

To iterate through all of the dataset objects in the current Slice, call the get method.

Select Language:
def on_data(self, slice: Slice) -> None:
    for dataset_symbol, data_points in slice.get(QuiverWallStreetBets).items():
        for data_point in data_points:
            self.log(f"{dataset_symbol} mentions at {slice.time}: {data_point.mentions}")

Historical Data

To get historical WallStreetBets data, call the history method with the dataset Symbol. If there is no data in the period you request, the history result is empty.

Select Language:
# DataFrame
history_df = self.history(self.dataset_symbol, 100, Resolution.DAILY)

# Dataset objects
history_bars = self.history[QuiverWallStreetBets](self.dataset_symbol, 100, Resolution.DAILY)

For more information about historical data, see History Requests.

Universe Selection

To select a dynamic universe of US Equities based on WallStreetBets data, call the add_universe method with the QuiverWallStreetBetsUniverse class and a selection function.

Select Language:
def initialize(self) -> None:
    self.universe = self.add_universe(QuiverWallStreetBetsUniverse, self.universe_selection)
        
def universe_selection(self, alt_coarse: List[QuiverWallStreetBetsUniverse]) -> List[Symbol]:
    return [d.symbol for d in alt_coarse if d.mentions > 100  and d.rank < 100]

For more information about dynamic universes, see Universes.

Universe History

You can get historical universe data in an algorithm and in the Research Environment.

Historical Universe Data in Algorithms

To get historical universe data in an algorithm, call the history method with the Universe object and the lookback period. If there is no data in the period you request, the history result is empty.

Select Language:
# DataFrame example where the columns are the QuiverWallStreetBetsUniverse attributes: 
history_df = self.history(self._universe, 30, Resolution.DAILY, flatten=True)

# Series example where the values are lists of QuiverWallStreetBetsUniverse objects: 
universe_history = self.history(self._universe, 30, Resolution.DAILY)
for (univere_symbol, time), bets in universe_history.items():
    for bet in bets:
        self.log(f"{bet.symbol} mentions at {bet.end_time}: {bet.mentions}")

Historical Universe Data in Research

To get historical universe data in research, call the universe_history method with the Universe object, a start date, and an end date. This method returns the filtered universe. If there is no data in the period you request, the history result is empty.

Select Language:
# DataFrame example where the columns are the QuiverWallStreetBetsUniverse attributes: 
history_df = qb.universe_history(universe, qb.time-timedelta(30), qb.time, flatten=True)

# Series example where the values are lists of QuiverWallStreetBetsUniverse objects: 
universe_history = qb.universe_history(universe, qb.time-timedelta(30), qb.time)
for (univere_symbol, time), bets in universe_history.items():
    for bet in bets:
        print(f"{bet.symbol} rank at {bet.end_time}: {bet.rank}")

You can call the history method in Research.

Remove Subscriptions

To remove a subscription, call the remove_security method.

Select Language:
self.remove_security(self.dataset_symbol)

If you subscribe to WallStreetBets data for assets in a dynamic universe, remove the dataset subscription when the asset leaves your universe. To view a common design pattern, see Track Security Changes.

Example Applications

The WallStreetBets dataset enables you to create strategies using the latest activity on the WallStreetBets daily discussion thread. Examples include the following strategies:

  • Trading any security that is being mentioned
  • Trading securities that are receiving more/less mentions than they were previously
  • Trading the security that is being mentioned the most/least for the day

For more example algorithms, see Examples.

You can also see our Videos. You can also get in touch with us via Discord.

Did you find this page helpful?

Contribute to the documentation: