Persistent storage / Data serialization in research environment

Loading [MathJax]/jax/output/HTML-CSS/config.js

QuantConnect Community Discussions

QUANTCONNECT COMMUNITY

Join Our Discord Channel

Join QuantConnect's Discord server for real-time support, where a vibrant community of traders and developers awaits to help you with any of your QuantConnect needs.

Draft Discussions

Bookmarked Discussions

Persistent storage / Data serialization in research environment

Started by - January 2021

Persistent storage / Data serialization in research environment

User suggests adding persistent storage for preprocessed data in research environment, to avoid regenerating data and save computational resources.

Share New Research

Start New Discussion Sign up

SEARCH DISCUSSIONS

373,900 Quants.

Become a Quant

VOTE FOR UPCOMING FEATURES

Share your input and vote on our future direction.

LEAN Roadmap

JOIN OUR Community MAILING LIST

Create an account on QuantConnect for the latest community delivered to your inbox.

Persistent storage / Data serialization in research environment

Oldrich S | January 2021

Is there any option I missed or any plan to implement peristent storage for own preprocessed/generated data in research environemnt (it would be great if it can be accesed from alogs too)?

In research phase there is quite common pattern, that you will preprocess data first (e.g. generate custom higher frequency data from minute data, or some ML features..) and then work with them many times.

This preprocessing often takes long time (tens of minutes, hours). So in local environment it's good idea to serialize them (e.g. to csv, parquet, database) and work with them repeatedly.

I didn't see anything like this in Quantopian.

It's quite waste of (computational and human) resources when you generate same data many times, so I believe it woudl be of great use for many users.

Something like encapsulated S3 like storage for pandas dataframes would be ideal (and cheap enough for reasonable amount of data, relatively to price of computational resources..)

Author

Oldrich S

January 2021

Upvote

Author:

January 2021

Platform

Radically Open-Source Algorithmic Trading Engine

Join Our Discord Channel

Quarterly Open-Source Trading Competition

Draft Discussions

Bookmarked Discussions

SEARCH DISCUSSIONS

373,900 Quants.

VOTE FOR UPCOMING FEATURES

JOIN OUR Community MAILING LIST

IN THIS RESEARCH

PARTICIPANTS

Actions

Join QuantConnect for Free

Platform

SIGN IN

Radically Open-Source Algorithmic Trading Engine

Join Our Discord Channel

Quarterly Open-Source Trading Competition

Draft Discussions

Bookmarked Discussions

SEARCH DISCUSSIONS

373,900 Quants.

VOTE FOR UPCOMING FEATURES

JOIN OUR Community MAILING LIST

IN THIS RESEARCH

PARTICIPANTS

SHARE RESEARCH

SHARE DISCUSSION

SHARE ARTICLE

SHARE

Actions

Join QuantConnect for Free