QUANTCONNECT COMMUNITY

No Results

Join Our Discord Channel

Join QuantConnect's Discord server for real-time support, where a vibrant community of traders and developers awaits to help you with any of your QuantConnect needs.

Quarterly Open-Source Trading Competition

The Open-Quant League is a quarterly competition between universities and investment clubs for the best-performing strategy. The previous quarter's code is open-sourced, and competitors must adapt to survive.

pending review This research is under review. To publish this research attract three community upvotes.

Draft Discussions

Bookmarked Discussions

Share New Research

Start New Discussion Sign up

SEARCH DISCUSSIONS

TOP 5 Research PUblications

About Quant League

The Open-Quant League is a quarterly competition between universities and investment clubs for the best performing strategy. Previous quarter's code is open-sourced, and competitors must adapt to survive.

competition rules

See the competition code of conduct and rules for participation in prizes.

Read Rules

previous competitions

Browse strategies and organization entries from previous quarter's competitions.

STRATEGY

333,200 Quants.

Become a Quant

VOTE FOR UPCOMING FEATURES

Share your input and vote on our future direction.

LEAN Roadmap

Create an account on QuantConnect for the latest delivered to your inbox.

Machine Learning for Equity Price Trend Prediction

Coming from Python and being relatively new in C#, I thought it would be helpful to have an example strategy that utilizes the Accord.NET machine learning library. Since I couldn't find any good examples, I coded one myself.

This strategy trains a Linear SVM (Support Vector Machine) with historical returns. Then at the open of the market, it attempts to predict whether the market will close UP or DOWN. If the trend is predicted to be UP, we enter a LONG position. If the trend is DOWN, we exit the market. If we are already in a LONG position, we do nothing. As mentioned, I am quite new to C#, so there might be bugs and there is certainly room for improvement. Would be nice to discuss possible improvements/ideas here.

Update Backtest

person upvoted this people upvoted this

Gene Wildhart

| |

Accepted Answer

Update Backtest

Notebook

The material on this website is provided for informational purposes only and does not constitute an offer to sell, a solicitation to buy, or a recommendation or endorsement for any security or strategy, nor does it constitute an offer to provide investment advisory services by QuantConnect. In addition, the material offers no opinion with respect to the suitability of any security or specific investment. QuantConnect makes no guarantees as to the accuracy or completeness of the views expressed in the website. The views are subject to change, and may have become unreliable for various reasons, including changes in market conditions or economic circumstances. All investments involve risk, including loss of principal. You should consult with an investment professional before making any investment decisions.

Jared Broad

STAFF ,

Thats awesome Gene, I've never used the Accord framework much. That is some pretty intense for looping :D Maybe @MichaelH could apply some LINQ magic to it to make it easier to follow. Essentially you're building the input I wonder if it uses random numbers to initialize the SVM? For me it jitters to different solutions each time. It would be nice if we could standardize those initialization parameters so its more predictable.

Michael Handschuh

50.5k ,

Hey @Gene! Thanks for sharing this awesome SVM example! Maybe we can work on producing the training data sets easier. What kind of functions would you use in Python to perform the same operations. I could (probably) write some C# functions that behave very similarly to the Python functions you're used to. C# has LINQ which is a very powerful, monadic library of functions for iteratable sequences. Very cool stuff! I went to take a stab at reducing some of those loops into (possibly) more readable LINQ statements, but without knowing the exact intent it was kind of hard (comments!). I surmised that we're producing bit arrays indicating whether we went up or down based on the close values in the rolling window, three results per day. Maybe what we need is a RollingWindow>, sounds crazy, but what if when we added a new sample it would recursively add it to its children with a specified offset, I think then it may be very easy to define minimally the input training data. Awesome stuff, thanks for sharing!

Gene Wildhart

11.5k ,

Hi @MichaelH, I guess I should have been a little more clear about what is going on in the code. I actually do use a rolling window to store daily data. Here is a run-down of what is happening: The input to the SVM consists of an inputSize x trainSize array. In this simple example, I use change in the daily return. A +1 indicates an positive change, where a -1 indicates a negative change. First I build the 2-D input array: Every day, I build an Inputsize number of past daily historical returns (3 in the default case). I do this for a rolling trainSize number of past days (30 by default), so I end up with a 2-D array that is fed into the SVM. I then build the 1-D output array: Starting with today, and rolling backward trainSize number of times. What I end up with is something like the following: Input Array: t-1, t-2, t-3 t-2, t-3, t-4 t-3, t-4, t-5 ... Output: t t-1 t-2 ... So the inputs (t-1,t-2,t-3) should be used to train the SVM to predict the return at time t. Finally, when I go to make an actual prediction, I feed in the data (t,t-1,t-2) and try to predict (t+1) tomorrow's change. In Python/Pandas, building the inputs and outputs for training the SVM is very easy, and I do something like the following:


# tslag gets the shifted price inputs
for i in range (0,inputSize):
    tslag["Lag%s" % str(i+1)] = df["Close Price"].shift(i+1)
# now tslag holds the returns (ie. pct change)
tslag = tslag.pct_change()

# tsout will hold the returns and be the target for the SVM
tsout = df["Close Price"].pct_change();

Unfortunately, my C# skills are not up to snuff, so I'm not sure how to do the above in C# w/o lots of looping.

Dmitri Pavlenkov

8.7k ,

Hi Gene, Thank you for sharing your idea. Your idea gave me some of my own. I think it's too much to ask an SVM to predict next bar direction from past bars alone. I believe a more profitable approach is to train SVM to validate setups. Suppose you have an awesome setup, like MA cross up: MA crosses up, buy, stop loss 1R, take profit 2R (or use some signal to exit). Obviously, it doesn't always work. So you train machine to validate it. This setup has indicator data points that make it fire (MA separation, MACD, a few past bars, day of month, month of year, whatever feels right), and after it completes it produces it's own data point : success or failure. These data points you train the machine on after each setup completes. It's a bit more work because first these setups need to simulate, and trade only after the machine is trained. That's what I'm going to do. Maybe it's worth something. Dmitri

Michael Handschuh

50.5k ,

Hey @Gene, Thanks for the walk through on the input/outputs of the system. That's in line with what I thought it was doing. Hopefully we'll get some python support into LEAN sooner than later for you guys, in the mean time I may try and whip together some functions to make some things easier for you guys. Hey @Dmitri, Interesting concept! I would love to see what you come up. Your idea actually inspired another idea for myself. This would require a fair amount of processing power, but in short, what if you had a genetic algorithm building the setup and an SVM to validate the setup.

Gene Wildhart

11.5k ,

So I realized the stop-loss function was not working as intended in the original code because I was using the Liquidate() function, which apparently closes the position at the End of Day instead of immediately. I've fixed the stoploss, and now the code seems to function better.

Dmitri Pavlenkov

8.7k ,

Hey @Gene, there's a problem with how you store returns. Arrays are passed by reference, so you need to move double[] returns = new double[inputSize]; under

for (int i=0;i

otherwise, all your inputs will be the same.


        
            
            
                 
                
                    Dmitri Pavlenkov
                
                

                
                    8.7k
                                        ,
                
            
            I played with SVM predictions, but didn't want to share this project because it's pretty rough. But since I'm now concentrating on more basic stuff like position management, I'm sharing this in hope it can help those who are walking this path. SVM learning has potential, but I think it requires some preparation to be used profitably.
                            
                    
                    
                        
                        
                            
                        
                    
                
                                    
            
                
                
            
        
        
            
            
                 
                
                    Robert Graves
                
                

                
                    160
                                        ,
                
            
            Thanks for sharing this Gene.  Let's say I use Accord.MachineLearning to create an SVM trained with a few years worth of data through an external program I write.  After training, I save this SVM to a file.  Is it possible to upload my trained SVM to QuantConnect and use it from within my algorithm?  So far, I've only been able to create new code files.
                                    
            
                
                
            
        
        
            
            
                 
                
                    Michael Handschuh
                
                

                
                    50.5k
                                        ,
                
            
            I'm not sure what kind of format the data would have, but you could certainly make a constant string that represents the data and save it in a .cs file.const string SvmData = @"1,2,1,0,2,-1"
And then reference it from your algorithm and use it to hydrate your SVM instance.
                                    
            
                
                
            
        
        
            
            
                 
                
                    Jared Broad
                
                

                
                    STAFF
                                        ,
                
            
            @Robert you can also import data using WebClient which is a C# class. I've posted a new QC University algorithm "QCU How Do I Import My Training Data Sets?" which has this snippet below:
using(var wc = new WebClient()) 
                {
                    //Point the web client to your own data store.
                    _data = wc.DownloadString("https://www.google.com");
                }

                            
                    
                    
                        
                        
                            
                        
                    
                
                                        
                    
                    The material on this website is provided for informational purposes only and does not constitute an offer to sell, a solicitation to buy, or a recommendation or endorsement for any security or strategy, nor does it constitute an offer to provide investment advisory services by QuantConnect. In addition, the material offers no opinion with respect to the suitability of any security or specific investment. QuantConnect makes no guarantees as to the accuracy or completeness of the views expressed in the website. The views are subject to change, and may have become unreliable for various reasons, including changes in market conditions or economic circumstances. All investments involve risk, including loss of principal. You should consult with an investment professional before making any investment decisions.
                
                        
            
                
                
            
        
        
            
            
                 
                
                    Robert Graves
                
                

                
                    160
                                        ,
                
            
            @MichaelH,
    Good point.  Yes for a trained SVM I could export the weights and store them in a class file maybe as an array of doubles.  But for other trained models it would be more difficult.  Many of Accord's models only support saving/loading to a file or stream.  I suppose it would be possible to serialize the stream to a base64 encoding and store that in the class file, but I'd prefer to avoid that.

Jared,
    Thanks, yes I can make that work.  To be clear, I don't want to load the data and retrain the model on every initialization.  I just want to use the trained model in the algorithm.  I can post my serialized model on google drive or dropbox, make the file public, load it using the webclient during initialize, and finally let Accord load it from the webclient stream.
                                    
            
                
                
            
        
        
            
            
                 
                
                    Dmitri Pavlenkov
                
                

                
                    8.7k
                                        ,
                
            
            EDIT: this won't work because accord uses binary formatter to serialize. Sorry
Hi @Robert,

you can convert string to bytes and write them to memory stream, or just create memory stream from bytes:


const string data = "A string with international characters: Norwegian: ÆØÅæøå, Chinese: ? ??";
var bytes = System.Text.Encoding.UTF8.GetBytes(data);

var stream = new System.IO.MemoryStream(bytes);

                                    
            
                
                
            
        
        
            
            
                 
                
                    James Smith
                
                

                
                    2.5k
                                        ,
                
            
            I've found that more recent versions of Accord have a serializer class which is more flexible:
http://accord-framework.net/docs/html/T_Accord_IO_Serializer.htm
I've been toying with the accord libraries, so thanks for sharing. One thing I'd like to do is incremental leaning, which I'm not sure Accord supports? It appears my only option would be to add new results to the learning set and completely rebuild the svm. Does anyone know of an alternative approach or library for incremental learning?
                                    
            
                
                
            
        
        
            
            
                 
                
                    Petter Hansson
                
                

                
                    10.5k
                                        ,
                
            
            I usually just retrain everything with a certain lookback period (so when enough has filled up, the old data outside lookback gets truncated).
In some cases you can do incremental fitting in Accord by just running a single (or otherwise few) learning iterations on your model with a new piece of data, e.g. logistic regression can do this. However, then most of the fitting is most likely on the most recent data and there's not a lot of control over this.
Can't use Accord in QC cloud atm due to class load error but that's of course no problem if you're running locally. It's a shame because Accord is the only whitelisted library with SVM.
                                    
            
                
                
            
        
        
            
            
                 
                
                    Petter Hansson
                
                

                
                    10.5k
                                        ,
                
            
            After googling a bit incremental learning of SVM is supposedly possible but difficult, and I haven't seen support in Accord for it.
                                    
            
                
                
            
        
        
            
            
                 
                
                    James Smith
                
                

                
                    2.5k
                                        ,
                
            
            Yes I've found some obscure options for incremental but think I will miss the rich features of accord. Besides, I expect there may be diminishing returns in prediction accuracy as the lookback grows. There may be a threading solution to getting adequate backtest performance with frequent model retraining.
                                    
            
                
                
            
        
        
            
            
                 
                
                    Petter Hansson
                
                

                
                    10.5k
                                        ,
                
            
            One thing I've considered is to simply have models retrain on a background thread on increasingly extensive data (e.g. increasing lookback) until a deadline or when the main thread submits a new data set. That is similar to iterative deepening concept in game AI. However, I'm typically wary of doing something in a backtest that will work differently when running live (e.g. a backtest with once a day retraining would quickly cut off training, whereas live version would probably reach maximum lookback).
And yes, the lookback is a hyperparameter that's likely to have a large impact on the model's accuracy in practice on live data, and what's worse, in most cases the best lookback varies over time...
                                    
            
                
                
            
        
        
            
            
                 
                
                    Petter Hansson
                
                

                
                    10.5k
                                        ,
                
            
            It would probably be possible to do like this however: Retrain model with fixed lookback, first time, wait for it to finish on main thread, after that, let main thread continue with the oldest finished model (just update the next training set). So in a backtest one would be using outdated models with probably worse performance than the live version which would have more recently trained models.
                                    
            
                
                
            
        
        
            
            
                 
                
                    James Smith
                
                

                
                    2.5k
                                        ,
                
            
            Have to agree that a background learning task is a minefield. In theory a less delayed lookback will lead to more accurate prediction, but it might equally be that the more recent signals are the noise of indecision that comes before a significant move. It's only standing on steady ground to have backtest behaviour that you're confident will reproduce. The problem is that regardless of the trade frequency relearning a significant set is simply too slow to backtest. Of course caching is an option, but then one tweak here or there and you need to rebuild your model cache.
                                    
            
                
                
            
        
        
        
    

    
    
    
        
             
            
            
                
            
            
            
            
            
            
                
                    Gene Wildhart
                    INVESTOR
                
                
                    
                    
                     | 
                    
                    
                                                        
            
            
                
                
                    
                    Permalink

                                    
            
        
        
        
        
            
            
                 Update Backtest 
            
                
                    
                        Project
                        
                            
                        
                    
                    
                        Backtest
                        
                            
                        
                    
                
            
            
                
                    
                    
                        
                        
                            
                        
                    
                
            
            
            

            

            
            
            
            
 

        

        
            
            
                
                
                    
                
            
        

        
            
            
                
                    
                        
                         Notebook
                    
                
                
                    
                
            
            
                
                
                    
                
            
        

        
            
        

        
            
            The material on this website is provided for informational purposes only and does not constitute an offer to sell, a solicitation to buy, or a recommendation or endorsement for any security or strategy, nor does it constitute an offer to provide investment advisory services by QuantConnect. In addition, the material offers no opinion with respect to the suitability of any security or specific investment. QuantConnect makes no guarantees as to the accuracy or completeness of the views expressed in the website. The views are subject to change, and may have become unreliable for various reasons, including changes in market conditions or economic circumstances. All investments involve risk, including loss of principal. You should consult with an investment professional before making any investment decisions.
        
        
            
                
                    
                        Reply
                    
                
                
                    
                    
                
            
            
                
                
                
                
                 person upvoted this
                
                
                 people upvoted this
                
            
            
                
                
                    
                        
                            
                        
                    
                    
                        
                            
                            
                                
                                 |
                                
                                
                            
                        
                        
                        
                            
                            
                        
                        
                            
                            
                            
                            
                             person upvoted this
                            
                            
                             people upvoted this
                            
                        
                    
                
                
                
                    
                    
                        
                        
                    
                
            
        

        
        
            
            
        
    
    
    
    
        Loading...




    
        1
2
    




    
    
    
    
        Load More

        
    
    
    
    



            
            
                To unlock posting to the community forums please complete at least 30% of Boot Camp.
 You can
                continue your Boot Camp training progress from the terminal. We
                hope to see you in the community soon!


    
    

        
        
            Organization
            
                
            
            
                
                Organization Website
            
            
            
                
                    
                      Update Competition
                
                
            
            Team
            
                
                    
                        
                    
                    
                    
                
            
            Show More
            
        

        
            Clone Strategy
            Copy this strategy code to your QuantConnect account and deploy it live with your brokerage.
            Clone
            
        

        
            Previous Ranking
            Browse strategies and organization entries from previous quarter's competitions.
            
                
                    
                    
                
            
            
        
        
        

        
            
                 
            
            
                Author: 
                
                
            
        
        

        
            IN THIS RESEARCH
            
                
            
        
        

        
        
            PARTICIPANTS
            
            
                
                
                    
                
            
            
        
        
        

        
        
            Discussion Awards
            
                
                    
                    
                
            
        
        
        

        
        
            SHARE RESEARCH
            SHARE DISCUSSION
            SHARE ARTICLE
            SHARE
            
                
                
                
            
        
        
        
            Actions
            
                
                View in Strategy Explorer
            

            
                
                Award Discussion
            

            
                
                    
                    
                
                
            

            
                
                    
                    
                
                
            

            
                
                Comments














    
        
            
                
                    
                
                
                    
                    
                    
                        
                            What is an Award?


    
        
    
        
            Research
            Announcements
            Lean
            
        
        
            
                COMMUNITY
                COMMUNITY FEED
            
                        |

 




            
            Join QuantConnect for Free
            
        
    

    
        
    
        
            
            
                QuantConnect™ 2024. All Rights Reserved
            
        
        
            
                
                    
                        Technology
                    
                        Algorithm Lab
                        Documentation
                        Research
                        Build vs. Buy
                        Tutorials
                        Data Library
                        Learning Articles
                        System Status
                        Settings
                        Discussions List
                    
                
                
                    
                        Company
                    
                        About
                        Affiliates
                        Our Blog
                        Contact
                        Pricing
                        Integration Partners
                        Terms & Conditions
                        Privacy Policy
                    
                
                
                    
                        
                            LEAN
                    
                    
                        
                            
                                
                                    
                                    Fork
                                
                                
                                    3,000                                
                            
                        
                        
                            
                                
                                    
                                    Star
                                
                                
                                    10,000                                
                            
                        
                    
                
            
        
    

    



    
    
        
            
            
                QuantConnect™ 2024. All Rights Reserved
            
        
        
            
                
                    
                        Technology
                    
                        Algorithm Lab
                        Documentation
                        Research
                        Build vs. Buy
                        Tutorials
                        Data Library
                        Learning Articles
                        System Status
                        Settings
                        Discussions List
                    
                
                
                    
                        Company
                    
                        About
                        Affiliates
                        Our Blog
                        Contact
                        Pricing
                        Integration Partners
                        Terms & Conditions
                        Privacy Policy
                    
                
                
                    
                        
                            LEAN
                    
                    
                        
                            
                                
                                    
                                    Fork
                                
                                
                                    3,000                                
                            
                        
                        
                            
                                
                                    
                                    Star
                                
                                
                                    10,000

Platform

Radically Open-Source Algorithmic Trading Engine

Join Our Discord Channel

Quarterly Open-Source Trading Competition

Draft Discussions

Bookmarked Discussions

SEARCH DISCUSSIONS

TOP 5 Research PUblications

About Quant League

competition rules

previous competitions

333,200 Quants.

VOTE FOR UPCOMING FEATURES

Machine Learning for Equity Price Trend Prediction

Organization

Team

Clone Strategy

Previous Ranking

IN THIS RESEARCH

PARTICIPANTS

Discussion Awards

Actions

Join QuantConnect for Free

Platform

SIGN IN

Radically Open-Source Algorithmic Trading Engine

Join Our Discord Channel

Quarterly Open-Source Trading Competition

Draft Discussions

Bookmarked Discussions

SEARCH DISCUSSIONS

TOP 5 Research PUblications

About Quant League

competition rules

previous competitions

333,200 Quants.

VOTE FOR UPCOMING FEATURES

Machine Learning for Equity Price Trend Prediction

Organization

Team

Clone Strategy

Previous Ranking

IN THIS RESEARCH

PARTICIPANTS

Discussion Awards

SHARE RESEARCH

SHARE DISCUSSION

SHARE ARTICLE

SHARE

Actions

Join QuantConnect for Free