MathWorks - Mobile View
  • Melden Sie sich bei Ihrem MathWorks Konto anMelden Sie sich bei Ihrem MathWorks Konto an
  • Access your MathWorks Account
    • Eigener Account
    • Mein Community Profil
    • Lizenz zuordnen
    • Abmelden
  • Produkte
  • Lösungen
  • Forschung und Lehre
  • Support
  • Community
  • Veranstaltungen
  • MATLAB erhalten
MathWorks
  • Produkte
  • Lösungen
  • Forschung und Lehre
  • Support
  • Community
  • Veranstaltungen
  • MATLAB erhalten
  • Melden Sie sich bei Ihrem MathWorks Konto anMelden Sie sich bei Ihrem MathWorks Konto an
  • Access your MathWorks Account
    • Eigener Account
    • Mein Community Profil
    • Lizenz zuordnen
    • Abmelden

Videos und Webinare

  • MathWorks
  • Videos
  • Videos Homepage
  • Suche
  • Videos Homepage
  • Suche
  • Vertrieb kontaktieren
  • Testsoftware
  Register to watch video
  • Description
  • Full Transcript
  • Related Resources

Reinforcement Learning in Finance

Alex Roumi, MathWorks

This video shows an automated trader who is able to decide when to hedge a European call option contract to have a trade-off between transaction costs and hedging risk using reinforcement learning.

In this video I will be talking about building an automated trader that is able to decide when to hedge a European call option contract in the presence of transaction costs using reinforcement learning.

Think of hedging as home insurance, but in finance we use hedging to reduce risk from stock price movement.

The amount of stocks traded at each time step is calculated using delta from the Black-Scholes formula. Therefore, if the call option is for 100 shares of MLB stock, and delta is 0.1, the trader needs to short 10 shares of MLB.

In a real-world scenario, where transaction costs exist, it becomes critical, while observing the market, to know when to hedge during the life of the option to have this tradeoff between trading costs and hedging risk.

Let’s see how we apply reinforcement learning in hedging.

The agent is observing the financial market inputs like the stock price, time to maturity, the option price, and the stock holdings and takes an action of whether to hedge or not. If the agent hedges, the new stock holdings are -51 calculated using delta. Therefore, the agent has sold 6 stocks resulting in $3 in transaction costs. If the agent doesn’t hedge, the holdings remain the same.

After one period, the stock price moves to $99.40. The agent observes a total profit and loss resulting from the change of the stock price, the change of the option price, transaction costs paid, and a reward related to the total profit and loss.

In reinforcement learning, the agent will learn by trial and error to maximize the cumulative reward by choosing when to hedge during the life of the option.

The RL setup is composed of an agent and an environment. The environment sends a state to the agent, which takes an action in response. The agent will evaluate its last action based on the reward it returned. The loop keeps going until the environment sends a terminal state, say the maturity of the option, which ends the episode. After each episode the agent will learn to take actions that maximize the cumulative reward.

MATLAB made it easier to set up the environment and design the reinforcement learning components.

The reset function returns the initial state of the environment. It is called at the beginning of each training episode.

The step function specifies how the environment advances to the next state based on which action the agent takes.

The observations include the stock price, time to maturity, and the agent’s stock holdings.

The action is whether to hedge or not.

The observations, actions, and reset and step functions form the environment.

The agent consists of a policy and a reinforcement learning algorithm. The policy is a mapping function between observations and action. It can be a neural network designed by specifying the layers, activation functions, and neurons.

The reinforcement learning algorithm continuously updates the policy parameters and will find an optimal policy that maximizes the cumulative reward.

We train the agent for an hour and we can see that the agent has learned to maximize the cumulative reward over time.

As a result, a trained agent outperformed a trader who used delta hedging and another who decided not to hedge at all.

We simulate a random stock path, and when all periods are hedged, the loss is $120. The agent hedged 38 periods and didn’t trade for 12 times and the loss is $55.

Thank you for watching.

Related Products

  • Reinforcement Learning Toolbox
  • Deep Learning Toolbox
  • Financial Toolbox
  • Risk Management Toolbox
Related Information
Get Started with Reinforcement Learning Onramp

Feedback

Featured Product

Reinforcement Learning Toolbox

  • Request Trial
  • Get Pricing

Up Next:

55:12
MATLAB for R Users in Computational Finance

Related Videos:

51:53
Parallel Computing with MATLAB in Computational Finance
34:46
Using MATLAB for Real-Measure Calibration of Stochastic...
35:00
Hands-on Learning with MATLAB and Analog Discovery
45:19
Enabling Project-Based Learning with MATLAB, Simulink, and...

View more related videos

MathWorks - Domain Selector

Select a Web Site

Choose a web site to get translated content where available and see local events and offers. Based on your location, we recommend that you select: .

Select web site

You can also select a web site from the following list:

How to Get Best Site Performance

Select the China site (in Chinese or English) for best site performance. Other MathWorks country sites are not optimized for visits from your location.

Americas

  • América Latina (Español)
  • Canada (English)
  • United States (English)

Europe

  • Belgium (English)
  • Denmark (English)
  • Deutschland (Deutsch)
  • España (Español)
  • Finland (English)
  • France (Français)
  • Ireland (English)
  • Italia (Italiano)
  • Luxembourg (English)
  • Netherlands (English)
  • Norway (English)
  • Österreich (Deutsch)
  • Portugal (English)
  • Sweden (English)
  • Switzerland
    • Deutsch
    • English
    • Français
  • United Kingdom (English)

Asia Pacific

  • Australia (English)
  • India (English)
  • New Zealand (English)
  • 中国
    • 简体中文Chinese
    • English
  • 日本Japanese (日本語)
  • 한국Korean (한국어)

Contact your local office

  • Vertrieb kontaktieren
  • Testsoftware

Produkte

  • MATLAB
  • Simulink
  • Software für Studierende
  • Hardware-Unterstützung
  • File Exchange

Testen oder Kaufen

  • Downloads
  • Testsoftware
  • Vertrieb kontaktieren
  • Preise und Lizenzierung
  • Store

Lernen

  • Dokumentation
  • Tutorials
  • Beispiele
  • Videos und Webinare
  • Schulungen

Support

  • Hilfe zur Installation
  • Antworten
  • Consulting
  • License Center
  • Support kontaktieren

Über MathWorks

  • Jobs & Karriere
  • Newsroom
  • Social Mission
  • Vertrieb kontaktieren
  • Über MathWorks

MathWorks

Accelerating the pace of engineering and science

MathWorks ist der führende Entwickler von Software für mathematische Berechnungen für Ingenieure und Wissenschaftler.

Entdecken Sie…

  • Select a Web Site United States
  • Patente
  • Handelsmarken
  • Datenschutz
  • Datendiebstahl verhindern
  • Status von Anwendungen

© 1994-2021 The MathWorks, Inc.

  • Facebook
  • Twitter
  • Instagram
  • YouTube
  • LinkedIn
  • RSS

Folgen Sie uns