A Detailed Case Study on Crypto Multi-factor Risk Analysis

Introduction
In recent years, cryptocurrencies have garnered significant attention from researchers and inventors, among other financial investments. Many investment firms have been investing in and maintaining a strong portfolio of cryptos. More than 1,500 crypto currencies are being actively traded by individual and institutional investors worldwide across different exchanges. Over 170 cryptocurrencies focussed hedge funds, have emerged since 2017.
This report as a part of Desights.ai Data Challenge (OCEAN and Numer.ai) explores cryptocurrency investment strategies by adapting the robust framework of multi-factor investing, traditionally applied in equity markets, to the distinctive landscape of cryptocurrency assets.

  1. We conduct an in-depth examination of prominent cryptocurrencies from 2018 to 2024 (in some cases – 2020/21), employing models such as (i) Fama–MacBeth regression method, (ii) Fama-French 3 and 4-factor model, (iii) Carhart 4-factor model, (iv) GARCH, (v) CAPM (single factor) and (vi) Machine Learning-based regressions (LSTM, PCA based Statistical Risk Analysis) to assess the predictive capabilities of market, size, value, and momentum factors, adjusted for the unique characteristics of the cryptocurrency market.
  2. We extract and identify other factors such as Macroeconomic factors (Comparison with NASDAQ100, S&P500, CPI) and Social Media factors (Google Trends, Wikipedia page visits) along with other indirect factors to identify the correlation between the cryptocurrency market and their volatility, returns, risks and sentiment.

At the core of this study is the idea that there may be predictability in returns arising from systematic inconsistencies. This research introduces factors specific to the cryptocurrency realm, investigating their connection to market irregularities. With Bitcoin often serving as the benchmark currency on many trading platforms, this study suggests a re-examination of conventional methods for evaluating factor portfolios, proposing a shift in investors’ perspectives to accommodate this unique market characteristic.

Need for this analysis
There is a need to analyse the cryptocurrency market from the empirical rule-based approach for at least two reasons.

  • The first reason is to understand whether the returns of cryptocurrencies share similarities with other asset classes, most importantly, with equities.
  • The second reason is that to assess and develop theoretical models of cryptocurrency, it is meaningful to build an empirical model to be used as stylized facts and inputs. Since there is no simple universal framework to construct a crypto portfolio unlike the equity market, we, therefore, propose to create a factor model for cryptocurrencies.
  • The factor model has been traditionally used in the equity markets to decompose the assets return and risk. (e.g., CAPM, Fama-French, Carhart-4-factor, BARRA), so it could also provide a paradigm to analyze such patterns in the cryptocurrency market.

Report Link: Click here

Datasets Link:

  1. CoinGecko.com: Collected data from Coin Gecko (https://www.coingecko.com/en). Coin Gecko has information on more than 6900 coins from over 400 exchanges and has daily data on prices, volume, and market capitalization (in dollar terms). Out of which, collected the Top-1000 cryptocurrencies based on Market Cap for this analysis.

Data Link: CoinGecko Dataset

  1. CryptoCompare: This database is used to download aggregated and exchange level OHLC pricing and volume cryptocurrency data each day

Data Link: CryptoCompare Dataset

  1. IntoTheBlock.com database, which is used to source information on blockchain activity, such as the number of new addresses and the number of active unique addresses.
  2. CCXT: Here we use the ccxt library and list current exchanges supported by ccxt and identify arbitrage opportunities with the shortest and longest chains. Downloaded Binance Dataset using exchange = ccxt.binanceus()
4 Likes

great work @datahunter
I read your report and your point of view in solving this challenge is amazing !!!

2 Likes

hope to see you in top places on the leaderboard :100: :crossed_fingers:

1 Like

Visualisations and EDA:
Picture 1
Picture 2
Picture 3
Picture 4
Picture 5

2 Likes