Open Code Repository

Curated non-proprietary suptech code used by financial authorities


    Coming Soon

    We are working on enhancing your participation in building up the existing Open Code Repository. Please share your contact e-mail to stay informed of tool-related updates.

      Displaying 24 of 24 items
      Use Cases
      • AML / CFT / PF supervision
        • Assisted/automated examination
        • KYC/EDD assessment
        • Misconduct analysis
        • Suspicious activity detection
      • Consumer protection
        • Complaints analysis
        • Sentiment analysis
      • Digital assets / cryptos supervision
        • On-chain analysis
      • Licensing
        • Automated guidance
      • Prudential supervision
        • Automated report generation
        • Data handling
        • Early warning systems
        • Interdepartmental analysis
        • Microprudential supervision
        • Threshold monitoring
      • Securities supervision
        • Improved insights
      Technologies
      • Analytics
        • Descriptive/Diagnostic Analytics Tools
        • Prescriptive Analytics Tools
      • Collection
        • Application Programming Interfaces
        • Web portals or other document management
      • Processing
        • Advanced Text Processing
        • Automated validation errors and warnings integrated into data submission process
      • Storage
        • Big Data Tools
        • On-premise relational databases
      Licenses
      • Accuraface License
      • Apache
      • CDLA Permissive
      • CDLA Sharing
      • Creative Commons License CC0
      • Creative Commons Zero v1.0 Universal
      • GNU Affero General Public License v3 or later (AGPLv3+)
      • GNU General Public License
      • Microsoft
      • MIT
      Title Description Publisher Source(s) Additional Relevant Links

      Sentiment Analysis with Twint & Textblob (POC)

      This code extracts Tweets from Twitter profiles without the need for Twitter's API, so that they can be used for NLP-based sentiment analysis. Twint leverages Twitter's search operators to scrape Tweets based on specific users, topics, hashtags, and trends.
      GitHub Andrew Schleiss


      The repository encompasses modules designed for hierarchical transformers tailored to tabular data, along with a synthetic credit card transaction dataset. Noteworthy adaptations include a Modified Adaptive Softmax for effective masking and a Modified DataCollatorForLanguageModeling specifically crafted for tabular data. These modules are integrated within the transformers library from HuggingFace.
      IBM IBM


      This project's goal is to construct a multi-agent simulator dedicated to anti-money laundering (AML) and provide access to synthetically generated data. The aim is to enable researchers to devise and deploy their innovative algorithms using a uniform dataset.
      IBM IBM

      Code for the Bank of England Staff Working Paper 848

      This code creates predictive models for anticipating financial crises using machine learning on macro-financial data spanning 17 countries from 1870 to 2016. In comparison to traditional logistic regression, machine learning models exhibit superior performance in predicting crises beyond the sample period. The code employs a unique approach based on Shapley values to uncover economic factors influencing the machine learning models.
      Bank of England Bank of England
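
To illustrate the Shapley-value idea mentioned in the description above, here is a minimal sketch that computes exact Shapley values for a tiny hand-written scoring function. It is not the working paper's code or model: `toy_model`, its feature names, the input `x`, and the all-zero `baseline` are hypothetical, and the exact enumeration used here only scales to a handful of features.

```python
from itertools import combinations
from math import factorial


def shapley_values(predict, x, baseline):
    """Exact Shapley values of predict at x, relative to a baseline input.

    Features outside a coalition are set to their baseline value.
    The cost is exponential in the number of features (toy cases only).
    """
    n = len(x)

    def value(coalition):
        # Evaluate the model with only the coalition's features "switched on".
        z = [x[i] if i in coalition else baseline[i] for i in range(n)]
        return predict(z)

    phi = [0.0] * n
    for i in range(n):
        others = [j for j in range(n) if j != i]
        for size in range(n):
            # Standard Shapley weight for coalitions of this size.
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for subset in combinations(others, size):
                s = set(subset)
                phi[i] += weight * (value(s | {i}) - value(s))
    return phi


# Hypothetical toy "crisis score" with one interaction term (an assumption,
# not the model from the paper).
def toy_model(z):
    credit_growth, yield_slope, debt_ratio = z
    return 2.0 * credit_growth - 1.0 * yield_slope + credit_growth * debt_ratio


x = [1.0, 2.0, 3.0]
baseline = [0.0, 0.0, 0.0]
phi = shapley_values(toy_model, x, baseline)
print(phi)
```

The contributions sum to the difference between the score at `x` and the score at `baseline`, and the interaction term is split equally between the two features involved.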

      Code for the Bank of England Staff Working Paper 905

      This code implements a model discussed in the Bank of England Staff Working Paper 905. The model assesses the effectiveness of multiple requirements in bank regulation using a rule-based methodology.
      Bank of England Bank of England

      Elliptic Plusplus

      This repository introduces a comprehensive applied data science approach to Bitcoin network fraud detection, leveraging the Elliptic++ dataset. Utilizing graph data, the repository employs four graph types for analysis: transaction-to-transaction, address-to-address interaction, address-transaction, and user entity graphs. The approach involves training diverse machine learning algorithms on these graphs, to demonstrate fraud detection for both illicit transactions and addresses.
      GIT DISL Git Disl


      This GitHub repository houses code (TensorFlow version) and datasets for the paper "BERT4ETH: A Pre-trained Transformer for Ethereum Fraud Detection," accepted at the ACM Web Conference (WWW) 2023, and includes the presentation slides. An update (Section 5.5) discussing multi-hop modeling has been added to the arXiv paper.
      GIT DISL Youssef Elmougy and Ling Liu

      A Framework for Joint Table Identification and Cell Structure Recognition Using Visual Context

      This notebook explores the FinTabNet dataset and builds a framework for joint table identification and cell structure recognition using visual context, with the aim of capturing financial data.
      IBM IBM

      Finance proposition Bank Notebook

      This notebook delves into the Finance Proposition Bank dataset, known as FinProp, which features proposition bank-style annotations applied to sentences from the legal domain extracted from past IBM annual financial reports. The notebook is supported by data encompassing around 1,000 sentences, each annotated with a set of "universal" semantic role labels, encompassing aspects such as parts of speech, argument labeling, and predicate labeling.
      IBM IBM

      Android KYC Scan

      Open code repository by Accura Scan, an identity fraud prevention company, for customer onboarding and eKYC processes with real-time user authentication. It offers Optical Character Recognition (OCR) functionality across English, Latin, Chinese, Korean, and Japanese languages, and uses face biometrics to verify the user's selfie against the document image. User authentication and a liveness check, which involves both active and passive selfie techniques, safeguard against identity theft and spoofing attacks.
      Accura Scan Accura Scan

      Consumer Complaints Classification

      An open source layer for consumer complaints data classification
      Shubham Chouksey FIS Global Solutions


      Presidio helps to ensure sensitive data is properly managed and governed. It provides fast identification and anonymization modules for private entities in text such as credit card numbers, names, locations, social security numbers, bitcoin wallets, US phone numbers, financial data and more.
      Microsoft Microsoft

      Consumer Complaints

      The model analyses consumer complaints filed against companies for various financial products, such as credit card payment problems or debt collection tactics. For each financial product and year, the model determines the total number of complaints, the number of companies receiving complaints, and the highest percentage of complaints directed at a single company. This analysis aims to provide insights into the distribution and concentration of consumer complaints across different companies in the financial sector.
      Mahzad Khoshlessan, University of Michigan Mahzad Khoshlessan
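
The per-product, per-year summary described above can be sketched as follows. This is a minimal illustration and not the repository's code: the sample records, the `(product, year, company)` layout, and the `summarise` helper are hypothetical.

```python
from collections import Counter, defaultdict

# Hypothetical sample records: (product, year, company). Not real complaint data.
complaints = [
    ("Credit card", 2019, "Bank A"),
    ("Credit card", 2019, "Bank A"),
    ("Credit card", 2019, "Bank B"),
    ("Debt collection", 2019, "Collector X"),
    ("Debt collection", 2020, "Collector X"),
    ("Debt collection", 2020, "Collector Y"),
    ("Debt collection", 2020, "Collector Y"),
    ("Debt collection", 2020, "Collector Z"),
]


def summarise(records):
    """Return {(product, year): (total, n_companies, top_share_percent)}."""
    per_group = defaultdict(Counter)
    for product, year, company in records:
        per_group[(product, year)][company] += 1

    summary = {}
    for key, counts in per_group.items():
        total = sum(counts.values())
        # Share of complaints received by the most-complained-about company,
        # rounded to a whole percent.
        top_share = round(100 * max(counts.values()) / total)
        summary[key] = (total, len(counts), top_share)
    return summary


summary = summarise(complaints)
for (product, year), stats in sorted(summary.items()):
    print(product, year, *stats)
```

A high top share for a product and year suggests that complaints are concentrated on one company rather than spread across the sector.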

      Text Mining

      A generative probabilistic model used in natural language processing (NLP) and machine learning. It is specifically designed for topic modeling, a technique used to identify the topics present in a collection of text documents.
      Stephen Hansen, University of Oxford Stephen Hansen

      Finra Trace

      Research project on the academic version of the Financial Industry Regulatory Authority (FINRA) Trade Reporting and Compliance Engine (TRACE). The model analyses interaction and trading behaviour among dealers in the over-the-counter (OTC) corporate bond market. Topic modeling techniques, mostly Latent Dirichlet Allocation (LDA), are used to analyse the bonds traded by each dealer on each day. Preliminary results show that LDA has the flexibility to analyse trading interaction in multiple dimensions.
      Raymond Chen Raymond Chen


      The API supports searching and retrieving complaint data, with features for searching complaints, suggesting data based on input, and retrieving complaints by ID. Its requirements are batch-installed via pip. They include Django as the web framework, django-localflavor for country-specific Django helpers, Django REST framework for the REST API, elasticsearch as a low-level client for interacting with Elasticsearch, and requests for making HTTP requests to obtain data in various formats.
      Consumer Financial Protection Bureau Consumer Financial Protection Bureau


      A Python-Markdown extension for interactive regulation text
      Consumer Financial Protection Bureau Consumer Financial Protection Bureau


      Open source software for anonymizing sensitive personal data. It has been designed from the ground up to provide high scalability, ease of use and a tight integration of the many different aspects relevant to data anonymization. Its highlights include:
      ARX ARX


      Black-it is a user-friendly toolbox created to assist in adjusting the settings of agent-based models and simulations (ABMs). It utilizes advanced methods to explore the parameter possibilities effectively. The black-box calibrator uses a loss function and a sequence of chosen search algorithms to estimate the wanted parameters. It comes with a set of ready-to-use example models, loss functions and search algorithms. Custom models and functions can be implemented to use with the calibrator.
      Banca d'Italia Banca d'Italia

      Heavy Nodes in a Small Neighborhood: Algorithms and Applications

      Open code repo by King's College researchers that aims to isolate suspicious AML activity by analysing series of interrelated transactions for 'smurfing' activities.
      Society for Industrial and Applied Mathematics Society for Industrial and Applied Mathematics

      Hapi Multi Mongo

      A plugin code repo for MongoDB, a document-oriented database whose flexible schema suits large or complex datasets such as financial data sets. The plugin provides access to multiple MongoDB servers and various databases in the request/reply life cycle. It is designed to accept complex configuration options and exposes (or decorates) the connections object on the server object.
      Alyne Mitratech

      Data Protection Framework

      A Python library and command-line application for the identification, anonymization and de-anonymization of Personally Identifiable Information (PII) data. The framework detects PII in two ways: (1) regular expressions that match known patterns and (2) NLP-based Named Entity Recognition (NER).
      ThoughtWorks Datakind ThoughtWorks Datakind
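
The regular-expression half of this approach, together with reversible anonymization, can be sketched as follows. This is a minimal illustration using only Python's standard `re` module and is not the framework's API: the `PATTERNS` table, the placeholder format, and the `anonymize` and `deanonymize` helpers are hypothetical, and the NER-based detection is left out.

```python
import re

# Illustrative patterns only (assumptions); real PII detection needs broader,
# locale-aware rules and an NER model for names and places.
PATTERNS = {
    "EMAIL": re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+"),
    "PHONE": re.compile(r"\b\d{3}-\d{3}-\d{4}\b"),
}


def anonymize(text):
    """Replace each match with a placeholder; return (masked_text, mapping)."""
    mapping = {}
    for label, pattern in PATTERNS.items():
        def _swap(match, label=label):
            # Number placeholders in order of discovery so they stay unique.
            token = f"<{label}_{len(mapping) + 1}>"
            mapping[token] = match.group(0)
            return token
        text = pattern.sub(_swap, text)
    return text, mapping


def deanonymize(text, mapping):
    """Restore the original values from the placeholder mapping."""
    for token, original in mapping.items():
        text = text.replace(token, original)
    return text


sample = "Contact jane.doe@example.com or call 555-123-4567."
masked, mapping = anonymize(sample)
restored = deanonymize(masked, mapping)
print(masked)
```

Keeping the mapping separate from the masked text is what makes de-anonymization possible, so in practice the mapping would need to be stored as securely as the original data.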

      API-based Prudential Reporting System

      An Application Programming Interface (API) and back-office reporting and visualization application that (a) allows financial institutions to submit high-quality, granular data to the financial authority digitally, automatically, and at higher frequency; (b) enables supervisory staff to validate data faster and sharpen their analysis by generating customized reports, in different formats, for supervisory and policy development purposes; and (c) by improving data quality and access and developing new tools for data visualization and analysis, helps supervisors implement a risk-based supervisory approach that reduces compliance costs and promotes financial inclusion while ensuring financial stability and integrity.
      Cambridge SupTech Lab R2A


      An API program that provides data infrastructure for AML compliance. For this project, the Mexican National Banking and Securities Commission (CNBV) collaborated with R2A to revamp its data infrastructure, aiming to enhance its anti-money laundering (AML) supervisory capabilities and accommodate the expanding fintech sector. The objectives included enabling digital submission of AML compliance information by financial institutions, improving the volume, granularity, and quality of AML-related data, importing historical records into a central platform, and enhancing AML-related data validation and analysis.
      Cambridge SupTech Lab R2A