The Digital Sandbox provides participants with a number of tools and features to support innovation

Collaborate

Work within the sandbox environment to rapidly develop new ideas and Proofs of Concept, discover and integrate APIs, and tap into the ecosystem

Test

Use synthetic data to demonstrate a Proof of Concept, train a model, or validate a solution

Showcase

Demonstrate your solution to the wider community with a customisable showcase space

Data

Access to data is a key ingredient of innovation.

The Digital Sandbox gives successful participants access to a range of data assets for testing, developing and validating solutions relating to the use cases. The following is a high-level breakdown of the data that will be available to successful applicants. More information will be provided in due course.

Overview: The data for the Digital Sandbox falls into a few broad categories:
- Firm disclosures: public disclosure information (company reports, web statements, etc.), TCFD-aligned disclosures
- Geospatial data: satellite imagery, public data sets
- Fund data: fund characteristics, e.g. a fund's overall carbon intensity score

Scope: Currently, the data covers characteristics at a global level. As we match data sets, the scope is likely to be narrowed to focus on UK companies where possible. In general, the data points covering the Social factors in ESG reporting have a broader scope than the Environmental and Governance data.

Number of companies: 1,000 to >10,000, covering large, medium and small firms. The data is generally biased towards larger firms, as ESG disclosure data is more readily available for them. Multiple providers supply data sets of varying size.

Data characteristics: Data used for the ESG stage of the Digital Sandbox is mainly based on open disclosures and is reviewed and aggregated by subject matter experts. The data covers >500 data points across environmental, governance and social factors.

Collection mechanism: Mechanisms vary across the data and include interviews, questionnaires, web scraping and satellite imagery.

Sectors: Range of sectors including financial services, healthcare, government activity, real estate, energy and more.

Frequency: Data is generally collected annually and covers 2016 to the present day. Satellite imaging data is collected more frequently.

Reference

Synthetic Entities
Synthetically created entities that are used as reference data to link all of the datasets together within the ecosystem. Statistically representative of UK Companies House.
Estimated records to be created: 600K

Synthetic Individuals
Synthetically created individuals that are representative of 100% of the UK population. Based on ONS data.
Estimated records to be created: 7 million

Banking

Consumer table, Companies table and Transactional
Retail and Wholesale banking data representing transactions for consumers and customers across the UK. There will be 5 fictional UK banks that are representative of the type of transactions that you would expect to see in a UK bank. We have also looked to represent some of the behaviours that we would see as a result of the Covid crisis.
Estimated records to be created: 5.4 million (Consumer), 600K (Companies), 5.4 million (Transactional)

Device data
Device data that represents the devices used for faster payments across the UK and can be used to detect fraudulent behaviour.
Estimated records to be created: 5 million faster payments over 100k customers

Black listed accounts
List of bad actors or entities that have been highlighted as such through various risk indicators.
Estimated records to be created: 10K

Credit reference

Credit reference data that represents an individual's or entity's past track record with credit.
Estimated records to be created: TBC

SME lending

Issued loans to SMEs
Estimated records to be created: 300K

Loan history
Profit/loss statements for entities that have made credit applications
Estimated records to be created: 240K

SME directors
Directors and officers for entities in the ecosystem
Estimated records to be created: 1.2 million

COVID SME lending
Applications for Covid-specific government relief or loans
Estimated records to be created: 360K

| Grouping | Data | Description | Estimated records to be created |
| --- | --- | --- | --- |
| Reference | Synthetic Entities | Synthetically created entities that are used as reference data to link all of the datasets together within the ecosystem. Statistically representative of UK Companies House. | 600K |
| Reference | Synthetic Individuals | Synthetically created individuals that are representative of 100% of the UK population. Based on ONS data. | 7 million |
| Banking | Consumer table, Companies table and Transactional | Retail and Wholesale banking data representing transactions for consumers and customers across the UK. There will be 5 fictional UK banks that are representative of the type of transactions that you would expect to see in a UK bank. We have also looked to represent some of the behaviours that we would see as a result of the Covid crisis. | 5.4 million (Consumer), 600K (Companies), 2 billion (Transactional) |
| Banking | Device data | Device data that represents the devices used for faster payments across the UK and can be used to detect fraudulent behaviour. | 5 million faster payments over 100k customers |
| Banking | Black listed accounts | List of bad actors or entities that have been highlighted as such through various risk indicators. | 10K |
| Credit reference | Credit reference | Credit reference data that represents an individual's or entity's past track record with credit. | TBC |
| SME lending | SME lending | Issued loans to SMEs | 300K |
| SME lending | Loan history | Profit/loss statements for entities that have made credit applications | 240K |
| SME lending | SME directors | Directors and officers for entities in the ecosystem | 1.2 million |
| SME lending | COVID SME lending | Applications for Covid-specific government relief or loans | 360K |

| Grouping | Data | Description | Estimated records |
| --- | --- | --- | --- |
| Reference | Synthetic Entities | Synthetically created entities that are used as reference data to link all of the datasets together within the ecosystem. Statistically representative of UK Companies House. | 600K |
| Reference | Synthetic Individuals | Synthetically created individuals that are representative of the UK population, based on Office for National Statistics data. | 7 million |
| Banking | Transactions | Retail and Wholesale banking data representing transactions for consumers across the UK through the creation of 5 fictional banks. We have also looked to represent some of the behaviours that we would see as a result of the Covid-19 crisis. | 400 million |
| Banking | Device data | Device data that represents the devices used for faster payments across the UK. Can be used to detect fraudulent behaviour. | 5 million |
| SME lending | Loan history | Issued loans to SMEs. | 65k |
| SME lending | Credit Card History | Business credit card statistics | 190k |
| SME lending | Current Account History | Summary statistics about entity current accounts | 750k |
| SME lending | SME Directors | Directors and officers for entities in the ecosystem | 2.5 million |
| SME lending | COVID SME Lending | Covid-19 business impact, based on applications made for Covid-19-specific government relief or loans | 500k |
| SME lending | Factoring | Factoring information for SMEs that have factored their accounts receivable | 500k |
| SME lending | Profit and Loss | Profit and loss statements for SME entities | 500k |
| SME lending | Accounts Receivable | A list of invoice details which inform credit decisioning | 38 million |
| SME lending | Lending providers | Institutions providing lending in the market, both traditional and alternative | 350 |
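
The Synthetic Entities data is described above as reference data that links the other datasets together. A minimal sketch of what that linking could look like in practice follows. It assumes each dataset carries a shared entity key; the `entity_id` field, the other field names, the sample records and the helper functions are all hypothetical and are not taken from the sandbox's actual schema.

```python
from collections import defaultdict

# Hypothetical sample records: every field name and value is invented
# for illustration and does not reflect the real sandbox schema.
entities = [
    {"entity_id": "E001", "name": "Example Bakery Ltd"},
    {"entity_id": "E002", "name": "Example Plumbing Ltd"},
]
loans = [
    {"entity_id": "E001", "amount": 25000},
    {"entity_id": "E001", "amount": 10000},
    {"entity_id": "E002", "amount": 40000},
]
directors = [
    {"entity_id": "E001", "director": "A. Sample"},
    {"entity_id": "E002", "director": "B. Sample"},
    {"entity_id": "E002", "director": "C. Sample"},
]


def group_by_entity(rows):
    """Index a dataset's rows by the (assumed) shared entity key."""
    grouped = defaultdict(list)
    for row in rows:
        grouped[row["entity_id"]].append(row)
    return grouped


def link_datasets(entities, **datasets):
    """Build one combined view per entity across all supplied datasets."""
    grouped = {name: group_by_entity(rows) for name, rows in datasets.items()}
    linked = {}
    for entity in entities:
        eid = entity["entity_id"]
        linked[eid] = {"entity": entity}
        for name, by_entity in grouped.items():
            # Entities with no rows in a dataset get an empty list.
            linked[eid][name] = by_entity.get(eid, [])
    return linked


linked = link_datasets(entities, loans=loans, directors=directors)

# Example analysis over the linked view: total lending per entity.
total_lent = {
    eid: sum(loan["amount"] for loan in view["loans"])
    for eid, view in linked.items()
}
```

Keying every dataset on the reference entity means that a question spanning several datasets, such as lending per entity alongside its directors, becomes a lookup on one combined view rather than a separate pass over each dataset.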

IDE

Access to an Integrated Development Environment (IDE) for application development and data analytics.

The environment is pre-configured to provide developers quick access to cloud computing resources and to securely access datasets in order to conduct meaningful investigations and analysis.

These investigations can be connected to source code repositories to then share with teams and the wider ecosystem.

API Marketplace

Access, connect and test a catalogue of FinTech and RegTech APIs from across the global FinTech ecosystem.

With over 300 APIs currently in the marketplace, rapidly discover new providers and leverage existing services to accelerate the development of new solutions.

Minimal coding is required. With code snippets in more than 50 languages, users of the marketplace can begin making test calls to different providers to check whether an API is suitable.

Innovation Ecosystem

Collaborate with a network of innovators, regulators, financial services firms and investors.

Showcase Space

A platform to demonstrate the solution you have developed to the broader financial services community.

Covid-19 Use Cases

In collaboration with the industry, we have published Covid-19 use cases for innovators to tackle.

Register & Apply now

Complete the registration form and verify your email to create an account.

Once you have created an account, you will be able to access the application system, view the eligibility criteria and submit an application.

Questions? Read our FAQs here
Read the FCA press release here

We can be contacted at digital.sandbox@fca.org.uk 

More information & Contact us