3. The number of corporate networks is overwhelming, and so it is hard to prioritise which corporate ownership structures are more ‘risky’ than others
16
16
17
17
451 Corporate Risk Miner allows a user to navigate over different corporate ownership networks extracted from UK Companies House (UKCH) to identify and visualise those exhibiting risk signatures associated with financial crime. Example risk signatures include:
18
-
* Cyclic ownership: Circularcompanyownership (e.g. Company A owns Company B which owns Company C which owns Company A)
19
-
* Long-chain ownership: Long chains of corporate ownership (e.g. Person A controls company A. Company A is an officer for Company B. Company B is an officer of company C. etc)
18
+
* Cyclic ownership: measureofnetworkinterconnectedness (e.g. Company A owns Company B which owns Company C which owns Company A,orcasewhenthesamepeopledirectmultiplecompanies.)
20
19
* Links to tax havens: Corporate networks which involve companies/people associated with tax haven or secrecy jurisdictions
21
-
* Presence of proxy directors: Proxy directors are individual people who are registered as a company director on paper but who are likely never involved in the running of the business.
22
-
* Links to sanctioned entities: Official sanctioned people or companies, from sources such as the UN Sanctions List.
20
+
* Presence of proxy directors: Proxy directors are entities that have links to more than 50 companies.
23
21
* Links to politically-exposed persons (PEPs)
24
-
* Links to disqualified directors
22
+
* Links to russian politicians
23
+
25
24
26
25
The user can customise the relative importance of each risk signature for their search. The app then computes a **total risk score** for each corporate network in UKCH, and outlines the details of the most high-risk networks. The user can export these network results as a .csv file for later viewing.
27
26
skipped 23 lines
51
50
52
51
## Usage
53
52
54
-
TBD
53
+
- Access the app online.
54
+
55
+
Data cache can either be:
56
+
57
+
- Downloaded from <a href="https://drive.google.com/drive/folders/15I2-spww_5ZG6tzslLyU4gGwb3zDNZU-?usp=sharing" target="_blank">GoogleDrive.</a>
58
+
(Then unzipped and stored in `/data` folder.)
59
+
60
+
- Recomputed with fresh UK Company House Downloads. Follow README and notebook in `/data_cache/` folder.
55
61
56
62
## Additional Information
57
63
skipped 2 lines
60
66
In this project we used UK Company House Datasets. All information regarding the dataset, input schema and data processing can be found in [data_cache](https://github.com/sahanmar/451/tree/main/data_cache).
61
67
62
68
#### Data enrichment
63
-
The original UKCH data did not provide niethersanctionednorpep information. Hence, the data were enriched with the additional information from the publicly available external datasets. We have scraped [UN sanctions](https://www.un.org/securitycouncil/content/un-sc-consolidated-list), [Russian and Belorussian PEPs](https://rupep.org/en/persons_list/) and [all politicians dataset](https://raw.githubusercontent.com/everypolitician/everypolitician-data/master/countries.json). The scrapers, parsers and README can be found in [sanctions_and_peps](https://github.com/sahanmar/451/tree/main/sanctions_and_peps) directory.
64
-
In the final version of the app, UNandAllpoliticians were used.
69
+
The original UKCH data did not provide pep information. Hence, the data wes enriched with the additional information from the publicly available external datasets. We have scraped [UN sanctions](https://www.un.org/securitycouncil/content/un-sc-consolidated-list), [Russian and Belorussian PEPs](https://rupep.org/en/persons_list/) and [all politicians dataset](https://raw.githubusercontent.com/everypolitician/everypolitician-data/master/countries.json). The scrapers, parsers and README can be found in [sanctions_and_peps](https://github.com/sahanmar/451/tree/main/sanctions_and_peps) directory.
70
+
In the final version of the app, Russianrupep.organdEveryPolitician.org were used.
65
71
66
72
### Limitations
67
-
* Limited to cliques of ??? hop distance owing to space limitation
73
+
* If a user wants to refresh the cached data with the latest UKCH datasets, it would need to be downloaded from UKCH company house and formatted as per data_schema/README instructions.
74
+
* Limited to neighbourhood of 2 hop distance, when network is parto of a Giant Ownership component.
68
75
* Cyclicity calculation assumes an undirected graph to save computational time. This could be improved by taking into account specific directions of ownership.
69
-
* Entity resolution for company/people entities could be improved
76
+
* Entity resolution for company/people entities could be improved.Currentlylinkingisdoneonname+yob+mob.
70
77
* Graph visualisation for large corporate networks can be too cluttered to be useful.
71
78
72
79
### Potential next steps
73
80
* Expand to corporate ownership databases outside of the UK, for example using OpenCorporates data.
74
81
* Incorporate more external data sources identifying criminal or potentially-criminal activity for companies and people.
75
-
* Add an ability to filter based on a custom list of people/companies, explain.......
82
+
* Allow user to input custom lists as additional risk flags.