3. The number of corporate networks is overwhelming, and so it is hard to prioritise which corporate ownership structures are more ‘risky’ than others
16
16
17
17
451 Corporate Risk Miner allows a user to navigate over different corporate ownership networks extracted from UK Companies House (UKCH) to identify and visualise those exhibiting risk signatures associated with financial crime. Example risk signatures include:
18
-
* Cyclic ownership: Circularcompanyownership (e.g. Company A owns Company B which owns Company C which owns Company A)
19
-
* Long-chain ownership: Long chains of corporate ownership (e.g. Person A controls company A. Company A is an officer for Company B. Company B is an officer of company C. etc)
18
+
* Cyclic ownership: measureofnetworkinterconnectedness (e.g. Company A owns Company B which owns Company C which owns Company A,orcasewhenthesamepeopledirectmultiplecompanies.)
20
19
* Links to tax havens: Corporate networks which involve companies/people associated with tax haven or secrecy jurisdictions
21
-
* Presence of proxy directors: Proxy directors are individual people who are registered as a company director on paper but who are likely never involved in the running of the business.
22
-
* Links to sanctioned entities: Official sanctioned people or companies, from sources such as the UN Sanctions List.
20
+
* Presence of proxy directors: Proxy directors are entities that have links to more than 50 companies.
23
21
* Links to politically-exposed persons (PEPs)
24
-
* Links to disqualified directors
22
+
* Links to russian politicians
23
+
25
24
26
25
The user can customise the relative importance of each risk signature for their search. The app then computes a **total risk score** for each corporate network in UKCH, and outlines the details of the most high-risk networks. The user can export these network results as a .csv file for later viewing.
27
26
skipped 30 lines
58
57
In this project we used UK Company House Datasets. All information regarding the dataset, input schema and data processing can be found in [data_cache](https://github.com/sahanmar/451/tree/main/data_cache).
59
58
60
59
#### Data enrichment
61
-
The original UKCH data did not provide niethersanctionednorpep information. Hence, the data were enriched with the additional information from the publicly available external datasets. We have scraped [UN sanctions](https://www.un.org/securitycouncil/content/un-sc-consolidated-list), [Russian and Belorussian PEPs](https://rupep.org/en/persons_list/) and [all politicians dataset](https://raw.githubusercontent.com/everypolitician/everypolitician-data/master/countries.json). The scrapers, parsers and README can be found in [sanctions_and_peps](https://github.com/sahanmar/451/tree/main/sanctions_and_peps) directory.
62
-
In the final version of the app, UNandAllpoliticians were used.
60
+
The original UKCH data did not provide pep information. Hence, the data wes enriched with the additional information from the publicly available external datasets. We have scraped [UN sanctions](https://www.un.org/securitycouncil/content/un-sc-consolidated-list), [Russian and Belorussian PEPs](https://rupep.org/en/persons_list/) and [all politicians dataset](https://raw.githubusercontent.com/everypolitician/everypolitician-data/master/countries.json). The scrapers, parsers and README can be found in [sanctions_and_peps](https://github.com/sahanmar/451/tree/main/sanctions_and_peps) directory.
61
+
In the final version of the app, Russianrupep.organdEveryPolitician.org were used.
63
62
64
63
### Limitations
65
-
* Limited to cliques of ??? hop distance owing to space limitation
64
+
* If a user wants to refresh the cached data with the latest UKCH datasets, it would need to be downloaded from UKCH company house and formatted as per data_schema/README instructions.
65
+
* Limited to neighbourhood of 2 hop distance, when network is parto of a Giant Ownership component.
66
66
* Cyclicity calculation assumes an undirected graph to save computational time. This could be improved by taking into account specific directions of ownership.
67
-
* Entity resolution for company/people entities could be improved
67
+
* Entity resolution for company/people entities could be improved.Currentlylinkingisdoneonname+yob+mob.
68
68
* Graph visualisation for large corporate networks can be too cluttered to be useful.
69
69
70
70
### Potential next steps
71
71
* Expand to corporate ownership databases outside of the UK, for example using OpenCorporates data.
72
72
* Incorporate more external data sources identifying criminal or potentially-criminal activity for companies and people.