🤬
  • ■ ■ ■ ■ ■ ■
    README.md
    skipped 14 lines
    15 15  3. The number of corporate networks is overwhelming, and so it is hard to prioritise which corporate ownership structures are more ‘risky’ than others
    16 16   
    17 17  451 Corporate Risk Miner allows a user to navigate over different corporate ownership networks extracted from UK Companies House (UKCH) to identify and visualise those exhibiting risk signatures associated with financial crime. Example risk signatures include:
    18  -* Cyclic ownership: Circular company ownership (e.g. Company A owns Company B which owns Company C which owns Company A)
    19  -* Long-chain ownership: Long chains of corporate ownership (e.g. Person A controls company A. Company A is an officer for Company B. Company B is an officer of company C. etc)
     18 +* Cyclic ownership: measure of network interconnectedness (e.g. Company A owns Company B which owns Company C which owns Company A, or case when the same people direct multiple companies.)
    20 19  * Links to tax havens: Corporate networks which involve companies/people associated with tax haven or secrecy jurisdictions
    21  -* Presence of proxy directors: Proxy directors are individual people who are registered as a company director on paper but who are likely never involved in the running of the business.
    22  -* Links to sanctioned entities: Official sanctioned people or companies, from sources such as the UN Sanctions List.
     20 +* Presence of proxy directors: Proxy directors are entities that have links to more than 50 companies.
    23 21  * Links to politically-exposed persons (PEPs)
    24  -* Links to disqualified directors
     22 +* Links to russian politicians
     23 + 
    25 24   
    26 25  The user can customise the relative importance of each risk signature for their search. The app then computes a **total risk score** for each corporate network in UKCH, and outlines the details of the most high-risk networks. The user can export these network results as a .csv file for later viewing.
    27 26   
    skipped 30 lines
    58 57  In this project we used UK Company House Datasets. All information regarding the dataset, input schema and data processing can be found in [data_cache](https://github.com/sahanmar/451/tree/main/data_cache).
    59 58   
    60 59  #### Data enrichment
    61  -The original UKCH data did not provide niether sanctioned nor pep information. Hence, the data were enriched with the additional information from the publicly available external datasets. We have scraped [UN sanctions](https://www.un.org/securitycouncil/content/un-sc-consolidated-list), [Russian and Belorussian PEPs](https://rupep.org/en/persons_list/) and [all politicians dataset](https://raw.githubusercontent.com/everypolitician/everypolitician-data/master/countries.json). The scrapers, parsers and README can be found in [sanctions_and_peps](https://github.com/sahanmar/451/tree/main/sanctions_and_peps) directory.
    62  -In the final version of the app, UN and All politicians were used.
     60 +The original UKCH data did not provide pep information. Hence, the data wes enriched with the additional information from the publicly available external datasets. We have scraped [UN sanctions](https://www.un.org/securitycouncil/content/un-sc-consolidated-list), [Russian and Belorussian PEPs](https://rupep.org/en/persons_list/) and [all politicians dataset](https://raw.githubusercontent.com/everypolitician/everypolitician-data/master/countries.json). The scrapers, parsers and README can be found in [sanctions_and_peps](https://github.com/sahanmar/451/tree/main/sanctions_and_peps) directory.
     61 +In the final version of the app, Russian rupep.org and EveryPolitician.org were used.
    63 62   
    64 63  ### Limitations
    65  -* Limited to cliques of ??? hop distance owing to space limitation
     64 +* If a user wants to refresh the cached data with the latest UKCH datasets, it would need to be downloaded from UKCH company house and formatted as per data_schema/README instructions.
     65 +* Limited to neighbourhood of 2 hop distance, when network is parto of a Giant Ownership component.
    66 66  * Cyclicity calculation assumes an undirected graph to save computational time. This could be improved by taking into account specific directions of ownership.
    67  -* Entity resolution for company/people entities could be improved
     67 +* Entity resolution for company/people entities could be improved. Currently linking is done on name+yob+mob.
    68 68  * Graph visualisation for large corporate networks can be too cluttered to be useful.
    69 69   
    70 70  ### Potential next steps
    71 71  * Expand to corporate ownership databases outside of the UK, for example using OpenCorporates data.
    72 72  * Incorporate more external data sources identifying criminal or potentially-criminal activity for companies and people.
    73  -*
     73 +* Allow user to input custom lists.
    74 74   
Please wait...
Page is in error, reload to recover