🤬
  • ■ ■ ■ ■ ■ ■
    README.md
    skipped 14 lines
    15 15  3. The number of corporate networks is overwhelming, and so it is hard to prioritise which corporate ownership structures are more ‘risky’ than others
    16 16   
    17 17  451 Corporate Risk Miner allows a user to navigate over different corporate ownership networks extracted from UK Companies House (UKCH) to identify and visualise those exhibiting risk signatures associated with financial crime. Example risk signatures include:
    18  -* Cyclic ownership: Circular company ownership (e.g. Company A owns Company B which owns Company C which owns Company A)
    19  -* Long-chain ownership: Long chains of corporate ownership (e.g. Person A controls company A. Company A is an officer for Company B. Company B is an officer of company C. etc)
     18 +* Cyclic ownership: measure of network interconnectedness (e.g. Company A owns Company B which owns Company C which owns Company A, or case when the same people direct multiple companies.)
    20 19  * Links to tax havens: Corporate networks which involve companies/people associated with tax haven or secrecy jurisdictions
    21  -* Presence of proxy directors: Proxy directors are individual people who are registered as a company director on paper but who are likely never involved in the running of the business.
    22  -* Links to sanctioned entities: Official sanctioned people or companies, from sources such as the UN Sanctions List.
     20 +* Presence of proxy directors: Proxy directors are entities that have links to more than 50 companies.
    23 21  * Links to politically-exposed persons (PEPs)
    24  -* Links to disqualified directors
     22 +* Links to russian politicians
     23 + 
    25 24   
    26 25  The user can customise the relative importance of each risk signature for their search. The app then computes a **total risk score** for each corporate network in UKCH, and outlines the details of the most high-risk networks. The user can export these network results as a .csv file for later viewing.
    27 26   
    skipped 23 lines
    51 50   
    52 51  ## Usage
    53 52   
    54  -TBD
     53 +- Access the app online.
     54 + 
     55 +Data cache can either be:
     56 + 
     57 +- Downloaded from <a href="https://drive.google.com/drive/folders/15I2-spww_5ZG6tzslLyU4gGwb3zDNZU-?usp=sharing" target="_blank">GoogleDrive.</a>
     58 + (Then unzipped and stored in `/data` folder.)
     59 + 
     60 +- Recomputed with fresh UK Company House Downloads. Follow README and notebook in `/data_cache/` folder.
    55 61   
    56 62  ## Additional Information
    57 63   
    skipped 2 lines
    60 66  In this project we used UK Company House Datasets. All information regarding the dataset, input schema and data processing can be found in [data_cache](https://github.com/sahanmar/451/tree/main/data_cache).
    61 67   
    62 68  #### Data enrichment
    63  -The original UKCH data did not provide niether sanctioned nor pep information. Hence, the data were enriched with the additional information from the publicly available external datasets. We have scraped [UN sanctions](https://www.un.org/securitycouncil/content/un-sc-consolidated-list), [Russian and Belorussian PEPs](https://rupep.org/en/persons_list/) and [all politicians dataset](https://raw.githubusercontent.com/everypolitician/everypolitician-data/master/countries.json). The scrapers, parsers and README can be found in [sanctions_and_peps](https://github.com/sahanmar/451/tree/main/sanctions_and_peps) directory.
    64  -In the final version of the app, UN and All politicians were used.
     69 +The original UKCH data did not provide pep information. Hence, the data wes enriched with the additional information from the publicly available external datasets. We have scraped [UN sanctions](https://www.un.org/securitycouncil/content/un-sc-consolidated-list), [Russian and Belorussian PEPs](https://rupep.org/en/persons_list/) and [all politicians dataset](https://raw.githubusercontent.com/everypolitician/everypolitician-data/master/countries.json). The scrapers, parsers and README can be found in [sanctions_and_peps](https://github.com/sahanmar/451/tree/main/sanctions_and_peps) directory.
     70 +In the final version of the app, Russian rupep.org and EveryPolitician.org were used.
    65 71   
    66 72  ### Limitations
    67  -* Limited to cliques of ??? hop distance owing to space limitation
     73 +* If a user wants to refresh the cached data with the latest UKCH datasets, it would need to be downloaded from UKCH company house and formatted as per data_schema/README instructions.
     74 +* Limited to neighbourhood of 2 hop distance, when network is parto of a Giant Ownership component.
    68 75  * Cyclicity calculation assumes an undirected graph to save computational time. This could be improved by taking into account specific directions of ownership.
    69  -* Entity resolution for company/people entities could be improved
     76 +* Entity resolution for company/people entities could be improved. Currently linking is done on name+yob+mob.
    70 77  * Graph visualisation for large corporate networks can be too cluttered to be useful.
    71 78   
    72 79  ### Potential next steps
    73 80  * Expand to corporate ownership databases outside of the UK, for example using OpenCorporates data.
    74 81  * Incorporate more external data sources identifying criminal or potentially-criminal activity for companies and people.
    75  -* Add an ability to filter based on a custom list of people/companies, explain.......
     82 +* Allow user to input custom lists as additional risk flags.
    76 83   
Please wait...
Page is in error, reload to recover