To meet the information needs of various stakeholders across business units, technology and data engineering teams generate various insights from data analysis.
Statistical models or supervised learning models have been used in the financial services industry for decades now. Such a model involves forming a hypothesis first and then proving or disproving it using available data. The efficacy of statistical models can be impacted over time – primarily due to inefficient variables, a shift in the hypothesis, emerging unknown patterns, incorrect or incomplete labelling of data, etc. Corrective actions should hence be taken periodically to re-calibrate the model and verify its robustness. But what if one is unable to create a meaningful hypothesis because of an incomplete understanding of the data?
In this edition’s topic of the month, we take a look at how unsupervised learning can help in solving the above problem. In the case of an unsupervised learning model, one does not need to build a hypothesis upfront, but rather analyse the available data, establish relationships between the data points and gain new insights into unknown patterns. Data is parsed and labelled and unknown scenarios develop into ‘now-known’ ones, which helps to uncover or formulate a solid hypothesis and then build future capabilities for supervised learning models.
The newsletter also includes updates on transformational changes proposed by regulatory bodies such as the RBI and IRDA to protect consumer interests and respond to changes in industry dynamics. Happy reading!
ML enables decision-making intelligence by training algorithms to learn from data patterns. Simply put, in ML, users provide input data to the computer algorithms, which analyse it for patterns and arrive at decisions/recommendations based on identified data patterns.
ML can be divided into three types:
Supervised ML models are algorithms that are trained using a labelled or tagged dataset, i.e. data with a known set of variables that are classified as independent (predictors) and labelled/dependent (target). While supervised ML algorithms are heavily dependent on labels, absence of labelled data can also
be used for decision making.
Unsupervised ML algorithms help us derive insights from data sets that do not have a target/labelled variable.
Reinforcement learning, as the name suggests, is when the algorithm learns by itself through trial and error. It is a feedback-driven process that learns from experience and improves performance.
Unsupervised ML models are independent learning algorithms that explore unknown scenarios/patterns or data groupings without any supervision or guidance. Lack of supervision or human intervention in these models also eliminates human bias that may creep in while working with labelled data.
Unsupervised ML algorithms can be leveraged in the following situations:
Unsupervised ML identifies homogeneous and heterogeneous patterns in information provided. Applications of these algorithms are briefly described below:
The figure below lists the commonly used unsupervised algorithms by application:
Unsupervised models can be leveraged to build capabilities for supervised models by solving for the latter’s reliance on better labelled data. Unsupervised models offer the following benefits:
Problem statement: Transaction monitoring for a firm’s general ledger (GL) transactions across products to identify and pre-emptively track unknown risk or transaction patterns
Solution: PwC has implemented its Anomaly Detection Platform for transaction monitoring. The platform identifies unknown risks or transaction patterns in data by leveraging an unsupervised algorithm – isolation forest – to flag anomalies. These flagged labels are then fed to supervised decision trees to build explainability and detect patterns for validation. Validated patterns are streamlined into the current monitoring process. The solution is flexible across data trends, and the feedback tracked over time for these patterns will be leveraged to build a robust supervised learning engine to track anomalies, thus reducing complexity and run time.
Impact: The solution expands the user’s outlook on identifying possible risk patterns – for example, flagging accounts with sudden spikes in transaction amounts, seldom or rarely seen types of transactions within accounts, incorrect GL tagging for type of transaction or incorrect currency tagging for transactions, and defining new thresholds for certain types of transactions. It automatically flags suspicious and out-of-the-norm trends in the data. The insights provided in terms of deterministic rules or patterns make it easier for the user to plug in these checks into the monitoring process and account for any unknown leakages. Thus, it helps in building a robust monitoring process.
While unsupervised ML algorithms are a great choice when availability of data labels is an issue, one has to address some challenges that accompany their:
A survey of 164 decision makers from the financial services sectors in India, Australia and Indonesia found that 84% wish to adopt AI urgently for credit risk analysis and 67% expect real-time data and analytics for investments. Further, 66% feel legacy system dependency inhibits automation, while 36% feel data standardisation is a barrier to automation in credit risk management.
The world’s first metaverse ATM was launched on the Decentraland platform in partnership with the Metaverse Architects studio and the payment gateway Transak. The ATM will make Web 3.0 transactions similar to real-world ATM transactions where users can purchase MANA cryptocurrency with fiat currency or any other cryptocurrency. The team believes the solution will increase conversion rates in metaverse stores and make it easier for users to perform transactions in Decentraland.
Indian banks have reported 248 successful data breaches in the last 4 years, mostly pertaining to credit card data breaches. Out of the 248 breaches, 41 were reported by public sector banks and the rest by private sector banks. On this matter, the Reserve Bank of India (RBI) has informed the Centre that it has issued cyber security guidelines for better controls on data with banks. The RBI has also asked banks to strengthen their IT risk governance framework and advised management to initiate actions against erring staff within a specific time period. As per the Ministry of Home Affairs, a total of 1.4 lakh cyber security incidents were recorded in 2021.
The RBI governor discussed how banks can quickly offer personalised offerings to meet customer expectations using social media and analytics. Social media data offers the potential for enhanced customer acquisition, customer segmentation, financial inclusion and grievance management. He also emphasised the need for traditional banks to adopt technology or collaborate with FinTechs.
The IRDAI has developed a mechanism to speed up the registration process for new insurers and acquire certificate of registration. Additionally, to make the Pradhan Mantri Jeevan Jyoti Bima Yojana more accessible, the IRDAI has relaxed capital requirements. Greater flexibility will be provided to corporate agency tie-ups. While earlier each corporate agent could tie up with only three insurers each from the life, health and general insurance sector, this has now increased to nine for each sector. The regulator has also allowed insurance marketing firms to tie up with six insurance companies from each sector instead of the current cap of two. It has also recently released draft regulations and is seeking feedback from companies on proposing a 20% cap on insurance agent’s commission and a limit on expenses on-management (EoM) at 30% of gross premium for general and standalone health insurance companies in India.
In a press release dated 10 August 2022, the RBI issued the recommendations of the working group on digital lending implementation. The RBI has mandated that lenders refrain from accessing mobile phone resources such as files and media, contact lists, call logs and telephony functions. With clear consent from users, one-time access could be taken for necessary facilities.
Currently, SFBs are allowed to only lend directly. Co-lending with NBFCs will entail increased priority sector lending. Industry experts envision expertise-based lending – for instance, SFBs targeting the agriculture industry will co-lend with NBFCs currently operating in that field.
Acknowledgements: This newsletter has been researched and authored by Aniket Borse, Anuj Jain, Arpita Shrivastava, Dhananjay Goel, Harshit Singh, Krunal Sampat, Neeraj Sibal, Princia Viz, Priyank Aggarwal and Samir Shah.