A Geek’s Guide to Machine Learning (AI), Risk Analytics and Decisioning

January 14, 2023 | Jonathan Pryer

Introduction

Artificial Intelligence (AI), Machine Learning (ML) – whatever you want to call it, these buzzwords are appearing more and more throughout the business and social world. So what are they and what do they mean?

Despite the growing interest, AI/ML isn’t new at all. In fact, the models themselves have been around since the 1970s and ‘80s. In the financial sector, banks have been using ML to mitigate fraud and detect irregular buyer behaviors and patterns for the last decade or more.

Fraud is a growing concern and costs the financial sector millions of dollars in losses each year. A 2015 research note from Barclays stated that the United States is responsible for 47 percent of the world’s card fraud despite accounting for only 24 percent of total worldwide card volume. A 2014 Federal Trade Commission report shows that credit cards and other consumer payment methods produced greater losses than other types of fraud.

One of the ways in which UK financial firms have tried to reduce fraud is by implementing the Chip and PIN system. It was seen as an effective means of preventing and reducing card fraud, but a research paper by Murdoch et al (2010) demonstrated fundamental flaws in Chip and PIN.

As technology evolves, so do the methods used to commit fraud. Financial firms now rely on sophisticated artificial intelligence software that evolves, adapts and learns in line with the behavior patterns of fraudsters in order to track, detect and prevent fraud far more quickly than traditional methods. AI has also been implemented in industries outside financial services, including insurance, retail and telecommunications.

It is in the interest of the card issuer or bank to implement strategies that reduce the risk of fraud. Unfortunately, this often requires a compromise between expense and inconvenience to the merchant and the customer. Merchants are at far more risk than the end credit card user, as they ultimately bear the cost of a fraudulent purchase and the potential loss of the customer resulting from the bad experience. Other costs to the merchant include direct fraud costs, the cost of manual order review, the cost of reviewing tools and the cost of rejecting orders.

This Provenir report describes the use of AI tools in credit card fraud detection to mitigate risk. We will look at various AI detection methods including Artificial Neural Networks (ANN), Fuzzy Neural Networks (FNN), Bayesian Neural Networks (BNN) and Expert Systems.

An Overview of Fraud Prevention and Detection Techniques

The modern information age produces a vast and rapidly growing amount of data. In the U.S. alone, credit card transactions totaled 26.2 billion in 2012. To process these data sets, banks and credit card issuers need complex statistical algorithms that extract useful knowledge from the raw quantitative data.

An overview of the processes that compose Knowledge Discovery in Databases (KDD) (Source: Fayyad et al, 1996).

These systems work by comparing the observed and collected data with expected values. Expected values can be calculated in a number of ways. For example, a behavior model would look at the way a customer’s bank account has been used in the past, and any deviation from usual purchasing habits would return a suspicion score. This method assigns each transaction a score, usually between 1 and 999. The higher the score, the more suspicious the transaction is likely to be, or the more similarities it shares with known fraudulent transactions.
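The behavior model described above can be sketched in a few lines. This is only an illustration: the function name `suspicion_score`, the use of a z-score as the measure of deviation, and the cap `max_z` are assumptions made for the example, not the scoring method of any particular vendor.

```python
from statistics import mean, stdev

def suspicion_score(history, amount, max_z=4.0):
    """Map the deviation of `amount` from past spending onto a 1-999 scale.

    Hypothetical rule: the z-score of the new amount is capped at `max_z`
    and scaled linearly onto the score range.
    """
    mu = mean(history)
    sigma = stdev(history)
    if sigma == 0:
        # No variation in the history: any different amount is maximally unusual.
        z = 0.0 if amount == mu else max_z
    else:
        z = abs(amount - mu) / sigma
    z = min(z, max_z)
    return 1 + round(998 * z / max_z)

# Made-up purchase history for one customer
past_purchases = [42.0, 38.5, 51.0, 45.2, 40.3, 47.8]
print(suspicion_score(past_purchases, 44.0))   # close to usual spending: low score
print(suspicion_score(past_purchases, 900.0))  # far from usual spending: 999
```

A production system would combine many such signals (merchant, location, time of day) rather than the purchase amount alone.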

Typically, the measures taken to combat fraud can be divided into two categories – Prevention and Detection.

Fraud Prevention constitutes the necessary steps to prevent fraud from occurring in the first place, with various preventative methods used to deter fraudsters, such as MasterCard SecureCode and Verified by Visa.
Fraud Detection, the focus of this report, comes into play once fraud prevention fails. Detection consists of identifying the fraudulent activity as quickly as possible and implementing the necessary methods to block the card from being used by the perpetrator again. Issues arise when criminals adapt and change their tactics once they are aware that a prevention method is in place; more intelligent and sophisticated technology which ‘learns’ is therefore essential for the detection of fraud.

The techniques used to detect fraud also fall into two primary classes – Statistical techniques (such as clustering algorithms) and Artificial Intelligence (ANN, FNN, Data Mining). Both classes still involve mining the available data and highlighting any anomalies (which can be defined by a set of rules) in the purchasing and transaction data of the customer. The difference is that where human analysts once searched manually for usable knowledge, today we use machine learning.

Artificial Intelligence Models

Artificial Neural Networks
Also known as connectionism, parallel distributed processing or neuro-computing, Artificial Neural Networks (ANNs) are a family of machine learning algorithms that came to prominence during the late 1980s and have since become a fundamental tool in combating fraud. ANNs loosely imitate the way the human brain learns, using interconnected input, hidden and output layers.

Diagram representing a feed-forward multilayer perceptron (the most common type of ANN). (Source: www.oscarkilo.net)

The input nodes retrieve information from an outside source (for credit card fraud detection, this would be the transactional data of a customer’s account) and the output nodes send the results from the neural networks back to the external source. The hidden nodes in-between the input and output nodes have no interaction with the external source and become more complex in their configuration and nature depending on the complexity of the problem at hand.

The various nodes in each layer of the neural network are connected by edges where each edge represents a particular weight between two connected nodes. (In the human brain, these are called synapses.) The information that the neural network learns through supervised or unsupervised learning is stored in these weights.
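To make the layer structure concrete, the following is a minimal sketch of a forward pass through a tiny feed-forward network. The weights, the choice of a sigmoid activation, the three input features and the names `layer` and `forward` are all illustrative assumptions; a real network would learn its weights from data.

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def layer(inputs, weights, biases):
    """One layer: each node sums its weighted inputs, adds a bias, applies sigmoid."""
    return [
        sigmoid(sum(w * x for w, x in zip(node_weights, inputs)) + b)
        for node_weights, b in zip(weights, biases)
    ]

def forward(inputs, hidden_w, hidden_b, output_w, output_b):
    """Input nodes -> hidden nodes -> output nodes."""
    hidden = layer(inputs, hidden_w, hidden_b)
    return layer(hidden, output_w, output_b)

# Hand-picked weights for illustration: 3 inputs -> 2 hidden nodes -> 1 output
hidden_w = [[0.8, -0.4, 0.3], [-0.5, 0.9, 0.2]]
hidden_b = [0.1, -0.2]
output_w = [[1.2, -0.7]]
output_b = [0.05]

# Hypothetical scaled features: amount, merchant risk, hour of day
features = [0.9, 0.1, 0.5]
score = forward(features, hidden_w, hidden_b, output_w, output_b)[0]
print(f"network output: {score:.3f}")
```

Each inner list in `hidden_w` and `output_w` holds the edge weights feeding one node, which is where the learned information is stored.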

The way neural networks learn is similar to the way children learn to recognize animals. After seeing a dog, the child can generalize to various other breeds of dogs, categorizing and defining them as ‘dogs’ without having seen them before.
An important feature of neural networks is that their learning can be either supervised or unsupervised.

For unsupervised neural network learning, the system makes use of clustering, which groups patterns based on similarity. The two main unsupervised learning methods are Hebbian and Kohonen. Hebbian learning takes place by association: if the two neurons on either side of a synapse are activated simultaneously, the strength of that synapse is increased. Kohonen learning (also called Self-Organizing Maps) takes place by learning how the input space is categorized.
For supervised neural network learning (typically back-propagation), the correct output values for given input data are known before the algorithm starts, and the system then learns the function that maps the paired inputs to outputs.
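The Hebbian rule mentioned above is simple enough to write down directly. The sketch below assumes the plain form of the rule, in which each weight grows in proportion to the product of the activations on either side of it; the function name and the learning rate of 0.1 are arbitrary choices for illustration.

```python
def hebbian_update(weights, pre, post, learning_rate=0.1):
    """Strengthen weights[i][j] in proportion to post[i] * pre[j]."""
    return [
        [w + learning_rate * post_i * pre_j for w, pre_j in zip(row, pre)]
        for row, post_i in zip(weights, post)
    ]

# Two input neurons and two output neurons, all synapses starting at zero
weights = [[0.0, 0.0], [0.0, 0.0]]
pre = [1.0, 0.0]   # only the first input neuron is active
post = [1.0, 0.0]  # only the first output neuron is active
weights = hebbian_update(weights, pre, post)
print(weights)  # [[0.1, 0.0], [0.0, 0.0]]
```

Only the synapse joining the two simultaneously active neurons is strengthened; the others are unchanged.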

A user can train a neural network by running through examples of past data. Learning occurs when the ANN’s predicted output is compared with the known output for each example. The weights for each connection are then adjusted to reduce the difference, allowing the system to learn new patterns and behavior and improve accuracy without being explicitly programmed with them.
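The compare-and-adjust loop can be illustrated with a single sigmoid neuron trained by gradient descent. This is a deliberately reduced sketch, not full back-propagation through hidden layers, and the labelled examples, feature choices, epoch count and learning rate are all made up for the example.

```python
import math

def predict(weights, bias, x):
    z = sum(w * xi for w, xi in zip(weights, x)) + bias
    return 1.0 / (1.0 + math.exp(-z))

def train(examples, epochs=2000, learning_rate=0.5):
    """Gradient descent on one sigmoid neuron (no hidden layer)."""
    n_features = len(examples[0][0])
    weights, bias = [0.0] * n_features, 0.0
    for _ in range(epochs):
        for x, target in examples:
            # Compare the predicted output with the known output...
            error = target - predict(weights, bias, x)
            # ...and nudge each weight in the direction that reduces the error.
            weights = [w + learning_rate * error * xi for w, xi in zip(weights, x)]
            bias += learning_rate * error
    return weights, bias

# Hypothetical history: (scaled amount, foreign-merchant flag) -> 1 if fraud
examples = [
    ([0.10, 0.0], 0), ([0.20, 0.0], 0), ([0.30, 1.0], 0),
    ([0.90, 1.0], 1), ([0.80, 1.0], 1), ([0.95, 0.0], 1),
]
weights, bias = train(examples)
print(round(predict(weights, bias, [0.15, 0.0]), 2))  # small purchase: low output
print(round(predict(weights, bias, [0.90, 1.0]), 2))  # large purchase: high output
```

A multilayer network applies the same idea, with back-propagation distributing the output error across the hidden-layer weights.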

Fuzzy Neural Networks

Fuzzy Neural Networks (FNNs) are a branch of hybrid intelligence systems which make use of fuzzy logic together with ANNs to detect fraudulent activity. Fuzzy logic was first proposed by Zadeh and has since been implemented successfully in a variety of industries. The core aim of fuzzy logic is to provide an accurate method for describing human perceptions. Some experts believe that the use of fuzzy rules can provide a more natural estimate of the amount of deviation from the normal.

FNNs, like Expert Systems, make use of IF-THEN-ELSE statements and heuristic rules to handle uncertainty in applications, resulting in better approximate reasoning without the need for analytical precision. The use of traditional IF-THEN-ELSE statements and heuristic rules (see Expert Systems below) has been controversial, and FNNs have therefore not been as widely implemented as some of the other AI fraud detection systems.
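The difference between a fuzzy rule and a crisp one is that each condition holds to a degree between 0 and 1. The sketch below shows one fuzzy rule only, with no neural component; the linear membership functions, the thresholds (200, 1000, 6 hours) and the use of `min` for fuzzy AND are assumptions chosen for illustration.

```python
def ramp_up(x, start, end):
    """Membership rising linearly from 0 at `start` to 1 at `end`."""
    if x <= start:
        return 0.0
    if x >= end:
        return 1.0
    return (x - start) / (end - start)

def fuzzy_suspicion(amount, hours_since_last_purchase):
    """Degree to which the rule below fires, between 0 and 1.

    Rule: IF amount is high AND purchase is rapid THEN transaction is suspicious.
    """
    amount_is_high = ramp_up(amount, 200.0, 1000.0)
    purchase_is_rapid = 1.0 - ramp_up(hours_since_last_purchase, 0.0, 6.0)
    # min() is a common choice for fuzzy AND
    return min(amount_is_high, purchase_is_rapid)

print(fuzzy_suspicion(600.0, 3.0))   # 0.5
print(fuzzy_suspicion(1000.0, 0.0))  # 1.0
print(fuzzy_suspicion(100.0, 0.0))   # 0.0
```

A purchase of 600 made three hours after the previous one is "half" high and "half" rapid, so the rule fires to degree 0.5 rather than being simply true or false.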

Expert Systems

Expert Systems saw increased usability and growth during the 1980s with the expansion of computer processing power, programming and AI. They were applied to credit card fraud detection as rule-based systems, which proved fairly popular when no other intelligent systems were available. These systems imitate and replicate the knowledge of an ‘expert’ person, and that knowledge falls into two classes – facts and heuristics.

Facts are items of information, such as the credit card transaction history or an individual’s credit rating.
Heuristics are the rules that a person with ‘expert’ knowledge would usually follow as a result of their experience, education, observation and training.

Expert systems work by taking this human knowledge and translating it into a logical language that a computer can understand and follow in order to solve a problem. A fundamental part of expert systems is their extensive database of stored rules, which are defined in a typical IF-THEN-ELSE format. For example, a rule-based system using IF-THEN-ELSE may look like the following:

IF the amount of purchase is greater (>) than $1000 and the card acceptance authorization is through ‘eBay’, THEN raise a suspicion score and require further verification, ELSE approve transaction.
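Translated into code, the rule above might look like the following sketch. The `Transaction` type, its field names and the returned labels are hypothetical; a real expert system would hold many such rules in a rule base rather than in a single function.

```python
from dataclasses import dataclass

@dataclass
class Transaction:
    amount: float   # purchase amount in dollars
    merchant: str   # where the card authorization came from

def evaluate(tx: Transaction) -> str:
    # IF amount > $1000 AND authorization is through eBay
    if tx.amount > 1000 and tx.merchant == "eBay":
        # THEN raise suspicion and require further verification
        return "require_verification"
    # ELSE approve the transaction
    return "approve"

print(evaluate(Transaction(amount=1500.0, merchant="eBay")))  # require_verification
print(evaluate(Transaction(amount=250.0, merchant="eBay")))   # approve
```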

The limitations of Expert Systems, however, are that they require considerable storage space and rely heavily on extensive programming of expert human knowledge in order to make decisions. Some experts believe that combining a rule-based system with neural networks improves performance in detecting fraudulent activity.

Bayesian Neural Networks

These types of networks take a slightly different approach to the general guidelines and rules of learning that are commonly seen in ANNs and FNNs. Typically, Bayesian Neural Networks use Naive Bayesian Classifiers, a simple method of classification, to classify transaction activity.
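As a rough illustration of the Naive Bayesian Classifier mentioned above, the sketch below classifies transactions described by two categorical features. The training rows, the feature values and the use of Laplace (add-one) smoothing are assumptions made for the example.

```python
import math
from collections import Counter, defaultdict

def train_naive_bayes(rows, labels):
    class_counts = Counter(labels)
    # feature_counts[label][feature_index][value] -> number of occurrences
    feature_counts = defaultdict(lambda: defaultdict(Counter))
    values = defaultdict(set)  # distinct values seen for each feature
    for row, label in zip(rows, labels):
        for i, value in enumerate(row):
            feature_counts[label][i][value] += 1
            values[i].add(value)
    return class_counts, feature_counts, values

def classify(model, row):
    class_counts, feature_counts, values = model
    total = sum(class_counts.values())
    best_label, best_log_prob = None, -math.inf
    for label, count in class_counts.items():
        # log P(label) + sum of log P(feature value | label),
        # treating the features as independent given the label
        log_prob = math.log(count / total)
        for i, value in enumerate(row):
            numerator = feature_counts[label][i][value] + 1  # add-one smoothing
            denominator = count + len(values[i])
            log_prob += math.log(numerator / denominator)
        if log_prob > best_log_prob:
            best_label, best_log_prob = label, log_prob
    return best_label

# Made-up labelled transactions: (amount band, merchant location)
rows = [
    ("high", "foreign"), ("high", "foreign"), ("low", "domestic"),
    ("low", "domestic"), ("low", "foreign"), ("high", "domestic"),
]
labels = ["fraud", "fraud", "legit", "legit", "legit", "legit"]

model = train_naive_bayes(rows, labels)
print(classify(model, ("high", "foreign")))  # fraud
print(classify(model, ("low", "domestic")))  # legit
```

The "naive" part is the independence assumption: each feature contributes its own probability term regardless of the others.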

Bayesian networks can be trained very efficiently in a supervised learning setting and use probability to represent uncertainty about the relationships that have been learnt, as opposed to variations on maximum likelihood estimation. Where conventional neural networks try to find a single set of weights (the process of learning) that best fits the input data, Bayesian learning places a prior probability distribution over the network weights to express what the true relationship might be. One study compared ANNs and Bayesian Belief Network algorithms in fraud detection and found that Bayesian Neural Networks, although slower, were in fact more accurate than ANNs alone.

In fact, many believe Bayesian methods to be highly effective on real-world data sets as they offer better predictive accuracy. This is supported by research which concluded that Bayesian Neural Networks were considerably more accurate at detecting credit card transactional fraud than the Naive Bayesian Classifier.

The Data

The following table compares the research findings to highlight which combination of models provides the highest prediction accuracy.

Study	Method/Technique	Application	Prediction Accuracy (%)

Aleskerov et al (1997)	ANNs	Credit Card	n/a	n/a	n/a
Bell and Carcello (2000)	Statistical	Financial Reporting	55	96	87
Brause et al (1999)	Data mining and ANNs	Credit Card	n/a	n/a	n/a
Bolton and Hand (2002)	Clustering	Credit Card	n/a	n/a	n/a
Calderon and Green (1994)	Statistical	Financial Reporting	20	89	n/a
Dorronsoro et al (1997)	ANNs	Credit Card	n/a	n/a	n/a
Ezawa and Norton (1996)	BNNs	Credit Card	n/a	n/a	n/a
Ghosh and Reilly (1994)	ANNs	Credit Card	n/a	n/a	n/a
Green and Choi (1997)	ANNs	Financial Reporting	68	74	72
Leonard (1995)	Expert System	Credit Card	n/a	n/a	n/a
Lin et al (2003)	FNNs	Financial Reporting	35	86	76
Quah and Sriganesh (2007)	ANNs (Self-Organising Maps)	Credit Card	n/a	n/a	n/a
Zaslavsky and Strizhak (2006)	ANNs	Credit Card	n/a	n/a	n/a

Summary of the most notable investigations into the use of Artificial Intelligence to mitigate fraud.

The greatest challenge when talking about artificial intelligence/machine learning is actually in understanding what data sets we are looking at, and what model/combination of models to apply. Amazon’s Machine Learning offering is one example of an automated process which analyses the data and automatically selects the best model to use in the scenario. Other big players who have similar offerings are IBM Watson, Google and Microsoft.

Conclusion

Provenir’s clients are continually looking at new and innovative ways to improve their risk decisioning. Traditional banks offering consumer, SME and commercial loans and credit, auto lenders, payment providers and fintech companies are using Provenir technology to help them make faster and better decisions about potential fraud. Integrating artificial intelligence/machine learning capabilities into the risk decisioning process can increase the organization’s ability to accurately assess the level of risk in order to detect and prevent fraud.

Provenir provides model integration adaptors for machine learning models, including Amazon Machine Learning (AML), that can automatically listen for and label business-defined events, calculate attributes and update machine learning models. By combining Provenir technology with machine learning, organizations can increase both the efficiency and predictive accuracy of their risk decisioning.
