Near Miss
Near Miss Benford's Law Fraud detectionDownloads First digits lawSite Map
Home Page | Population | Sampling | Fraud | Other | Tutorial | Macros
Near Miss

Command Overview

The near miss command is used to identify invoice pairs, which, although not identical, are similar enough to potentially merit further review to determine if they are in fact duplicates. The command supports four sources of data for testing:

  1. Data file in tab separated value format
  2. WorkSheet
  3. Selected range within a worksheet
  4. Database (e.g. MS-Access, MySQL, etc)

 

Syntax

The command is entered on the command line as the command name ("nearmiss", or just "ne") followed by one or more command parameters. Each command parameter consists of an argument name, immediately followed by an equal sign ("=") and then immediately followed by a value. The value may be enclosed with quotation marks, and must be enclosed by quotation marks if the value consists of any embedded spaces.

An example of a command parameter to specify that the file name to be processed is test.txt is entered as "file=test.txt" (without the quotations). Each argument on the command line must consist of only valid parameter names. If a required parameter name is missing, then the command line will be colored yellow, an error message will be displayed on the Excel status bar and an audible tone will be sounded.

The following is a listing of all parameter values for the nearmiss command:

Argument name Description Example
Common Parameters (Required for all)
vendno the vendor or supplier identifier VendorNo
invno Invoice number Invno
invdate Invoice date InvDate
datetol Date tolerance - the maximum number of days difference which can still be considered as potential duplicates 30
invamt the amount of the invoice "Invoice Amount"
recap the worksheet where the results are to be stored $Results
File Data Source (ds=file)
file the name of the file to be processed \test\invoice.txt
Worksheet Data Source (ds=rng)
sheet the name of the worksheet to be processed CMAData
ulc upper left hand corner of the range to be processed a2
Worksheet Selected Range (ds=sel)
sheet the name of the worksheet to be processed CMAData
Database (ds=db)
sql the SQL query to obtain the data to be used as a source "select vendno, invno, invdate, invamt from invoices"
conn the connection string for the database (varies by type of database) "c:\ms\data\cmadata.mdb"

 

 

An example

The command to be perform a test for near miss invoices on the file named \test\invoice\tran.txt using the amount column named "Paid Amount" and the invoice date column named "Invoice Date", storing the results on the worksheet named "Potential Duplicates " is as follows:

nearmiss file=\test\invoice\tran.txt date datecol="Invoice date " amtcol = "Invoice Amount" recap="Potential Duplicates" datetol=30 simtol=4

 

 

Output from the processing will include a text listing of invoices which, although not exact duplicates, may be invoices for the same goods or services

An example of the command bar is shown below:

This results in a report of "near misses", and example of which is shown below:

 

 

 

Main Form

Complete Auditor resources
Links | Privacy Policy | FAQ | Downloads | Trademarks
Web Page last updated on 11-10-2007
© EZ-R Stats, LLC 2005-2007