A68 HOW REAL ARE YOUR SURVEY RESPONDENTS? IDENTIFYING FRAUDULENT RESPONDENTS IN ONLINE SURVEYS – A CASE EXAMPLE IN INFLAMMATORY BOWEL DISEASE (IBD). (7th March 2023)
- Record Type:
- Journal Article
- Title:
- A68 HOW REAL ARE YOUR SURVEY RESPONDENTS? IDENTIFYING FRAUDULENT RESPONDENTS IN ONLINE SURVEYS – A CASE EXAMPLE IN INFLAMMATORY BOWEL DISEASE (IBD). (7th March 2023)
- Main Title:
- A68 HOW REAL ARE YOUR SURVEY RESPONDENTS? IDENTIFYING FRAUDULENT RESPONDENTS IN ONLINE SURVEYS – A CASE EXAMPLE IN INFLAMMATORY BOWEL DISEASE (IBD)
- Authors:
- MacDonald, K V
Nguyen, G C
Barker, K L
Harris, M
Sewitch, M J
Marshall, D A - Abstract:
- Abstract: Background: Social media and online surveys are commonly used to recruit and collect data from patients and physicians about GI diseases – they are efficient, convenient, and less resource intensive compared to traditional recruitment approaches and paper surveys. However, online data fraud is increasing and difficult to identify. Online data fraud can include intentional duplicate responses/straight-lining/inattention, bots/malicious software, and professional survey takers who provide fraudulent responses to meet study eligibility. Purpose: 1) Illustrate challenges of identifying fraudulent respondents through an algorithm and verification process we developed for our survey in IBD. 2) Demonstrate potential impact of fraudulent respondents on data and results. Method: Online survey of Canadian adults (>18 years) with IBD about healthcare processes for managing IBD hosted using Qualtrics. Recruitment was done in clinic and online (mailing lists, social media). A $25 giftcard was offered for participation due to low response after 3 months in field, after which a large influx of 'respondents' occurred. Most were fraudulent although not obvious at first. To mitigate further fraudulent responses, we added the following to our survey: reCAPTCHA score, repeated question (year of IBD diagnosis), duplicate ID score, fraud score and honeypot question. Our algorithm to identify fraudulent responses included 13 binary 'red flag' variables: age <18 years, year of diagnosis <Abstract: Background: Social media and online surveys are commonly used to recruit and collect data from patients and physicians about GI diseases – they are efficient, convenient, and less resource intensive compared to traditional recruitment approaches and paper surveys. However, online data fraud is increasing and difficult to identify. Online data fraud can include intentional duplicate responses/straight-lining/inattention, bots/malicious software, and professional survey takers who provide fraudulent responses to meet study eligibility. Purpose: 1) Illustrate challenges of identifying fraudulent respondents through an algorithm and verification process we developed for our survey in IBD. 2) Demonstrate potential impact of fraudulent respondents on data and results. Method: Online survey of Canadian adults (>18 years) with IBD about healthcare processes for managing IBD hosted using Qualtrics. Recruitment was done in clinic and online (mailing lists, social media). A $25 giftcard was offered for participation due to low response after 3 months in field, after which a large influx of 'respondents' occurred. Most were fraudulent although not obvious at first. To mitigate further fraudulent responses, we added the following to our survey: reCAPTCHA score, repeated question (year of IBD diagnosis), duplicate ID score, fraud score and honeypot question. Our algorithm to identify fraudulent responses included 13 binary 'red flag' variables: age <18 years, year of diagnosis < year of birth, 2 different year of diagnosis, invalid postal code, survey duration <10 minutes, survey duration 10-15 minutes, suspicious comments for open text questions (x2), duplicate email, suspicious email, duplicate ID score ≥30, fraud score ≥30, and failed honeypot question. These variables were used to generate a fraudulent response score (range: 0-13; 13=most likely fraudulent). 'Respondents' with scores >3 were categorized as likely fraudulent. Respondents with scores ≤3 were reviewed individually. Respondents flagged as likely real or unsure were emailed and asked to verify their age; those who correctly verified age were considered likely real and included in the final sample. Result(s): Of the 4334 'respondents' who started the survey, based on fraudulent response score we identified 75% (n=3258) as likely fraudulent, 17% (n=727) as unsure and 8% (n=349) as likely real. After age verification, 76% (n=3297) were considered likely fraudulent, 14% (n=592) remained unsure, 10% (n=442) were considered likely real, and <1% (n=3) were duplicates of likely real respondents. Conclusion(s): Despite convenience, social media and online surveys can be prone to fraudulent responses, especially when incentives are offered. We developed an algorithm and verification process to identify fraudulent responses using an IBD survey example. Given that only 10% of the full sample was considered likely real, researchers using social media and online surveys should carefully examine data for fraudulent responses and apply strategies to mitigate risks. Please acknowledge all funding agencies by checking the applicable boxes below: CCC Disclosure of Interest: None Declared … (more)
- Is Part Of:
- Journal of the Canadian Association of Gastroenterology. Volume 6(2023)Supplement 1
- Journal:
- Journal of the Canadian Association of Gastroenterology
- Issue:
- Volume 6(2023)Supplement 1
- Issue Display:
- Volume 6, Issue 1 (2023)
- Year:
- 2023
- Volume:
- 6
- Issue:
- 1
- Issue Sort Value:
- 2023-0006-0001-0000
- Page Start:
- 37
- Page End:
- 38
- Publication Date:
- 2023-03-07
- Subjects:
- Gastroenterology -- Periodicals
616.33005 - Journal URLs:
- https://academic.oup.com/jcag ↗
http://www.oxfordjournals.org/ ↗ - DOI:
- 10.1093/jcag/gwac036.068 ↗
- Languages:
- English
- ISSNs:
- 2515-2084
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 26095.xml