The objective of this study was to harness neural networks to solve the rampant problem of non-accredited online pharmacies performing unapproved sales of drugs. A BERT model and MobileNetV2 model were used to classify the text and images of online pharmacies as accredited and non-accredited. Both models achieved moderately high accuracy on the validation datasets, with the BERT model at 90% and the MobileNetV2 model at 87% accuracy. Future work using a MultiModal model is recommended to combine the image and text models for improved overall accuracy.
The National Association of Boards of Pharmacy claims that 96% of all online pharmacies are illegitimate. Currently, the process of verifying legitimacy is manual and slow. In order to flag suspicious online pharmacies that may be selling unsafe medications, it is necessary to create an automated process that can detect whether an online pharmacy is accredited or not. There has yet to be a Deep Learning study that identifies non-accredited pharmaceutical websites. Our study is the first to train complex neural networks to detect accredited and non-accredited pharmaceutical websites.
Our custom dataset includes web-scraped content from the home pages of 72 known accredited and non-accredited pharmacies.
Below are the notebooks, datasets, models, and results:
-
Webscraping online pharmacies, parsing data, & building datasets
-
Literature & Reporting:
MobileNetV2 Model Performance
We recommend concatenating the image and text models with a MultliModal Model for a more comprehensive approach to classifying pharmacy website content as a whole.