Tracking counterfeiting on the Web with Python and ML

Valerio Cosentino
Software Engineer at Smart Protection
PyConEs, October 3rd, 2021
[1] https://www.cbc.ca/news/business/marketplace-counterfeits-fakes-online-shopping-1.5470639
[2] https://apnews.com/press-release/pr-businesswire/ef15478fa38649b5ba29b434c8e87c94
[3] https://www.cnbc.com/2020/03/02/shop-safe-act-2020-cracks-down-on-counterfeits-on-ecommerce-platforms.html
Buyer Marketplace Brand
[1] https://arstechnica.com/tech-policy/2021/05/amazon-seized-and-destroyed-2-million-counterfeit-products-in-2020/
[2] https://www.ebay.com/help/policies/prohibited-restricted-items/counterfeit-item-policy?id=4276#section1
[3] https://www.aliexpress.com/buyerprotection/how_to_be_eligible.html
[4] https://ec.europa.eu/growth/industry/policy/intellectual-property/enforcement/memorandum-understanding-sale-counterfeit-goods-internet_en
How can a brand know if its products are being counterfeited on the Web?
search, extract, evaluate, get crazy
Can Python and ML help?
SEARCH → EXTRACT → ANALYSIS → REPORT
queries → marketplace → product URLs
How to write effective queries?
How to set the frequency of queries?
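One possible starting point for the first question is to generate queries by combining brand names with product terms and modifiers that often appear in suspicious listings. The sketch below is only an illustration: the brand "Acme", the term lists, and the `build_queries` helper are all hypothetical and not taken from the talk.

```python
from itertools import product


def build_queries(brands, products, modifiers=("",)):
    """Combine brand names, product terms, and optional modifiers into queries."""
    queries = []
    seen = set()
    for brand, item, modifier in product(brands, products, modifiers):
        # Join the non-empty parts and normalise case and whitespace.
        parts = (brand, item, modifier)
        query = " ".join(p.strip().lower() for p in parts if p.strip())
        if query and query not in seen:
            seen.add(query)
            queries.append(query)
    return queries


# Hypothetical inputs, for illustration only.
queries = build_queries(
    brands=["Acme"],
    products=["sneakers", "backpack"],
    modifiers=["", "replica", "cheap"],
)
```

The empty modifier keeps the plain "brand + product" query in the set, so broad and narrow queries are generated together.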
SEARCH
queries → queue → search lambda (scraping, API calls) → queue (product URLs)
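The queue-driven search step can be sketched locally with the standard library. This is a minimal sketch under assumptions: `queue.Queue` stands in for the managed queue, `fake_search` is a hypothetical placeholder for scraping or a marketplace API call, and the `marketplace.example` URLs are invented.

```python
import queue


def fake_search(query):
    """Hypothetical stand-in for scraping or a marketplace API call."""
    slug = query.replace(" ", "-")
    return [f"https://marketplace.example/item/{slug}-{i}" for i in range(2)]


def run_search_worker(query_queue, url_queue, search=fake_search):
    """Drain the query queue and push each newly found product URL downstream."""
    seen = set()
    while True:
        try:
            query = query_queue.get_nowait()
        except queue.Empty:
            break  # no more queries to process
        for url in search(query):
            if url not in seen:  # avoid enqueuing the same listing twice
                seen.add(url)
                url_queue.put(url)
    return len(seen)


query_queue = queue.Queue()
url_queue = queue.Queue()
for q in ["acme sneakers", "acme backpack"]:
    query_queue.put(q)

found = run_search_worker(query_queue, url_queue)
```

Keeping the search function as a parameter lets the same worker wrap either a scraper or an API client.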
SEARCH
queue (product URLs) → extract lambda → Dynamo (products info)
EXTRACT
mandatory fields
optional fields
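The split between mandatory and optional fields could be modelled as follows. The slides do not list the actual fields, so the field names here (`url`, `title`, `price`, `seller`, `image_url`) and the `parse_product` helper are assumptions chosen for illustration.

```python
from dataclasses import dataclass
from typing import Optional

# Assumed mandatory fields; the talk does not enumerate them.
MANDATORY_FIELDS = ("url", "title", "price")


@dataclass
class ProductInfo:
    url: str
    title: str
    price: float
    seller: Optional[str] = None      # optional field
    image_url: Optional[str] = None   # optional field


def parse_product(raw):
    """Build a ProductInfo from a raw dict, failing if a mandatory field is missing."""
    missing = [name for name in MANDATORY_FIELDS if raw.get(name) in (None, "")]
    if missing:
        raise ValueError(f"missing mandatory fields: {', '.join(missing)}")
    return ProductInfo(
        url=raw["url"],
        title=raw["title"].strip(),
        price=float(raw["price"]),
        seller=raw.get("seller"),
        image_url=raw.get("image_url"),
    )


item = parse_product(
    {"url": "https://marketplace.example/item/1", "title": " Acme sneakers ", "price": "19.9"}
)
```

Rejecting records that lack a mandatory field at extraction time keeps incomplete listings out of the later analysis step.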
ANALYSIS
contents: Dynamo → transform → Aurora
ANALYSIS
What is relevant content?
What is legal/illegal content?
Relevance Detection
rule-based, manual, text analysis, feature analysis
manual, text analysis, image features
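A rule-based text check is one simple way to approximate these two questions. The sketch below is not the speaker's method: the term sets, the price-ratio threshold, and both helper functions are hypothetical, and the flags it produces are hints for manual review rather than a verdict.

```python
import re


def tokenize(text):
    """Lowercase the text and return its alphanumeric tokens as a set."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def is_relevant(title, brand_terms, product_terms):
    """Relevant if the title mentions the brand and a monitored product term."""
    tokens = tokenize(title)
    return bool(tokens & brand_terms) and bool(tokens & product_terms)


def suspicion_flags(title, price, reference_price, suspicious_terms, min_price_ratio=0.5):
    """Return human-readable reasons why a listing may deserve manual review."""
    flags = []
    hits = sorted(tokenize(title) & suspicious_terms)
    if hits:
        flags.append("suspicious terms: " + ", ".join(hits))
    if price < reference_price * min_price_ratio:
        flags.append("price far below reference")
    return flags


# Hypothetical listing, for illustration only.
title = "ACME sneakers replica, best price"
relevant = is_relevant(title, {"acme"}, {"sneakers", "backpack"})
flags = suspicion_flags(title, 20.0, 100.0, {"replica", "fake", "copy"})
```

Text rules like these are cheap to run on every listing; image features would need a separate model and are not covered here.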
[1] https://www.amazon.com/report/infringement
[2] https://sell.aliexpress.com/zh/__pc/77Y4QdcvjD.htm
[3] https://pages.ebay.com/seller-center/listing-and-marketing/verified-rights-owner-program.html
[4] https://merchant.wish.com/brand-protection/brand-violation-report
Fake product URLs → Takedown
REPORT
Takeaways
● Counterfeiting is a growing problem
● Python and Machine Learning can help
● Manual intervention is still needed
● The approach can be applied to other scenarios
What’s next?
● More data, more questions to answer
○ Evolutionary analysis
○ Comparative analysis
Q&A