Coding Challenge - Naver Scraping HTML

The document outlines a coding challenge to create a scalable and undetectable API for scraping product details from Naver SmartStore. The API must extract JSON data from the global variable __PRELOADED_STATE__, implement anti-detection techniques, and meet specific performance criteria. Deliverables include a hosted API link, source code, and a comprehensive README with setup and usage instructions.

Uploaded by

Hafiz Maulana Azhar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

151 views3 pages

Coding Challenge - Naver Scraping HTML

Uploaded by

Hafiz Maulana Azhar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

Coding Challenge: Build a Scalable and Undetectable API for

Scraping Naver SmartStore Product Details

Objective
Your challenge is to build a scalable and undetectable API that scrapes product detail data
from smartstore.naver.com. The scraper must retrieve JSON data from the page’s global
variable __PRELOADED_STATE__.

Your scraper should be able to bypass anti-scraping mechanisms and return accurate data
in real time.

URL Schema to Target

Naver SmartStore product detail pages typically follow this structure:
https://smartstore.naver.com/{store_name}/products/{product_id}
Example:
https://smartstore.naver.com/rainbows9030/products/11102379008
From this page, the scraper must capture:
• JSON data of global variable __PRELOADED_STATE__

Requirements:
1. Scraping Logic
• Extract the raw JSON data of global variable __PRELOADED_STATE__
• Techniques to avoid detection must be implemented:
o Rotate Fingerprints and IPs
o Implement request throttling and random delays
2. API Development
• Build a REST API with an endpoint to retrieve product detail by product URL:
Example:
o GET https://your-
api.com/naver?productUrl=https://smartstore.naver.com/minibe
ans/products/8768399445

3. Tech Stack
• Use JavaScript for development.
• TypeScript is a strong plus.
4. Hosting
• Host your API via NGROK (or any publicly accessible tunnel).
• Share the link so we can test the API remotely.
• Provide a clear and complete README for local setup and usage instructions.

Test Success Criteria

To pass the test, your API must meet all of the following:

Successfully retrieve data for 1000+ products.

Maintain average latency ≤ 6 seconds per request.

Maintain error rate ≤ 5%.

Stay stable and responsive for 1 hour of continuous testing

Scraping & Proxy Notes

• Proxy: 6n8xhsmh.as.thordata.net:9999:td-customer-mrscraperTrial-country-
kr:P3nNRQ8C2
• You are free to search for free or trial proxy providers for this test.

Deliverables
• Hosted API link (e.g., Ngrok).
• Source code (preferably in a GitHub repo).
• README.md with:
o Setup instructions.
o Run/test instructions.
o Scraper explanation (evasion strategies, proxy usage, etc.).
o Example usage of your API.

Internship Assessment
No ratings yet
Internship Assessment
3 pages
ProblemStatement For t4 Track
No ratings yet
ProblemStatement For t4 Track
4 pages
Software Engineer - Backend Test
No ratings yet
Software Engineer - Backend Test
2 pages
Myrecent Projects
No ratings yet
Myrecent Projects
1 page
Context
No ratings yet
Context
8 pages
Developer's Guide to Product API
No ratings yet
Developer's Guide to Product API
2 pages
2.back-End Task
No ratings yet
2.back-End Task
3 pages
Inlab
No ratings yet
Inlab
2 pages
Problem Statement
No ratings yet
Problem Statement
7 pages
AFFORDMED® Campus Hiring Evaluation - Full Stack: Disclaimer
No ratings yet
AFFORDMED® Campus Hiring Evaluation - Full Stack: Disclaimer
8 pages
3hrs Backend Task
No ratings yet
3hrs Backend Task
3 pages
Product Info Scrapper
No ratings yet
Product Info Scrapper
18 pages
Frontend Dev Task: Dashboard Design
No ratings yet
Frontend Dev Task: Dashboard Design
2 pages
Python Microservices Guide
No ratings yet
Python Microservices Guide
22 pages
Assessment Task - Carbon38
No ratings yet
Assessment Task - Carbon38
5 pages
Tasks (Partially)
No ratings yet
Tasks (Partially)
5 pages
Scrapingquickstart
No ratings yet
Scrapingquickstart
32 pages
Ecom Research Paper
No ratings yet
Ecom Research Paper
4 pages
Sirclo Api: Creating Access Token General Request & Response Format
No ratings yet
Sirclo Api: Creating Access Token General Request & Response Format
20 pages
Shopping App Dev Assignment
No ratings yet
Shopping App Dev Assignment
6 pages
Oet M52 Ws 5 Ii 6 IGFf KPJH DH BRK
No ratings yet
Oet M52 Ws 5 Ii 6 IGFf KPJH DH BRK
10 pages
Api Design Course
No ratings yet
Api Design Course
11 pages
ShopEZ - E-Commerce Application - Report
No ratings yet
ShopEZ - E-Commerce Application - Report
14 pages
E Commerce
No ratings yet
E Commerce
5 pages
Complete Shopping API Spec
No ratings yet
Complete Shopping API Spec
4 pages
Python Programming
No ratings yet
Python Programming
11 pages
API Documentation Complete
No ratings yet
API Documentation Complete
12 pages
I Want Fully Seo Optimised Website
No ratings yet
I Want Fully Seo Optimised Website
1 page
Wse Internal 1
No ratings yet
Wse Internal 1
20 pages
Backend Assignment
No ratings yet
Backend Assignment
3 pages
Apify API v2: RESTful Access Guide
No ratings yet
Apify API v2: RESTful Access Guide
105 pages
How To Scrape Product Data From Amazon - A Complete Guide - Oxylabs
No ratings yet
How To Scrape Product Data From Amazon - A Complete Guide - Oxylabs
19 pages
Headless Questions - Sonali - Updated
No ratings yet
Headless Questions - Sonali - Updated
30 pages
Ip Mkeka 2
No ratings yet
Ip Mkeka 2
19 pages
Location Based Offer Notifier SRS
No ratings yet
Location Based Offer Notifier SRS
2 pages
ShopZ E-Commerce Application
No ratings yet
ShopZ E-Commerce Application
12 pages
Ecom RSP FINAL
No ratings yet
Ecom RSP FINAL
5 pages
B - 2 CIE Web Scraping
No ratings yet
B - 2 CIE Web Scraping
8 pages
React Technical Task
No ratings yet
React Technical Task
3 pages
Pruebatecnica FrontendJR
No ratings yet
Pruebatecnica FrontendJR
2 pages
Flutter REST API Guide for Devs
No ratings yet
Flutter REST API Guide for Devs
13 pages
Ecom Research Paper
No ratings yet
Ecom Research Paper
4 pages
API Descriptive Sheet Deb
No ratings yet
API Descriptive Sheet Deb
2 pages
Assignment Document: Frontend E-Commerce Website
No ratings yet
Assignment Document: Frontend E-Commerce Website
3 pages
AI E Commerce Chatbot Report
No ratings yet
AI E Commerce Chatbot Report
3 pages
Frontend Requirements and Code
No ratings yet
Frontend Requirements and Code
5 pages
Custom Api For Sorsu Dms
No ratings yet
Custom Api For Sorsu Dms
8 pages
Evaluation Task - Data Visualization
No ratings yet
Evaluation Task - Data Visualization
2 pages
Cabico Tan
No ratings yet
Cabico Tan
11 pages
Smart Refrigerator
No ratings yet
Smart Refrigerator
6 pages
ERP API Guide for Developers
No ratings yet
ERP API Guide for Developers
55 pages
Ecommerce Blueprint
No ratings yet
Ecommerce Blueprint
5 pages
Project
No ratings yet
Project
25 pages
Day 2 - OWASP Juice Shop - Introduction
No ratings yet
Day 2 - OWASP Juice Shop - Introduction
3 pages
From Web To File
No ratings yet
From Web To File
5 pages
61 Rest
No ratings yet
61 Rest
4 pages
Lab 17
No ratings yet
Lab 17
4 pages
Data Engineering Concepts #2 - Sending Data Using An API - by Bar Dadon - Dev Genius
No ratings yet
Data Engineering Concepts #2 - Sending Data Using An API - by Bar Dadon - Dev Genius
14 pages
E-Commerce Web App Practical Report
No ratings yet
E-Commerce Web App Practical Report
4 pages
Video Intercom / Access Control / Alarm: Products and Solutions
No ratings yet
Video Intercom / Access Control / Alarm: Products and Solutions
56 pages
Email Engine 700 PDF
No ratings yet
Email Engine 700 PDF
328 pages
Manual Fanuc Ladder Iii PDF
No ratings yet
Manual Fanuc Ladder Iii PDF
791 pages
Manual Kick Tolerance Guide
100% (1)
Manual Kick Tolerance Guide
3 pages
Syllabus DBI202
No ratings yet
Syllabus DBI202
8 pages
LectroPol-5 Brochure English PDF
No ratings yet
LectroPol-5 Brochure English PDF
4 pages
SAP Sales & Distribution Guide
100% (2)
SAP Sales & Distribution Guide
2 pages
VCAEI6 - RFA - MOS - ME - GS - 001 (Gas Installation) - BCE
No ratings yet
VCAEI6 - RFA - MOS - ME - GS - 001 (Gas Installation) - BCE
8 pages
2024 CF Moto - 450sr S - SM
100% (3)
2024 CF Moto - 450sr S - SM
219 pages
JIMS 5518 Instructions
No ratings yet
JIMS 5518 Instructions
2 pages
React Assignment
No ratings yet
React Assignment
5 pages
sf08-22 24oct2023 16-47
No ratings yet
sf08-22 24oct2023 16-47
4 pages
Globalization Empowers Civilization0330
No ratings yet
Globalization Empowers Civilization0330
45 pages
Mcp3008 Spi Adc
No ratings yet
Mcp3008 Spi Adc
9 pages
Liu Et Al 2024 A Matter of Time Publication Dates in Scopus
No ratings yet
Liu Et Al 2024 A Matter of Time Publication Dates in Scopus
10 pages
13 - Factory Lighting, Flood Lighting, Street Lighting
No ratings yet
13 - Factory Lighting, Flood Lighting, Street Lighting
11 pages
IFHE-Distance BBA Prospectus - July 2024
No ratings yet
IFHE-Distance BBA Prospectus - July 2024
8 pages
Banking & CSR Study Proposal
No ratings yet
Banking & CSR Study Proposal
7 pages
Industrial RS485 Temp Sensors
No ratings yet
Industrial RS485 Temp Sensors
2 pages
Gramarly - Google Search
No ratings yet
Gramarly - Google Search
2 pages
Cultivating Cordyceps Militaris in Kolkata's Humid Climate - A Comprehensive Guide
No ratings yet
Cultivating Cordyceps Militaris in Kolkata's Humid Climate - A Comprehensive Guide
4 pages
Introduction To Unified Modeling Language (UML)
No ratings yet
Introduction To Unified Modeling Language (UML)
27 pages
Course - Handout - EC101 - Basic Electrical and Electronic Engineering
No ratings yet
Course - Handout - EC101 - Basic Electrical and Electronic Engineering
3 pages
Linux Interview Questions Part4
No ratings yet
Linux Interview Questions Part4
3 pages
Causality-Inspired Taxonomy For Explainable Artificial Intelligence
No ratings yet
Causality-Inspired Taxonomy For Explainable Artificial Intelligence
43 pages
Telehealth Access in Nepal Pandemic
No ratings yet
Telehealth Access in Nepal Pandemic
124 pages
Formal Language and Automata Thorey FLAT
No ratings yet
Formal Language and Automata Thorey FLAT
24 pages
XVR301-16G3 16-Channel DVR Specs
No ratings yet
XVR301-16G3 16-Channel DVR Specs
4 pages
DR - Srinivas Bachu
No ratings yet
DR - Srinivas Bachu
8 pages
Glasses Direct Competitor Analysis
No ratings yet
Glasses Direct Competitor Analysis
4 pages

Coding Challenge - Naver Scraping HTML

Uploaded by

Coding Challenge - Naver Scraping HTML

Uploaded by

Coding Challenge: Build a Scalable and Undetectable API for

Scraping Naver SmartStore Product Details

URL Schema to Target

Test Success Criteria

Successfully retrieve data for 1000+ products.

Maintain average latency ≤ 6 seconds per request.

Maintain error rate ≤ 5%.

Stay stable and responsive for 1 hour of continuous testing

Scraping & Proxy Notes

You might also like