web scraping
dla elink.cat / zdalnie
Tagi: html skrypt web scraping web harvesting
Objectives
The objective of the project is to create a script to collect all product info from a website (web name to be delivered) and store it in a database or excel file.
In Scope
- One shot script
- To store captured data
Out of Scope
- Recurrent runs of this script
REQUIREMNENTS
Requirement 1. Data harvesting
Item 1.1
Priority: High
We need to collect all product data from (web name to be delivered)and store it.
The fields to be collected are listed below:
1. Category1
2. Category2
3. Category3
4. Image
5. Tittle
6. Main Link
7. Description
8. Images Link
9. Video Link
10. PDF Link
11. Other products of this company link
12. Premium badge? (Y/N)
13. Company Name
14. Company Logo & Description
15. Company Link
16. Company digital catalog
Requirement 2. To store information collected
Priority: High
All data collected in previous requirement needs to be stored in a .csv file or, if collected data exceeds this format, stored in a MySQL database with all previously detalled fields.
Jeżeli ta oferta pracy nie jest zgodna z regulaminem, powiadom nas!
Poleć znajomemu
Wyświetlona: 4424 razy