Please see http://www.opad.com/foscarini-caboche-medium-suspension-light.html as an example. We need to download the color swatches (there are two on this page - Transparent and Golden-Yellow). We need to also need to download all of the item images. It shows that there are 19 of them, however one of these images is actually a schematic image (i.e. the 3rd type of image we need to download). From the “view full size image” view, the 3rd to last image is the schematic image that I am referring to. Lastly, most item pages contain 1 or more pdf files. They can be found by searching for the pdf icon close to the bottom of the page - all pdf’s need to be downloaded as well.
Please have a look at the link above and let me know if this work makes sense. We want to scrape all images and all pdf’s from every item page on olighting.com, as well as opad.com. When downloading, we would want 4 folders for each site: 1) item images, 2) color swatch images, 3) schematic images, and 4) pdf files. We can provide these folders on dropbox.
We are seeing a problem that needs to be figured out in order for this work to be of any value to me. for some reason, the names of the images is not matching the names of the images that I uploaded to these sites in the first place. the images should follow this naming structure: productid, productid-image1, productid-image2, etc. The schematic should follow this naming structure: productid-inset (there will only be one schematic image per product), and the color swatches should follow this naming format: productid-colorswatchname, productid-nextcolorswatchname, etc
The images should follow this naming structure: productid-image1, productid-image2, etc. The schematic should follow this naming structure: productid-inset, and the color swatches should follow this naming format: productid-colorswatchname
There will only be 1 schematic image per product. All other file types may have multiple
color swatches are not straightforward because the names are based on the ID and the description of the image. See http://www.opad.com/moooi-random-pendant-light.html - So moooi-random-pendant-light would be moooi-random-pendant-light-white moooi-random-pendant-light-black
we will provide a list of all product page IDs for Opad.com and OLighting.com
Our main concerns are capturing all required files in their full sizes, and naming them correctly
We are also looking for someone to rename these images, swatches, PDFs and schematic files for products going onto a new website through the Magento platform. The files will be renamed with a clear new structure using the information on the CSV. The new ID will be used to populate all referencea to the files IE: all of the renamed images, PDF's, schematics, and swatches will be referenced on the new CSV.