Skip to main content

Hey guys I’m having trouble to scrap data from an HTML table and to populate a google sheet.
 

I’ve created line item based on the presence of the </td></tr><tr><td> between the different line. And now all I have to do is to split the data of each cell using </td><td> as separator.

So here is the data I input
 

Now I have two solutions
 



If I put the value “ALL (as Line_items)” I get this output.
 


Wich Is good but I can’t use it to populate my google sheet.

Or I use “All (as Separate Fields)”
 

And now I get this output
 

Which I can use to populate my google sheet, but I’m loosing the data from column 2 to 9, and I don’t understand why.

 

Any idea how to fix this, or to easily scrap data from an HTML table?

Hi @Rigawel 

Perhaps try replacing the HTML Tags with a value to split by later.
e.g. replace </td><tr> with a pipe |

Then you can remove any HTML tags with another Formatter step.

 

Formatter > Text > Replace

Formatter > Text > Remove HTML Tags

Formatter > Text > Split

 

Another option is to use a Code step to parse the data points.


Hey,

Thank you for your answer, I think I found what caused the problem.

 

I didn’t check all the list and thought Zapier would class the output numericaly.
 

And when I checked,  I understood that the order wasn’t the one I thought it was.


Well spotted, @Rigawel! 🙌

Thanks so much for following up here to share your discovery. I’m so glad you were able to locate that missing field. It sounds like you’re all set for now but please do reach out again if there’s anything else you’d like some assistance on. In the meantime, happy Zapping! ⚡️  


Reply