An API for retrieving webpage HTML in JSON format
It acts as a proxy that formats the response
For example, if the webpage HTML is:
<html>
<h3>Webpage Title</h3>
<p>Hello!</p>
</html>
The API will return in JSON format:
{
"tag": "html",
"html": "\n",
"children": [
{
"tag": "body",
"children": [
{
"tag": "h3",
"html": "Webpage Title"
},
{
"tag": "p",
"html": "Hello!"
}
],
"html": "\n"
}
]
}
Usage:
Simply make a GET request to:
kolb.dev/scrape/SOMEWEBSITE
Example:
kolb.dev/scrape/https://kolb.dev