

You’ll absolutely want to audit where your forms are embedded throughout the site.ĭisclosure: Martech Zone is using its affiliate links in this article. While this is a very specific application, it’s an incredibly useful one as you’re working with large sites. This does take a little longer to crawl the site, of course, but you’ll get forms that are rendered client-side by JavaScript as well as forms that are inserted server-side. Within Configuration > Spider, you can go to the Rendering tab and enable this. Screaming Frog SEO Spider Javascript RenderingĪnother great option of Screaming Frog is that you aren’t limited to the HTML in the page, you can render any JavaScript that’s going to insert forms within your site. The platform even has a nice method to test some URLs against the rules to ensure it works properly before you crawl your site. You can also use this to skip crawling unnecessary paths like tags by adding: martech.zone/tag/.* Because of this, we enabled the Exclude List Configuration and added the following rule. Crawling these translations isn’t necessary since all the assets and information is based on the core site. Heading Tag Inline Style: ]*?\s+)?style\s*=\s*"(*)" Exclude Subdomains In Your CrawlĪt Martech Zone, we serve the site in multiple languages at different subdomains.To identify where inline styles were used, we scrapted the site with a number of RegEx rules for custom extraction: We’re helping a client right now clean up a site where they used inline styles on the Elementor plugin to customize virtually every element with a page. The XPath query looks for a form with an input with a name of formid, then the extraction saves the value: XPath: Extract Inline Style Tags We apply an XPath rule to capture the id from within the form that’s inserted in the page. We then apply a Regex rule to capture the id from within the script tag that’s inserted in the page: Regex: id: *(.*?)įor the Form ID, the data is in an input tag within the Marketo form: The extraction screen allows for virtually unlimited data collection: Regex, XPath, and CSSPath Extractionįor the MunchkinID, the identifier is located within the form script that’s within the page:

With the tool, open Configuration > Custom > Extraction to identify elements you wish to extract. This comes in extremely useful as we wish to crawl the client’s sites and audit and capture the MunchkinID and FormId values from pages. Screaming Frog SEO Spider: Crawl And ExtractĪ key feature of Screaming Frog SEO Spider is that you can perform custom extractions based on Regex, XPath, or CSSPath specifics. The features extend far beyond optimization for search, though, with one incredibly helpful feature for extracting data from your site as it’s being crawled. The platform is feature-rich and offers hundreds of options for virtually every task you require. Companies often have thousands of pages and hundreds of forms throughout their sites that need to be identified for updating.Ī great tool for this is Screaming Frog’s SEO Spider… perhaps the most popular platform in the SEO market for crawling, auditing, and extracting data from a site. With an enterprise marketing automation platform like Marketo, forms are the entry point of data throughout sites and landing pages. Register a Google Account Without Gmail | Martech Zone
