Onboarding is Dead
Evan Boyle
LLMs have killed onboarding as we know it. Luckily, LLMs have reinvented it as well. We know that every additional step and manual input in an onboarding flow leads to falloff. No more.
Product onboarding often feels like dealing with the IRS. Despite having all my data and a deep understanding of their own system, the IRS still demands that I do all the legwork and provide numbers. Sometimes they punish my errors as well! Traditional product onboarding is no different.
I have to fill out a tedious form describing my company and select from a tired drop-down informing you about my role, company size, and objectives for using your software? Most of this data is published on my company’s website, or other publicly available sources. The rest of it can be inferred and extrapolated from it.
There is no excuse for collecting this category of information from your users manually in the year 2024. We have LLMs and LLM-optimized web scraping services that not only support scraping clean markdown ready for chunking and indexing into a vector database but also structured extraction into a predefined schema. Let's learn how to put them to use!
Effortless Onboarding with LLM-Optimized Web Scraping
At Cortex Click, we run the entire flow, soup to nuts, from a single input from the user: their website URL. This configures your catalogs, cortexes (what we call an AI agent specialized for content writing), and scrapes your entire docs and marketing website into our platform to improve the quality of content generation.
Walking through the onboarding flow configures everything necessary to get started with Cortex Click and starts indexing your website in the background.
When you only require a single input, you minimize dropoff. Most people are much better at editing than they are at synthesizing.
Learn more about the Cortex Click onboarding experience.
How to Utilize LLMs and Web Scraping for Onboarding
Let's look at a practical example of using Firecrawl to scrape a page, and then feeding the input into OpenAI using structured extraction against a JSON schema. With this output, we can prepopulate values for the rest of our onboarding flow.
Scraping Web Pages with Firecrawl
Firecrawl offers an LLM-optimized web scraper that converts output into LLM-friendly markdown. It includes features to remove common site elements like headers, footers, and navigation bars to increase the content density of each document and reduce duplicate elements that harm the quality of Retrieval-Augmented Generation (RAG). For more information, visit the Firecrawl website.
Using OpenAI GPT-4o for Structured Data Extraction
We'll use GPT-4o to extract structured data from web scraping results. This approach allows you to transform unstructured web content into actionable JSON data with a single input. We specify the response_format: { type: "json_object" }
parameter, and include a description of the desired JSON schema in the prompt. We also provide few shot examples to improve output quality.
Automating this process streamlines your onboarding flow, reducing friction and enabling users to get started quickly with minimal effort.
From here, you have a piece of JSON that can be used to kick off the state of your onboarding flow. Users have the option of making edits, but it is just one click to get started using your product.
Putting AI to Use for Go-to-Market
Cortex Click helps technical products build a high-quality content strategy. I wrote this blog post with the help of Cortex Click. This includes a cortex generating first draft, 15 AI refinements, and a few human edits to get the code examples right. The entire process only took one hour.
Cortex Click specializes in helping marketers, engineers, and sales teams generate high-quality content swiftly. Here's how it works:
- Long-Form Content Editor: Create detailed marketing materials including blog posts, developer documentation, and case studies.
- AI Refinements: Make targeted changes to your content with natural language commands.
- Easy ingestion of your knowledge base: Index your marketing and docs websites, github repos, and internal wikis to 10x the quality of generated content.
Cortex Click helps your team create higher quality content in 10% of the time through a combination of adversarial, mutli-agent LLM workflows, grounding in your company's existing data, and human review.
Get Started with Cortex Click:
The future of onboarding is here, and it's effortless, accurate, and zero-configuration. Embrace the power of LLMs and web scraping, and experience the new era of automated onboarding that saves time and minimizes falloff.