Frequently Asked Questions
Is Site Topic free?
You can use it for free for up to 1,000 keywords. After that, there are paid plans available.
How does Site Topic use my data?
We process your Google Search Console (GSC) data through various large language models (LLMs) and software. We also store your GSC data over time. The collective data from all users is utilized to enhance our models, benefiting every user by providing more accurate and comprehensive analyses.
If you would like us to stop using your data, you can disconnect it from Site Topic and we will stop processing it.
How is this different from other methods of topic clustering?
Often when clustering keywords into topics you end up with a lot of topics -- perhaps hundreds! Then you need to manually work out how they each relate to one another. It ends up being a hugely laborious and time-consuming process. Site Topic clusters keywords into topics in a different way, so that you can quickly see topics that can be categorized similarly. For instance, if a site has lots of recipes, Site Topic might show how to group these by ingredient, since that's an intuitive way that many people will search for a recipe. Automating keyword research like this delivers information you can share with a customer without any additional manual work.
Why is it valuable to translate customer searches into the business's language?
Translating customer search terms into the business's language helps connect what customers are looking for with what your business offers. This process makes it easier for different parts of the company, like marketing or product development, to understand customer needs quickly and accurately.
By using terms that are familiar within the business, teams can collaborate more effectively and make data-driven decisions, leading to better products and services.
Essentially, it bridges the gap between what customers want and what the business does, letting you respond more quickly to market trends.
What are the challenges of extracting named entities from search query data?
Extracting named entities from search query data is tough. Queries are short and ambiguous, and they often contain informal language and misspellings with little surrounding context. That brevity means queries often lack the grammatical structure of longer text, making it difficult to identify entities accurately.
Additionally, queries may contain abbreviations, slang, or non-standard terms, which are hard to interpret correctly. The varying contexts in which a term can be used also add complexity, as the same word might represent different entities depending on the query.
What is the purpose of normalizing search query data?
Search queries are often noisy, rife with misspellings or inconsistent formatting. Site Topic cleans and standardizes this data, ensuring that the labels generated are accurate and meaningful, reflecting genuine user interests and behaviors.
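As a rough illustration of what cleaning and standardizing can involve, here is a minimal Python sketch. It is not Site Topic's actual pipeline: the function name `normalize_query` and the specific rules (Unicode folding, case folding, punctuation and whitespace cleanup) are assumptions chosen for the example, and it does not attempt spelling correction.

```python
import re
import unicodedata


def normalize_query(query: str) -> str:
    """Return a cleaned-up version of a raw search query (illustrative only)."""
    # Fold Unicode compatibility forms (e.g. full-width letters) into plain ones.
    text = unicodedata.normalize("NFKC", query)
    # casefold() is a stronger lower() intended for caseless comparison.
    text = text.casefold()
    # Replace punctuation with spaces, keeping word characters and apostrophes.
    text = re.sub(r"[^\w\s']", " ", text)
    # Collapse runs of whitespace and trim the ends.
    return re.sub(r"\s+", " ", text).strip()


raw_queries = ["  Chicken   Soup Recipe ", "chicken-soup recipe", "CHICKEN SOUP RECIPE!"]
normalized = {normalize_query(q) for q in raw_queries}
print(normalized)
```

In this sketch, three differently formatted versions of the same query collapse into a single entry, so they can share one label.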
Can Site Topic identify synonyms in search queries?
Yes, Site Topic identifies keyword synonyms in search queries, recognizing different terms with similar meanings to ensure accurate categorization. It's also common (especially in GSC data) to find a lot of misspellings. Site Topic cleans those up automatically without you needing to lift a finger.
How does Site Topic handle label variations in keywords?
Site Topic identifies variations in keywords, such as different spellings or abbreviations, and normalizes them to ensure accurate labeling and categorization.
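To make the idea concrete, the sketch below maps spelling variants and abbreviations onto one canonical label. It is an assumption-laden toy, not how Site Topic works internally: the label list, the abbreviation table, the `canonical_label` function and the 0.8 similarity cutoff are all hypothetical, and it uses Python's standard `difflib` module for fuzzy matching.

```python
import difflib

# Hypothetical canonical labels and abbreviations, chosen only for illustration.
CANONICAL_LABELS = ["television", "refrigerator", "microwave"]
ABBREVIATIONS = {"tv": "television", "fridge": "refrigerator"}


def canonical_label(term: str, cutoff: float = 0.8) -> str:
    """Map a spelling variant or abbreviation to a canonical label."""
    term = term.strip().lower()
    # Known abbreviations are resolved with a direct lookup.
    if term in ABBREVIATIONS:
        return ABBREVIATIONS[term]
    # Otherwise take the most similar canonical label, if any scores at or
    # above the cutoff.
    matches = difflib.get_close_matches(term, CANONICAL_LABELS, n=1, cutoff=cutoff)
    # Unknown terms are returned unchanged rather than forced into a label.
    return matches[0] if matches else term


for variant in ["TV", "televison", "refridgerator", "sofa"]:
    print(variant, "->", canonical_label(variant))
```

A fixed cutoff is a blunt instrument: set it too low and unrelated terms merge, set it too high and genuine misspellings are missed, which is one reason real systems tend to combine several signals.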
How does Site Topic assist in keyword clustering?
Site Topic groups related keywords into clusters using labels and label groups, helping you understand and organize search queries.
How does Site Topic handle large datasets?
Site Topic is designed to process and analyze large sets of keywords efficiently, providing actionable insights even for extensive keyword lists. It comfortably handles reports of 100 to 100,000 keywords. If you'd like to run Site Topic on much larger sets of keywords, please just ask -- you can reach us at hello@similar.ai.
How does Site Topic generate reports from Google Search Console data?
Site Topic allows you to generate reports by connecting to your Google Search Console account, selecting the desired property, and choosing the market to analyze. It works out what people mean when they search and find your site, then categorizes those keyword interpretations. This process provides insights into your website's performance.
Can Site Topic analyze data from SEMrush?
Yes, Site Topic can automate SEMrush keyword research: you can take a SEMrush keyword export, invest a few minutes of your time and get keyword research that's ready to share with clients.
Can Site Topic provide recommendations for new content?
Yes, Site Topic analyzes search data to identify gaps and opportunities and to find pages that your site is missing. There are a few different ways to use Site Topic to do content gap analysis. One of the simplest for an e-commerce site is to add a product page filter to identify which pages are product pages. The "Missing Pages" report will show topics for which only three or more product pages rank.
How frequently is my Site Topic report updated?
By default, we don't update your Site Topic report, so it is possible it will grow stale over time. You can re-run a report as often as you'd like. Often you'll see that there are new keywords, labels and topics, as well as big shifts in search volume. Re-running a report incorporates the latest data and trends, keeping your insights current and relevant.
Does Site Topic work effectively for non-ecommerce websites?
Yes. Site Topic isn’t limited to ecommerce; it works effectively for informational, navigational, and service-oriented search queries too. It's built on top of a Universal Ontology that Similar.ai has invested years in, to interpret user intent behind keywords, enabling it to form meaningful label groups and content clusters. You can use labels and label groups in a hub-and-spoke model to reorganize your site’s content around clear themes, directly matching user intent.
For longer-tail informational queries that often contain filler or noise, running a larger keyword set (e.g., 10k instead of 1k) typically improves accuracy and clarity.
How can I use Site Topic to discover completely new content topics my site isn't ranking for yet?
Site Topic supports a workflow specifically designed to discover new, untapped content opportunities:
- First, gather keywords from external sources, such as competitors, broad match keyword tools, or platforms like SE Ranking.
- Use the "From upload keywords" flow (instead of the default "Start here" path) and upload your external keyword list(s).
- Select your site's Google Search Console (GSC) property, optionally applying a URL filter.
- Site Topic then analyzes only the uploaded keywords your site currently does not rank for, highlighting completely new topics and potential content gaps.
You can further segment these reports by specific ideal customer profiles (ICPs) to clearly structure your site's information architecture around different customer interests and needs.
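The filtering step in this workflow, keeping only uploaded keywords the site does not currently rank for, can be pictured as a set difference. The Python sketch below is a simplification under stated assumptions: "ranking" is approximated as "appears among the GSC queries", matching is exact after lowercasing and whitespace cleanup, and the function name `keywords_not_ranking` is hypothetical.

```python
def keywords_not_ranking(uploaded: list[str], gsc_queries: list[str]) -> list[str]:
    """Return uploaded keywords that do not appear among the GSC queries."""

    def key(keyword: str) -> str:
        # Compare case-insensitively and ignore extra whitespace.
        return " ".join(keyword.lower().split())

    # Assumption: a keyword counts as "ranking" if GSC reports it at all.
    ranking = {key(q) for q in gsc_queries}
    # Preserve the order of the uploaded list.
    return [kw.strip() for kw in uploaded if key(kw) not in ranking]


uploaded = ["vegan lasagna", "Chicken  Soup", "gluten free bread"]
gsc_queries = ["chicken soup", "beef stew"]
print(keywords_not_ranking(uploaded, gsc_queries))
```

Here "Chicken  Soup" is dropped because the site already appears for "chicken soup", leaving the two keywords that represent potential new topics.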
Can I combine GSC and non-GSC keyword lists, deduplicate them, and analyze that combined set in Site Topic?
Yes. To do this:
- Export your GSC data into a CSV file.
- Upload the GSC keyword CSV into Site Topic, alongside other non-GSC keyword lists.
- Site Topic automatically deduplicates the combined keywords and enriches the data with fresh search volume information, even if you don’t provide impressions or volume data initially.
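If it helps to picture the deduplication step, the following Python sketch merges a GSC export with another keyword list and keeps one entry per keyword. It is only an illustration: the column names `query` and `impressions`, the inline sample data and the `combine` function are assumptions, and the search volume enrichment is left out entirely.

```python
import csv
import io

# Inline stand-ins for two exported files; the column names are assumptions.
GSC_CSV = """query,impressions
chicken soup,1200
Beef Stew,300
"""
OTHER_CSV = """query
beef stew
vegan lasagna
chicken  soup
"""


def read_keywords(csv_text: str) -> list[dict]:
    """Parse CSV text into rows keyed by column name."""
    return list(csv.DictReader(io.StringIO(csv_text)))


def combine(*sources: list[dict]) -> dict[str, dict]:
    """Merge rows from several lists, keeping one entry per keyword."""
    combined: dict[str, dict] = {}
    for rows in sources:
        for row in rows:
            # Lowercase and collapse whitespace so near-identical rows collide.
            key = " ".join(row["query"].lower().split())
            entry = combined.setdefault(key, {"query": key, "impressions": None})
            # Keep impressions when a source provides them; lists without
            # that column simply leave the value empty.
            if row.get("impressions"):
                entry["impressions"] = int(row["impressions"])
    return combined


merged = combine(read_keywords(GSC_CSV), read_keywords(OTHER_CSV))
for entry in merged.values():
    print(entry)
```

Five input rows become three unique keywords, and the keyword that only appears in the non-GSC list carries no impressions value.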
While impressions data can be included, Site Topic always fetches new search volumes. Keep in mind seasonal queries (like "Mother's Day") might show high impressions but low average monthly search volumes, due to their temporary spikes in search popularity.
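A quick calculation shows why a seasonal query can look large in one month and small on average. The monthly figures in this Python sketch are invented purely for the arithmetic and are not real search data.

```python
# Hypothetical monthly search counts for a seasonal query, January to December.
monthly_searches = [500, 500, 500, 2000, 60000, 500, 500, 500, 500, 500, 500, 500]

# Average monthly volume spreads the one-month spike across the whole year.
average = sum(monthly_searches) / len(monthly_searches)
peak = max(monthly_searches)

print(f"peak month: {peak}")
print(f"average monthly volume: {average:.0f}")
print(f"peak is {peak / average:.1f}x the average")
```

With these made-up numbers the peak month is roughly ten times the yearly average, so impressions collected during the spike will look far higher than the average volume suggests.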
Questions, features or bugs?
For any questions or support, please feel free to contact us at hello@similar.ai. Go to our ideas board to suggest a new feature or vote on the ones others have suggested.