Introduction
Shopify sites have unique URL structures and e-commerce patterns that require specialized filtering in Google Search Console. Standard regex filters don't always capture the nuances of how Shopify organizes products, collections, and variants.
This guide provides regex patterns specifically designed for Shopify's architecture, helping you segment traffic, find technical issues, and discover optimization opportunities that generic filters miss.
Understanding Shopify URL Structure
Before diving into filters, understand how Shopify organizes URLs:
-
Products:
/products/product-name -
Collections:
/collections/collection-name -
Pages:
/pages/page-name -
Blogs:
/blogs/blog-name/article-title -
Cart/Checkout:
/cart,/checkout/* -
Account:
/account/* -
Search:
/search?q=keyword
Shopify also generates variant URLs, filter parameters, and pagination that can create indexation challenges.
Product & Collection Filters
All Product Pages
Use case: Analyze your entire product catalog's search performance.
Regex: /products/[^?]+
What it matches: All product pages, excluding those with parameters.
Why it's useful: See which products drive organic traffic, identify top performers, and find products that aren't ranking well.
Pro tip: Combine with position filter (<10) to find products ranking on page 1 that need optimization to reach position 1-3.
Product Variants with Parameters
Use case: Identify when Google indexes product variant URLs with parameters (potential duplicate content issue).
Regex: /products/.*\?variant=
What it matches: Product URLs with variant parameters like /products/t-shirt?variant=12345678.
Why it's useful: Shopify creates unique URLs for variants. If these are indexed separately, you have duplicate content issues. These should typically be canonicalized to the main product page.
How to fix if found: Ensure your theme has proper canonical tags pointing variant URLs back to the main product page.
Collection Pages (Main Level Only)
Use case: Analyze collection landing page performance without pagination or filters.
Regex: /collections/[^/?]+$
What it matches: Main collection URLs only, like /collections/mens-shirts, excluding pages with parameters or subpaths.
Why it's useful: Collection pages are crucial for category-level SEO. This filter helps you see which collections drive traffic and which need optimization.
Paginated Collection Pages
Use case: Find collection pagination that Google has indexed.
Regex: /collections/.*\?page=\d+
What it matches: Paginated collection pages like /collections/shoes?page=2.
Why it's useful: Excessive indexation of pagination wastes crawl budget. Use rel="next/prev" or canonicalize to page 1.
What to look for: If you see high numbers of indexed pagination pages, review your pagination strategy.
Filtered Collection URLs
Use case: Identify collections with filter parameters that may create duplicate content.
Regex: /collections/.*\?(.*&)?(filter|sort_by|constraint)=
What it matches: Collection URLs with filters like /collections/shoes?filter.p.tag=Blue&sort_by=price.
Why it's useful: Filtered URLs can create massive duplicate content problems. These should typically be:
- Blocked with noindex tags
- Canonicalized to the main collection
- Listed in robots.txt if they don't add value
Search Result Pages
Use case: Track internal site search performance and discover what users can't find.
Regex: /search\?.*q=|/search\?.*type=product
What it matches: Search results pages on your Shopify store.
Why it's useful: If search pages are indexed and getting traffic:
- Users can't find these products through normal navigation
- You might be missing collection/category pages
- It reveals content gaps
Action item: Extract the search queries and create collection pages or improve navigation for popular searches.
Technical SEO Filters
All URLs with Parameters
Use case: Find all URLs with query parameters that might cause indexation issues.
Regex: /products/.*\?|/collections/.*\?|/pages/.*\?
What it matches: Any Shopify page with URL parameters.
Why it's useful: Comprehensive view of parameter usage across your site. Export this list to identify:
- Tracking parameters being indexed (utm_, fbclid, etc.)
- Variant URLs
- Filter/sort parameters
- Pagination issues
Cart and Checkout Pages
Use case: Verify that cart/checkout pages aren't being indexed (they shouldn't be).
Regex: /cart|/checkout|/checkouts/
What it matches: Cart and checkout URLs.
Why it's useful: These pages should NEVER appear in Search Console's indexed pages. If they do:
- Check for noindex tags
- Verify robots.txt blocks them
- Ensure canonical tags don't point to them
Critical: If you find indexed cart/checkout pages, fix immediately—these hurt SEO and user experience.
Account and Authentication Pages
Use case: Ensure user account pages aren't indexed.
Regex: /account|/login|/register|/password
What it matches: User account, login, registration, and password pages.
Why it's useful: Like cart pages, these should never be indexed. Finding them in Search Console indicates a serious indexation problem.
Discount and Promo URLs
Use case: Track how discount codes in URLs perform (and whether they should be indexed).
Regex: \?discount=|/discount/
What it matches: URLs containing discount parameters or paths.
Why it's useful: Discount URLs can:
- Create duplicate content if the same product appears at multiple discount URLs
- Leak private discount codes
- Confuse Google about your pricing
Best practice: Use canonical tags or noindex on discount URLs unless you intentionally want them indexed for specific campaigns.
Content Type Filters
Blog Posts vs. Blog Index
Use case: Separate blog post performance from blog category/index pages.
Regex for blog posts: /blogs/[^/]+/[^/?]+
Regex for blog index: /blogs/[^/?]+/?$
Why it's useful: Analyze whether your individual blog posts or your blog categories drive more traffic. This helps prioritize content creation vs. category optimization.
Static Pages
Use case: Track performance of informational pages like About, Contact, FAQ, Shipping, Returns.
Regex: /pages/[^/?]+
What it matches: All static pages created in Shopify's Pages section.
Why it's useful: These pages often rank for important informational queries like "shipping policy" or "return policy." They support purchase decisions even if they don't directly sell.
Query Pattern Filters
Brand + Product Queries
Use case: Identify branded searches that indicate high purchase intent.
Regex: yourstore|yourbrand|your store name
What it matches: Queries containing your store or brand name.
Why it's useful:
- Measure brand awareness
- Track branded traffic growth over time
- Identify if competitors are bidding on your brand terms
- Find brand + product combinations users search for
Customize: Replace "yourstore" with your actual brand name.
Product + Shopify Queries
Use case: Find searches where people specifically want to buy from Shopify stores.
Regex: shopify|shop pay|buy.*shopify
What it matches: Queries mentioning Shopify explicitly.
Why it's useful: Some users specifically seek Shopify stores for Shop Pay or trust reasons. If you're ranking for these, emphasize your Shopify/Shop Pay badges.
Size/Color Variant Searches
Use case: Discover searches for specific product variants.
Regex: size|color|colour|variant|\d+oz|\d+ml|\d+lb|small|medium|large|xl|black|white|red|blue
What it matches: Queries with size, color, or variant specifications.
Why it's useful: Shows user intent for specific variants. Use this data to:
- Feature popular variants in product titles/descriptions
- Create variant-specific landing pages if volume is high
- Ensure variant images are optimized for image search
Price and Deal Queries
Use case: Find high-intent searches focused on pricing and deals.
Regex: price|cost|cheap|affordable|sale|deal|clearance|discount|coupon|promo code|how much
What it matches: Price-conscious queries.
Why it's useful: These searchers are close to purchase but comparing options. Ensure:
- Your pricing is competitive
- You have sale/clearance collection pages
- Product descriptions mention value propositions
- You use price schema markup
Product vs. Review Queries
Use case: Separate product searches from review-seeking searches.
Regex: review|reviews|rating|ratings|testimonial|worth it|good|best|vs|comparison
What it matches: Queries where users seek validation before buying.
Why it's useful:
- Prioritize reviews on products ranking for these terms
- Create comparison content if you rank for "vs" queries
- Add review schema markup to improve visibility
- Build dedicated review pages for high-volume review queries
Shipping-Related Queries
Use case: Track queries about shipping, crucial for e-commerce conversions.
Regex: shipping|delivery|ship|free shipping|how long|when will|tracking
What it matches: Shipping and delivery-related queries.
Why it's useful: Shipping concerns block conversions. If you rank for these:
- Feature free shipping prominently on those pages
- Add shipping information to product descriptions
- Create detailed shipping policy pages
- Use FAQ schema for shipping questions
Advanced Shopify-Specific Patterns
Shopify App Proxy URLs
Use case: Find proxy URLs created by Shopify apps (reviews, wishlists, etc.).
Regex: /apps?/|/a/|/tools/
What it matches: Common app proxy URL patterns.
Why it's useful: Some apps create proxy pages that:
- Might be indexed unnecessarily
- Could create duplicate content
- May have performance issues
Review these to ensure app-generated pages add SEO value.
JSON and Resource Files
Use case: Ensure Shopify's system files aren't indexed.
Regex: \.json|/cdn/|/assets/|/files/
What it matches: JSON endpoints, CDN resources, and asset files.
Why it's useful: These should never be indexed. If found:
- Check your robots.txt blocks them
- Verify your theme isn't creating links to these
- Ensure X-Robots-Tag headers are set correctly
Multi-Currency URLs
Use case: Track URL variations created by currency switchers.
Regex: [?&]currency=|/currency/
What it matches: URLs with currency parameters.
Why it's useful: Currency parameters can create duplicate content. Solutions:
- Use hreflang tags for different currencies/countries
- Canonicalize to a default currency
- Use URL parameter handling in Search Console
- Consider subdomains or subdirectories for major markets
Practical Workflows
Finding Duplicate Content Issues
Step 1: Use the product variant parameter filter (/products/.*\?variant=)
Step 2: Check if these pages are indexed with their own title/description
Step 3: Verify canonical tags point to main product page
Step 4: Use URL parameter handling in Search Console to tell Google how to treat variant parameters
Optimizing Collection Page Structure
Step 1: Filter for main collections (/collections/[^/?]+$)
Step 2: Identify collections with < 100 impressions per month
Step 3: Either:
- Merge low-traffic collections into related ones
- Improve collection descriptions and add more content
- Build internal links to them from high-traffic pages
- Consider removing if they're genuinely low-value
Discovering Content Gaps from Search Queries
Step 1: Use the search results page filter (/search\?.*q=)
Step 2: If these pages are getting organic traffic, extract the search terms
Step 3: Create collection pages or blog content for popular internal searches
Step 4: Improve site navigation so users don't need to search
Cleaning Up Parameter Proliferation
Step 1: Use "All URLs with parameters" filter
Step 2: Export the list
Step 3: Categorize parameters:
- Tracking (remove from indexation)
- Variants (ensure proper canonicals)
- Filters/sorting (noindex or canonical)
- Pagination (implement proper pagination tags)
Step 4: Configure URL parameter handling in Search Console
Step 5: Update robots.txt if needed
Shopify-Specific Tips
Use Shopify's Built-in SEO Features
- Automatic canonical tags: Shopify adds these by default, but verify your theme doesn't override them
- Structured data: Shopify includes product schema automatically
-
Sitemap: Auto-generated at
/sitemap.xml -
Robots.txt: Basic version at
/robots.txt, customizable via theme files
Common Shopify SEO Pitfalls to Avoid
- Product/collection descriptions in theme code: Some themes hard-code text, creating duplicates across pages
- Over-use of apps: Each app adds code weight and potential indexation issues
- Thin collection pages: Generic descriptions hurt SEO
- Ignoring variant optimization: Variants need unique descriptions if indexed separately
- Not optimizing blog: Shopify's blog is powerful but often ignored
Theme Selection Matters
When choosing a Shopify theme:
- Verify it has proper canonical tags
- Check structured data implementation
- Test mobile speed (Core Web Vitals)
- Ensure it doesn't create duplicate content
- Confirm it handles variants properly
Monitoring Checklist
Use these regex filters monthly to maintain healthy indexation:
- [ ] Check for new parameter URLs being indexed
- [ ] Verify cart/checkout pages remain unindexed
- [ ] Monitor product variant URL indexation
- [ ] Review filtered collection URLs
- [ ] Check for app-generated pages being indexed
- [ ] Track internal search page indexation
- [ ] Monitor collection pagination patterns
- [ ] Verify JSON endpoints aren't indexed
Conclusion
Shopify's structure creates unique SEO challenges and opportunities. These regex filters help you navigate the platform's quirks, from variant URL management to collection optimization.
The most common issues Shopify stores face are parameter proliferation and variant URL duplicates. Start by auditing these with the filters above, then expand to query analysis and content gap discovery.
Remember: Shopify handles much of the technical SEO automatically, but you still need to monitor for theme issues, app conflicts, and strategic opportunities. These filters give you the visibility to spot problems early and capitalize on what's working.
Use these patterns as a foundation, then customize them to match your store's specific URL structure and business goals. The insights you uncover will guide smarter optimization decisions and help your Shopify store capture more organic traffic.