Join the waitlist and get Sublim Business free for 3 months  Claim offer

Behavioral AnalyticsWeb Analytics

Filter Before You Watch: Why Traffic Source Changes What Session Replays Tell You

Jocerand LeroyJocerand Leroy
9 min read
#session-replay#conversion#web-analytics
Most teams watch session recordings without context and draw weak conclusions. The teams that consistently find real problems do one thing differently: they define the segment before they press play. Here is the filtering framework and the 20-recording protocol.
Filter Before You Watch: Why Traffic Source Changes What Session Replays Tell You

Most teams watch session recordings the same way: open the tool, filter by the page with the highest bounce rate, and click play on whatever comes up. After 20 recordings, they make a change. Sometimes it works. Usually it does not.

The problem is not the recordings. It is the sample. A random recording of a visitor on your pricing page tells you what one person did. It does not tell you whether that behavior is typical, what brought them there, or why they left without converting. Without that context, you are pattern-matching on noise.

The teams that consistently find real problems in session recordings do one thing differently before they press play: they filter by traffic source, device type, and conversion outcome. That combination turns an anecdotal observation into evidence.

Why traffic source changes everything you see

A visitor who arrived at your landing page from a Google Ads campaign saw a specific promise in the ad ("GDPR-compliant analytics, no setup required") and clicked expecting exactly that. If the page opens on a generic product overview instead of confirming that promise immediately, they leave within seconds.

A visitor who arrived at the same page from an organic search for "best analytics alternatives" found you through content, has read about the problem space, and is in a slower evaluation mode. They will scroll further, read more, and leave for different reasons than the paid visitor.

If you mix recordings from both sources and watch them together, you will see a confused signal. The rage clicks from the paid visitor frustrated by a mismatch between ad promise and page content will average out against the slower, more exploratory behavior of the organic visitor. You will draw conclusions that do not apply to either.

Segment first. Watch second.

Session recording filter funnel: from all sessions to a focused 20-recording sample filtered by source, device, and outcome
Each filter removes noise. The combination of source, device, and outcome turns a random sample into a controlled observation.

The three dimensions that define a useful segment

Before opening a single recording, define the combination you want to investigate. Three dimensions are enough:

The filtering framework
1
Traffic source
Paid search, organic, direct, email, social. Each brings a visitor with different expectations and intent. The same page will perform differently for each.
2
Device type
Mobile and desktop are different products. A conversion problem that only appears on mobile is a different fix than one that appears everywhere.
3
Conversion outcome
Bounced, converted, or abandoned mid-funnel. This separates the sessions worth diagnosing from the ones that worked as intended.

A segment defined by all three is specific enough to be actionable. "Paid search, mobile, bounced" is a problem statement. "All visitors to my pricing page" is not.

What each combination tells you before you watch a single recording

Not every combination points to the same type of problem. Reading the signal in advance focuses your attention and prevents you from retrofitting an explanation after the fact.

Source Device Outcome Most likely problem
Paid Any Bounced Ad-to-page mismatch: the promise in the ad does not match what the page delivers above the fold
Organic Any Bounced Search intent mismatch: the content does not answer what the keyword implied
Any Mobile Bounced Mobile experience failure: layout, load speed, or a CTA that is not tappable
Any Any Abandoned mid-funnel Friction at a specific step: a form field, a missing payment option, or an absent trust signal
Direct Any Bounced Return visitor expectation mismatch: something changed that broke a familiar flow
Email Any Bounced Audience-page misalignment: the landing page does not match the email segment or the offer

This diagnosis happens before you open a recording, not after. You are not looking for a surprise. You are looking for confirmation of a specific hypothesis.

The 20-recording protocol

Once you have your segment and your hypothesis, watch 20 recordings. Not 5 (too few to find a pattern). Not 50 (a full afternoon for diminishing returns). Twenty is enough.

What to look for, in order of diagnostic weight:

1
Instant exit with no scroll
The visitor arrives, reads nothing, and leaves within 3 seconds. The above-the-fold experience failed immediately: headline, load speed, or a visual mismatch with the referrer. If this appears in more than a third of your paid segment, the ad and the page are saying different things.
Critical
2
Rage clicks
Rapid repeated clicks on the same element. Either something looks clickable and is not, or an action failed and the visitor is retrying. Always a UX breakage. Cross-reference with your heatmap to confirm the location.
Critical
3
Dead scroll at the same point across sessions
Visitors scroll to a consistent depth and stop. The content above that point is not compelling enough to keep them moving. Especially telling when it appears consistently within a single source segment: organic bounces stopping at different points than paid bounces means two different problems.
Important
4
Form abandonment at a specific field
The visitor starts filling out a form and stops at the same field across multiple sessions. That field has a friction problem: a confusing label, a required field that should not be required, or a format constraint that is not clearly communicated. In B2B forms, "company size" and "phone number" are the most common culprits.
Important
5
CTA never reached
The visitor scrolls actively but never reaches the call to action before leaving. Either the page is too long, the CTA is positioned too low, or the content before it does not create enough forward momentum. Confirm with the scroll depth heatmap.
Contextual

Track what you observe across all 20 recordings with a simple tally. You are looking for patterns that appear in at least 30 to 40 percent of sessions in your segment. A pattern in 2 out of 20 recordings is noise. A pattern in 10 out of 20 is a finding.

A concrete example: a paid campaign landing page at 1.8% conversion rate

Your paid search campaign sends traffic to a specific landing page. The conversion rate is 1.8%. Industry benchmark for similar pages sits around 3 to 4%. Something is underperforming, but the GA4 report just shows a high bounce rate and low session duration. It does not tell you why.

Without the filtering framework, you watch 20 random recordings on that page. You see a mix: some visitors scroll slowly, some exit quickly, some seem engaged but never click the CTA. You cannot tell whether the quick exits are from paid traffic or organic visitors who landed on the same URL. You cannot tell whether the engaged-but-not-converting sessions represent a different problem than the instant exits. You make a guess and change the headline.

With the filtering framework, you run three separate investigations:

1Paid search, mobile, bounced. You watch 20 recordings and see consistent instant exits: no scroll, exit within 2 to 3 seconds. The mobile visitor read the ad, arrived, scanned the first line, and decided this was not what they expected. The ad headline and the page headline are misaligned. Fix: rewrite the above-the-fold headline to echo the ad copy exactly.
2Paid search, desktop, bounced. In this segment, visitors scroll and reach roughly 40% of the page before leaving. Dead scroll at the same point across most sessions. The above-the-fold experience works on desktop but the value proposition breaks down mid-page. Fix: rewrite or replace the section at the 40% mark.
3Paid search, any device, abandoned at signup. Visitors who clicked the CTA and started the form are abandoning at the "company size" dropdown. Removing it or making it optional is a single-field change. Fix: make the field optional and move it to the end of the form.
Three filtered segments on the same landing page, each revealing a different problem and a different fix
Same page. Same campaign. Three filtered segments, three different problems, three different fixes.

Three segments, three different diagnoses, three different fixes. The aggregate bounce rate report showed you that something was wrong. The filtered recordings told you exactly what to change and where.

Why this is nearly impossible with disconnected tools

The filtering described above requires two pieces of information simultaneously: what the analytics tool knows (traffic source, device, conversion outcome) and what the behavioral tool knows (the recording itself).

Most teams use Google Analytics for traffic data and a separate tool (Hotjar, Microsoft Clarity, or similar) for recordings. The data does not flow between them. When you open Hotjar, you see sessions. You can filter by page, by device, sometimes by duration. But you cannot filter by the UTM campaign that brought the visitor there, or by whether they completed a goal configured in GA4.

The workaround is manual: export session IDs from GA4, cross-reference with Hotjar, find the matching recordings. In practice, almost no team does this. They watch recordings without context, draw weak conclusions, and wonder why the changes they make do not move conversion rate.

Capability GA4 + Hotjar (separate tools)
See Sublim vs Hotjar →
Sublim (integrated)
Try for free →
Filter recordings by traffic source Manual cross-reference only Native filter
Filter recordings by conversion outcome Not possible directly Native filter
Filter recordings by device + source combined Device in Hotjar, source in GA4: no link All three combined in one view
Traffic source visible in the recording view No Yes
GDPR: no consent banner needed No (both tools require consent in EU) Yes

There is a compounding GDPR problem worth noting. In EU markets with an active consent banner, 30 to 50% of visitors decline tracking. Hotjar does not record sessions from visitors who declined. Your recordings are already a self-selected sample of the visitors who accepted, skewing toward engaged, brand-familiar users. The visitors most likely to bounce fast are also the most likely to have declined consent and be invisible in your recording tool. For more on this, see our article on running analytics without a consent banner.

The one hypothesis rule

At the end of the 20-recording protocol, you should have one hypothesis. Not five. Not a list of things to improve. One statement in this form: "If I change X, the conversion rate for [source, device, outcome] segment will improve because [pattern] appeared in [N] out of 20 recordings."

This constraint matters. Teams that emerge from a recording session with a list of ten things to fix tend to implement all of them at once, then measure the outcome without knowing which change drove the result, or which one caused a regression elsewhere. One hypothesis, one change, two weeks of measurement.

If your recordings reveal strong patterns across multiple segments, prioritize by traffic volume multiplied by conversion gap. A problem affecting your highest-spend paid campaign takes precedence over a form abandonment pattern in a small direct traffic cohort.

The bottom line

Session recordings are evidence. Like all evidence, their value depends on how you collect them. A random sample of recordings from a high-bounce page is anecdotal. A sample filtered by source, device, and conversion outcome is a controlled observation.

The process that works: define your segment before you open a recording → form a hypothesis about what you expect to see → watch 20 recordings and tally the patterns → identify one finding that appears in at least 30% of sessions → make one change → measure for two weeks.

The process that does not work: open a tool, click play, notice interesting things, change several things, wonder why conversion rate did not move.

The filtering is the work. The watching is just confirmation.

For the traffic-side diagnostic that complements this workflow, see our bounce rate diagnostic guide. The segmentation logic is the same: isolate the source, then interpret the signal.

Jocerand Leroy
Author
Jocerand Leroy
Web Analytics & Privacy Lead

Jocerand writes about privacy-first web analytics, conversion diagnostics, and helping teams make sense of their data without compromising on compliance.

View all articles by this author

Ready to try Sublim?

Simple, fast analytics that respects privacy. It's free to get started.

Business plan · 3 months free at launch · Promo code sent by email

Session Replay & Traffic Source: A Better Diagnostic Method | Sublim Analytics