In 2023, my staff and I started engaged on maybe one of the bold content material audits ever carried out on the HubSpot Weblog. We’ve run content material audits up to now — however not like this.
We ran the audit in three phases:
- Section 1 addressed our oldest content material.
- Section 2 evaluated our lowest-performing content material.
- Section 3 assessed the worth of our subject clusters.
When it was all stated and completed, we audited over 10,000 weblog put up URLs and over 450 subject clusters.
On this put up, I’m going to give attention to part one in all our audit. I’ll stroll you thru how we audited our oldest content material and the way we took motion. Plus, I’ll share the outcomes we discovered.
However first, let me provide you with some background on why we determined to run an audit of this magnitude.
Why We Audited
It began in early 2023. On the time, my staff was known as the Historic Optimization staff and we sat on the intersection of HubSpot’s Website positioning and Weblog groups.
We have been liable for updating and optimizing our current weblog posts and discovering progress alternatives inside our library. (We’ve since developed into what’s now the EN Weblog Technique staff.)
In case you’re new right here, the HubSpot Weblog is HUGE.
For context, the weblog was dwelling to 13,822 pages in February of 2023, the month we started our audit. Ahrefs’ Mateusz Makosiewicz even declared it because the “largest company weblog … ever” in an Website positioning case examine earlier this yr.
Whereas we’re lucky to have a excessive area authority and drive tens of millions of visits monthly, having a weblog of this dimension doesn’t come with out challenges.
As our library ages, the quantity of alternative for brand new content material throughout our weblog properties and clusters shrinks.
So, we determined to audit our library to seek out alternatives for optimization.
We hypothesized that we might uncover “greenspace” and “quasi-greenspace” — matters that we now have lined however haven’t capitalized on that effectively — by auditing the oldest 4,000 URLs in our library.
Though this was solely a few third of our content material library, we believed we’d be capable of unearth some visitors alternatives and provides our weblog a lift.
Across the similar time, we began to really feel the consequences of Google’s March 2023 Core Replace that emphasised expertise, which our Technical Website positioning staff instantly began addressing.
Nonetheless, one other a part of that algorithm replace emphasised content material freshness and helpfulness. In different phrases, how cutting-edge and helpful our content material is to our readers.
That is the place we actually felt a way of urgency.
As a result of we had 4,000 URLs with revealed dates starting from 2006 to 2015, we already knew that this chunk of content material was not recent or useful.
So, we started working and audited these weblog posts over the course of ten weeks.
Finally, we added phases two and three to our plan so we might additional handle unhelpful content material and clusters.
How We Audited Our Oldest Content material
1. Outline our targets.
Earlier than we began auditing the content material, it was vital for us to find out the targets.
For some publishers, the purpose of a content material audit might embody enhancing on-page Website positioning, enhancing consumer engagement, aligning content material with advertising targets, or figuring out content material gaps.
For this explicit audit, it meant uncovering “greenspace” and “quasi-greenspace” in our weblog library, and enhancing our general content material freshness.
We additionally needed to decide the scope of our audit.
There’s no proper or incorrect technique to method this. Relying in your targets and the scale of your web site, you may audit the entire thing in a single go.
You might additionally begin with a small portion of your web site (similar to product pages or particular subject clusters) and construct out from there.
Since HubSpot has such a big content material library, we opted to restrict this audit to our oldest 4,000 URLs. Not solely was this extra manageable than reviewing all of our content material in a single audit, however this additionally focused URLs that have been extra more likely to profit from an replace or prune.
We additionally did this figuring out that we might later handle the remainder of our library throughout phases two and three.
2. Collect our content material stock.
As soon as we established our targets and scope, we needed to collect the oldest 4,000 weblog posts and put them right into a spreadsheet.
This course of can range relying on the instruments and CMS you utilize. Right here’s how we did it utilizing Content material Hub:
1. Log into HubSpot and navigate to the Weblog web page in Content material Hub.
2. Navigate to the Actions drop-down and click on Export weblog posts.
3. Choose File format and click on Export. This may ship all your weblog put up info to your electronic mail. Additionally, you will get a notification in HubSpot as soon as your export is prepared.
4. Obtain your export and open it in your most popular spreadsheet software program (I’m normally a Google Sheets girlie, however I had to make use of Microsoft Excel for the reason that file was so giant).
5. Assessment every column within the spreadsheet and delete those that aren’t related to your audit. We instantly deleted the next:
- Put up Website positioning title
- Meta description
- Final modified date
- Put up physique
- Featured picture URL
- Head HTML
- Archived
6. As soon as the irrelevant columns have been eliminated, the next remained:
- Weblog identify
- Put up title
- Tags
- Put up language
- Put up URL
- Creator
- Publish date
- Standing
7. Filter the Put up language column for EN posts solely. Delete the column as soon as the sheet is filtered.
8. Filter the Standing column for PUBLISHED solely. Delete the column as soon as the sheet is filtered.
9. Filter the sheet by Publish date from oldest to latest.
10. Spotlight and duplicate the primary 4,000 rows and paste them right into a separate spreadsheet.
11. Title the brand new spreadsheet Content material Audit Grasp.
If you happen to’re feeling fancy, you too can create a customized report in Content material Hub and choose solely the fields you need included within the audit so that you don’t need to filter as a lot when organising your spreadsheet.
3. Retrieve the information.
After compiling all the content material wanted for our audit, we needed to gather related information for every weblog put up.
For this audit, we stored it fairly easy, solely analyzing complete natural visitors from the earlier calendar yr, complete backlinks, and complete key phrases.
We did this as a result of our advisable actions for every URL have been decided throughout post-evaluation. (We’ll cowl this within the subsequent step.)
We obtained natural visitors information from Google Search Console and used a VLOOKUP to match every URL with its corresponding variety of Clicks.
Then, we obtained backlink and key phrase information by copying and pasting our audit URLs into Ahrefs’ Batch Evaluation software and exporting the information into our spreadsheet.
On the time of our audit, the Batch Evaluation software might solely analyze as much as 200 URLs directly, so we needed to repeat this step 20 instances till we had information for every URL.
Fortunately, Ahrefs has rolled out a Batch Evaluation 2.0 software since then, which might analyze as much as 1,000 URLs directly. So, if we have been to do the same audit sooner or later, it will take a lot much less time to retrieve this information.
4. Consider the content material.
Subsequent, we assessed every bit of content material through the use of the collected information. Then, we evaluated the put up itself to find out the next:
- Kind of Content material
- Freshness Degree
- Natural Potential
Kind of Content material
The HubSpot Weblog is dwelling to many several types of weblog posts, every serving a novel goal. Labeling every put up helped us decide its relevance and have become a key consider our resolution to replace or prune.
Whereas this isn’t an exhaustive listing of all of the content material varieties you may discover on the HubSpot Weblog, we narrowed it right down to the next for the needs of this audit:
- Instructional: A subject that may educate the consumer on a ache level or drawback they know they’ve.
- Thought Management: A subject that may educate the consumer on a ache level or drawback they didn’t know they’d till an professional drew it to their consideration.
- Enterprise Replace: A HubSpot-related piece of reports or a press launch that’s seemingly not evergreen.
- Newsjacking: An industry-related piece of reports or a press launch that’s seemingly not evergreen.
- Analysis: A set of information or the outcomes of an experiment that’s used to coach the reader. This subject might or might not be evergreen, however the content material shouldn’t be and desires updates to remain recent.
Freshness Degree
As a result of the posts on this audit hadn’t been up to date in a very long time, none of them might be thought-about 100% “recent.” Nonetheless, we took several types of freshness into consideration when figuring out what motion wanted to be taken on the URLs.
For instance, some matters, similar to Google+, are so outdated that an replace can be foolish. Nonetheless, loads of matters have been nonetheless evergreen, though our content material was not.
The next scale helped us make choices on whether or not the URL had worth with regard to freshness:
- Outdated: The subject is outdated, and an replace might not be doable.
- Stale: The subject is evergreen, however it will want an in depth replace to make the content material extra aggressive.
- Comparatively Contemporary: The subject is evergreen, and it will solely want a reasonable replace to make the content material aggressive.
Natural Potential
To find out the natural potential of every URL, we needed to ask ourselves the next query: Will anybody seek for the content material on Google?
- Sure: Somebody would positively seek for this, so we’ll have to optimize/recycle the content material.
- No: Somebody wouldn’t seek for this. There’s no level in optimizing/recycling the content material since there’s no doable focus key phrase.
For all the posts marked “Sure” for natural potential, we advisable a spotlight key phrase for the re-optimized content material to compete for. We did this by evaluating the prevailing title, slug, and content material. Then, we did some key phrase analysis on Ahrefs and reviewed the Google SERP for that question.
We additionally included the main target key phrase’s month-to-month search quantity (MSV) to assist prioritize which updates to carry out first. We did this by plugging the advisable key phrase into Ahrefs’ Key phrases Explorer and including the MSV to our grasp sheet.
For an additional layer of warning, we additionally checked for cannibalization on all posts marked “Sure” for natural potential. There are a number of methods to do that:
- Do a web site search and see if any URLs come up for the main target key phrase.
- Plug the main target key phrase into Google Search Console to see if any URLs come up.
- Plug the main target key phrase into Ahrefs’ Key phrase Explorer, scroll right down to Place Historical past, search your area identify, and filter for Prime 20 and your required time-frame (I normally test the final six months). If a number of URLs are discovered, which will point out cannibalization.
If the main target key phrase was flagged for cannibalization, we both discovered a special focus key phrase or famous that the URL must be redirected to the more energizing put up.
If no cannibalization was discovered, then we had the inexperienced gentle to maneuver ahead with updating the put up.
5. Suggest an motion.
As soon as a put up was fully evaluated, we turned the insights into motion gadgets.
Every URL was positioned into one of many following classes:
- Preserve: No motion is required as a result of each the content material and the URL are good.
- Optimize: The content material is sweet however outdated when it comes to freshness or Website positioning practices. Preserve the spirit of the article, however refresh and re-optimize to enhance efficiency.
- Recycle: The content material shouldn’t be salvageable, however the URL nonetheless has worth (when it comes to backlinks or key phrase alternative). Create new content material from scratch, however retain the URL.
- Prune: Neither the content material nor the URL has worth from an natural standpoint.
Audit Insights
Out of the 4,000 URLs we audited, 951 (23.78%) have been categorized as posts with natural potential and advisable for optimization or recycling. Moreover, 2,888 URLs have been advisable to be pruned. That’s about 72.2% of the audit.
These posts both didn’t have natural potential, posed a cannibalization threat, or have been so outdated that there was no level in updating them.
The remaining 161 URLs both didn’t require any motion or had already been redirected.
How We Took Motion
The motion taken for a URL was decided by its potential for natural visitors.
The URLs with natural potential have been delivered to our Weblog staff and advisable to be optimized or recycled.
In the meantime, the URLs with no natural potential have been delivered to our Website positioning staff and advisable to be archived or redirected.
First, let’s stroll by way of how we took motion on the posts advisable to be optimized or recycled.
Taking Motion on Content material with Natural Potential
Earlier than addressing any of the 951 posts with natural potential, we would have liked to determine the next:
- Our capability for strategic evaluation and transient writing
- The capability of our in-house writing workers and out there freelancers
- Our capability to edit the updates
We coordinated with stakeholders and decided we solely had the bandwidth to replace 240 posts in 2024 (along with the handfuls of weblog posts we replace every month). This initiative was internally often called the “De-Age the Weblog Mission” and was led by my EN Weblog Technique teammate Kimberly Turner.
As soon as we knew what number of posts we might tackle, we needed to slim down which of them to prioritize. We did this by evaluating the complexity of the raise required for every put up replace:
- Easy Replace: The content material updates wanted are comparatively gentle, making them appropriate for freelancers.
- Advanced Replace: The content material updates wanted are heavy, making them higher fitted to in-house writers.
- Recycle: Content material shouldn’t be salvageable, however the URL is. Rewrite the put up from scratch, however retain the URL.
- No Alternative: Go on updating.
We initially prioritized updating the best URLs first, however later pivoted our technique to sort out the URLs with the best MSV potential, no matter replace complexity.
We did this as a result of we needed to get essentially the most we might out of our updates.
De-Age the Weblog Outcomes
Initially, we projected that these updates can be full by the tip of H1 2024, however we needed to shift our technique … once more.
Like many different publishers, we felt the consequences of Google’s March 2024 Core Replace in addition to the introduction of AI Overviews.
After having positioned the De-Age the Weblog Mission on maintain whereas we addressed the problems, we deprioritized the venture totally in favor of higher-impact workstreams.
Website positioning, am I proper? It all the time retains you in your toes.
Regardless of sunsetting the venture earlier than it was full, we have been nonetheless capable of carry out 76 put up updates. Six months after the updates have been carried out, the cumulative month-to-month visitors for these posts had elevated by 458%.
This goes to point out that even updating a small portion of URLs could make a giant distinction.
Taking Motion on Content material with No Natural Potential
Whereas the De-Age the Weblog Mission was going down, we additionally took motion on the two,888 URLs that have been advisable to be pruned.
Because the preliminary audit didn’t embody suggestions on tips on how to prune, we had to return and re-review every URL to find out how we might prune.
Right here’s how we evaluated the posts:
- Archive (404): The URL has lower than 10 backlinks and the backlink profile doesn’t have worth.
- Redirect (301): The URL has greater than 10 backlinks and/or the backlink profile has worth.
How precisely did we decide the backlink profile worth? Rory Hope, HubSpot’s head of Website positioning, advisable we observe these steps:
1. Login into Ahrefs and submit the URL into Ahrefs’ Website Explorer search bar.
2. Choose Overview from the left-hand sidebar.
3. Scroll down and click on Backlink Profile.
4. Scroll down additional and choose By DR beneath Referring Domains.
5. Analyze and examine any referring domains which can be > 50.
6. Navigate to the Referring Area you’re investigating > 50 by clicking the quantity.
7. Analyze the Referring web page.
Choose Redirect (301) if:
- The Referring web page hyperlink is from a website that also receives Area visitors.
Choose Archive (404) if:
- The Referring web page hyperlink seems to be “spammy.” You possibly can decide this by asking the next questions:
- Does this web site solely publish low high quality visitor put up (Website positioning-led) content material from numerous completely different matters?
- Does this web site nonetheless publish content material? If not, ignore it.
- The Referring web page is from an internet site that you simply see linking to quite a lot of EN Weblog posts, by way of a RSS fashion automated linking system.
Moreover, all URLs labeled “Redirect (301)” required a brand new URL to be redirected to.
When selecting a brand new URL, we did our greatest to select essentially the most related and related web page. If we couldn’t discover one, we redirected to the pillar web page of the cluster that the put up belonged to.
If for some purpose, the URL didn’t belong to a cluster or there wasn’t a pillar web page, we redirected it to the HubSpot Weblog homepage.
Determination-making for some content material varieties was simpler than others. For instance, we have been capable of mechanically assign 301 redirects to URLs that have been flagged for cannibalization in the course of the preliminary audit. We additionally mechanically assigned 404s to URLs with lower than 10 backlinks labeled as Newsjacking and Enterprise Updates.
All the things else was manually reviewed to make sure accuracy. To make the analysis course of simpler, we adopted this resolution tree:
It took my staff about two and a half weeks to make sure that each URL had the right label. In the long run, we had 1,675 URLs assigned to be redirected and 1,210 URLs assigned to be unpublished and archived.
As soon as every URL was evaluated, we have been lastly able to take motion.
After coordinating with Rory and Precept Technical Website positioning Strategist, Sylvain Charbit, we determined to prune the URLs in batches as an alternative of abruptly. That approach, we might higher monitor the influence of redirecting and archiving a big amount of content material.
Initially, we deliberate to implement our prune in 5 batches over 5 weeks, permitting us time to observe efficiency in the course of the weeks in between.
Batches one and two contained URLs meant to be archived and unpublished, and batches three by way of 5 contained URLs designated for 301 redirects.
As a result of there have been so many URLs to unpublish and archive, we labored with builders on HubSpot’s Digital Expertise staff to create a script that may mechanically unpublish and archive URLs and redirect them to our 404 web page.
Then, we have been capable of implement the 301 redirects with the Bulk URL Redirect software in Content material Hub.
Be aware: Though we have been capable of work by way of this course of internally and end earlier than our deadline, I wish to acknowledge that manually evaluating over 2,000 URLs may be tedious and time-consuming.
Relying in your assets and the scope of your audit, you might wish to think about hiring a freelancer to assist your staff work by way of a activity this massive.
Content material Pruning Outcomes
Whereas we efficiently carried out every batch, this course of didn’t come and not using a few street bumps.
Halfway by way of our pruning schedule, Google rolled out the March 2024 Core Algorithm Replace. We ended up inserting our pruning schedule on maintain so we might higher monitor efficiency in the course of the replace.
As soon as the replace was full, we resumed the remainder of our prune till it was full.
Due to the risky search panorama in 2024, we didn’t see the visitors good points we’d hoped to see as soon as the prune was full. Nonetheless, we did have a good time an enormous win for general content material freshness on the weblog.
Initially of our audit in 2023, we calculated the freshness of our content material library by taking a look at every URL’s publish date and quantifying the variety of days since they have been up to date.
For instance, say the present date is November 12, 2024, and you’ve got a put up that was final up to date on February 19, 2008. Based mostly on the 2024 date, the put up from 2008 is 16.7 years previous or 6,110 days.
As soon as we had all the ages for each put up on the HubSpot Weblog, we averaged these numbers to find out the typical age of our content material library, which was 2,088 days (5.7 years).
Since pruning 2,888 URLs (and updating tons of of URLs from the audit and past), the HubSpot Weblog’s common age has dropped to 1,747 days — that’s 341 days youthful than once we began.
As content material freshness and helpfulness play a fair larger function in search algorithms, being almost a yr youthful could make a giant distinction.
What’s Subsequent?
Earlier on this put up, I discussed that this audit is just one of three that my staff has labored on in 2024.
Our part two audit focuses on the lowest-performing posts that weren’t included in part one, totaling over 6000 URLs. Then, part three assesses the worth of our Weblog’s subject clusters.
We’re nonetheless taking motion on the outcomes from these audits, however I’m so excited to share the method and insights once they’re full.
Finally, content material auditing is a job that’s by no means really completed — particularly when working with giant libraries. You end one audit, then it’s on to the following.
Though the work may be tedious, the rewards of enhancing content material high quality, consumer expertise, and efficiency make it well worth the effort.