Is there a way to identify which URLs already have enough content?
joeadvent
Posted: Sat Nov 19, 2011 6:33 pm
Hi,
I would like to identify which URLs already have content, or enough content (e.g. 300 words of written text). Is there a way to do this, or does anybody have an idea how to solve it?
The site in WebSite Auditor has more than 50,000 URLs, so it would be hard to check them manually ...
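For anyone who wants to measure this outside of WebSite Auditor, here is a minimal sketch of counting the visible words in a page's HTML with the Python standard library. The names `VisibleTextParser`, `count_visible_words` and `has_enough_content` are made up for this example, and the 300-word threshold is simply the figure from the question. It assumes the HTML has already been downloaded; fetching 50,000 URLs is left out.

```python
import re
from html.parser import HTMLParser


class VisibleTextParser(HTMLParser):
    """Collects text that is not inside <script>, <style> or <noscript>."""

    SKIPPED_TAGS = {"script", "style", "noscript"}

    def __init__(self):
        super().__init__()
        self._skip_depth = 0  # > 0 while inside a skipped element
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED_TAGS:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIPPED_TAGS and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._skip_depth == 0:
            self.chunks.append(data)


def count_visible_words(html):
    """Return the number of word-like tokens in the visible text."""
    parser = VisibleTextParser()
    parser.feed(html)
    parser.close()
    text = " ".join(parser.chunks)
    return len(re.findall(r"\w+", text))


def has_enough_content(html, min_words=300):
    """True if the page has at least min_words visible words (assumed threshold)."""
    return count_visible_words(html) >= min_words
```

Note that this counts all visible text, including navigation and other repeated page elements, so the threshold would probably need to be raised by whatever word count an otherwise empty page of the site already has.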
Well, since analyzing each page separately in the Webpage Analysis module of WebSite Auditor would take far too long, you could pay attention to the following ranking factors:
Content type, Page size, HTML code size.
Once you update these for all your pages, you can look for pages with the "text" content type and a very small page and HTML size; those may be the pages lacking content. You can then sort by size and browse through the near-empty pages.
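If the three factors were copied out of the tool into a list of records, the filtering and sorting described above could be sketched roughly like this. The field names (`url`, `content_type`, `html_size`) and the 5000-byte cutoff are assumptions for illustration only; they are not WebSite Auditor's actual export format.

```python
def thin_page_candidates(rows, max_html_bytes=5000):
    """Return text pages whose HTML size is at or below the cutoff, smallest first.

    rows: list of dicts with the hypothetical keys
          'url', 'content_type' and 'html_size' (bytes).
    """
    candidates = [
        row for row in rows
        if row["content_type"].startswith("text/")
        and row["html_size"] <= max_html_bytes
    ]
    return sorted(candidates, key=lambda row: row["html_size"])


# Made-up example data
pages = [
    {"url": "http://example.com/a", "content_type": "text/html", "html_size": 4200},
    {"url": "http://example.com/b", "content_type": "text/html", "html_size": 18000},
    {"url": "http://example.com/c.png", "content_type": "image/png", "html_size": 900},
    {"url": "http://example.com/d", "content_type": "text/html", "html_size": 1500},
]
for page in thin_page_candidates(pages):
    print(page["url"], page["html_size"])
```

This only narrows things down when thin and full pages actually differ in size, so the result is a list of candidates to check rather than a verdict.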
Unfortunately, the URLs without a text of more than 100 words have the same size as the ones with e.g. 200 words. Every page shows products with short texts, which is probably why the URLs without the 100-300 word articles can't be identified this way.
Content type is not an indicator either, because every page I checked has the content type text/html.
I'll keep trying and let you know if I find a way to handle this. At the moment I have checked the URLs in ScrapeBox and got a list of URLs containing words that indicate they contain 100-300 word articles.
The question is: is there a way to import the ScrapeBox list (txt/csv/xls...) into WebSite Auditor in order to give all of those URLs a specific tag? I would really like to mark these URLs in WebSite Auditor...
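Whether WebSite Auditor can import such a list is not answered in this thread. As a tool-independent preparation step, a plain list of URLs (one per line) can be turned into a two-column CSV of URL and tag, as in the rough sketch below. The function names, the `url,tag` header and the tag value `has-article` are all hypothetical.

```python
import csv
import io


def build_tag_rows(url_lines, tag):
    """Strip whitespace, drop blank lines and duplicates, pair each URL with the tag."""
    seen = set()
    rows = []
    for line in url_lines:
        url = line.strip()
        if not url or url in seen:
            continue
        seen.add(url)
        rows.append((url, tag))
    return rows


def write_tag_csv(rows, out_file):
    """Write rows to an open text file object as CSV with a header row."""
    writer = csv.writer(out_file)
    writer.writerow(["url", "tag"])
    writer.writerows(rows)


# Made-up example: two distinct URLs, one duplicate, one blank line
lines = ["http://example.com/a\n", "\n", "http://example.com/a\n", "http://example.com/b\n"]
buffer = io.StringIO()
write_tag_csv(build_tag_rows(lines, "has-article"), buffer)
print(buffer.getvalue())
```

For a real list, `url_lines` could be an open text file and `out_file` a file opened with `newline=""`, as the `csv` module documentation recommends.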