RSS-Spider

Development, Ideas, Issues, problems, ßetas and what not…

Database disaster :(

Filed under: What Not... — Dave at 12:59 pm on Tuesday, September 30, 2008

Well over the weekend while backing up the database our hard drive running the database & sphinx search up and died on us. We have a new one on order but it won’t be here for a few days (ok.. we bought it on Ebay). Anyhow… we’ll be back soon
-D

The red ones taste like burning…

Filed under: Issues, What Not... — Dave at 10:18 pm on Tuesday, August 19, 2008



Actually they all taste like burning…

wordpress hack <u style=’display:none’>

Filed under: Problems — Dave at 1:21 am on Saturday, April 26, 2008

Checking email for this site today I ran across this email from Google Search Quality. At first I thought it was a spam seeing as it was filled with crap about viagra & calliass but was shocked to find that this crap WAS on this blog. Well it seems an older version of Wordpress that I was running has a venerability allowing someone to update your theme files and post all sorts of CRAP into it with links leading back to thier spammy sites. Some one did this since I am a lazy sysadmin and didn’t update wordpress. Broke rule number 2 on the Google webmaster security check list…

Shame on me…

Dear site owner or webmaster of rss-spider.com/blog,

While we were indexing your webpages, we detected that some of your pages were using techniques that are outside our quality guidelines, which can be found here: http://www.google.com/webmasters/guidelines.html. This appears to be because your site has been modified by a third party. Typically, the offending party gains access to an insecure directory that has open permissions. Many times, they will upload files or modify existing ones, which then show up as spam in our index.
(Read on …)

Answering the questions I get at least once a week… “How can I setup an RSS feed for my site?”

Filed under: Development — Dave at 12:27 am on Thursday, June 14, 2007

This step by step comes from Design World Online and is reprinted with permission. The orginal document is located at http://www.designworldonline.com/ftp/dmm_pdf/DesignWorld_HowToRSS.pdf
FEED / BLOG CREATION

STEP 1. Find Service or Application
Design World recommends TypePad (www.typepad.com). It is very easy to set up a free 30-day trial and get started with your communication. If getting your IT department involved is required, you can even point these services to your own domain for seamless integration. If you have a Web site, blog, audio/video content or even photos, you can offer a feed of your content as an option. If you are using a popular blogging platform or publishing tool like TypePad, Wordpress or Blogger, you likely publish a feed automatically. There are also tools on the market that can help transform traditional web content into the right format for distribution. Simply creating an XML version of your content allows Aggregators the ability to read, but this entails some knowledge of XML syntax. Another method is PC-based software that allows blogging with associated feeds to be automatically published to a specific website location.

STEP 2. Enter Data
The more frequent the better! Your readers and search engines like constantly updated content.

STEP 3. (Optional) Enhance your Feed
There are services like FeedBurner (www.feedburner.com) that allow you to track statistics on your feed that include subscribers, hits and other good stuff.

STEP 4. Required! Tell us about your Feed*
Once your up and running, go to http://www.rss-spider.com/fsb.php and submit your feed address so we can subscribe to your feed and keep apprised of your news automatically. You post and we redistribute immediately. You gain the exposure of the RSS-Spider with no hassle.

* Edited out Design World’s email address since 99.999% of user submitted feeds have no relation to design engineering.

SPLOGS SPLOGS SPLOGS… Why are we indexing crap?

Filed under: What Not... — Dave at 12:52 am on Saturday, April 7, 2007

In a recent post on DIGG.com I read about a study where someone figured out that about 75% of all blogs being hosted at BlogSpot where spam blogs. You can read the article at : http://www.infoniac.com/hi-tech/google-blogs-spam.html Anyhow… it got me to thinking… we’ve put a few things in to block spammers on this site, however, since we’re pulling in RSS feeds from all around the web, how many splogs have we indexed? Ugh… what a mess. Searching for “debt reduction” came back with 1000+ results instantly and most of them were within 3 days old. 1000 by the way is the ceiling of results that Sphinx is setup to return. Searching for “credit-card-debt” came up with the same results. So I’ve decided that Blogspot and all the other domains listed in the article above are going to be put on the “no spider” list & ban list effective Monday 4/9.

Next Page »