Jump to content


  • Content Count

  • Joined

  • Last visited

Everything posted by a__kc

  1. I didn't know, either. Excellent information
  2. Er, I wish I had reviewed the following MySQL reference page first (http://dev.mysql.com/doc/refman/4.1/en/cha...conversion.html). I think (not 100% sure) I corrupted some data just by using PhpMyAdmin to change collation. This was unheard of with MySQL 4.0!
  3. Same problem here, too. I guess TCH has upgraded MySQL to a version that supports collation? Unfortunately all of my previously collation-less data now have been (mistakenly) labeled "latin1_swedish_c" (apparently rather arbitarily since MySQL is a Swedish company). This would all be academic except that I can no longer view or properly manipulate the data via PhpMyAdmin. It's all scrambled and I hope this is only a presentation issue and not indicative of data corruption. Problem with "latin1_swedish_c": 1. My data are definitely not Latin1 encoded (they're UTF-8, to support
  4. Kevan, thanks for the tip. I'll take a look at Google Sitemaps -- it looks promising and I must say I'm happy some search engines are trying to refine the old-fashioned "brute force" approach. ---- timhodge, don't want to get too political here, but I'm pretty sure Baidu censors some of the info it finds (e.g. Tibet, Falungong, Taiwan independence stuff -- and p0rn ) in terms of hiding them from view AND ranking pro-government stuff higher. The law demands it. PS: Okay, please don't ask me what constitutes "pro-government p0rn"
  5. OK, I thought I'd give a brief update on the situation, especially regarding search engine bots. The top hits I received for March were: Unknown robot (identified by 'spider') 6.66 GB (mostly Baidu) Googlebot 124063+182 6.02 GB (Google) Inktomi Slurp 114994+6645 1.73 GB (Yahoo) MSNBot 4433+679 155.22 MB (MSN) Clearly the first two entries sucked up more than half of my alloted bandwidth for the month. The first entry, I discovered, can be attributed mostly to the Chinese Baidu SE. Given the nature of my contents I was reluctant to ban ou
  6. Andy, thanks a lot for the offer. I'll do that (Yeah, TCH rulez!) Fact is, I've poured over the awstats summaries quite a few times, and I still have a lot to learn on figuring out how to read logs. Btw, thanks to Bruce for moving the post here, where it belongs.
  7. This is awful. I've noticed significant jumps in bandwidth consumption in the last month or two. This is what Awstats shows now: Traffic viewed = 2.37 GB Traffic not viewed = 14.29 GB As Awstats tells us, "not viewed" includes "traffic generated by robots, worms, or replies with special HTTP status codes." Sadly, as of this writing, I've exceeded my alloted bandwidth I run a "modest" personal site, both in terms of scale and popularity. Some possible reasons: * It looks like most of the traffic can be blamed on busy/nosy search engine bots (some consuming sever
  8. I renamed entry and comment-related scripts. Worked for a while, until the spammers caught on. Later on I upgraded to Lazarus (AGB's cousin), which uses a user-defined question to test against bots. Alas, the spammers took pains to answer the question, so I still got spam. i.e. Against human-delivered spam I am still at a loss as to how best to deflect them. I checked to see if these spammers are using blacklisted open proxies (which would make it possible to block posts coming from them), well, apparently not I may end up having to approve all posts manually, at least until
  9. Just want to add that wget -q -t 2 --delete-after http://mysite.net/script.php does not appear to work (at least on my server). You get a /bin/sh: line 1: /usr/bin/wget: Permission denied Same thing with using Lynx the text-based browser (this used to work): /bin/sh: line 1: /usr/bin/lynx: Permission denied
  10. Today I saw 406 errors for the first time, quite surprising. The messages claim that one or another "resource" (name of script indicated) could not be found on the server. This seems to occur only when posting forms but only with some contents and not others. I've been able to use a set of anti-spam tools quite effectively, including one I coded. I also upgraded to MT 3.15 yesterday, though it sounds like it's a server-side thing. I might re-install but doubt that would help. Edited: I've opened a ticket. Please regard this post as "experience-sharing".
  11. Maybe it's a server-level configuration? Frankly I know little about Apache directives (just enough RewriteRule to get by). It would be useful, though, to be able to make some HTML files behave like .shtml. Otherwise I'd need to change filenames and set up redirects. Update: "XBitHack on" works fine for me (hurray!!). This is a lot better than forcing all .html files to be parsed
  12. I doubt it. Unless your script is using the Encode module, that should have no effect at all. BTW, to answer your last question, TCH (at least the server I'm on) is using Perl 5.8.0. While not having the latest bug fixes (and new bugs), it should be more than adequate. About "use utf8": Perl's official doc says: I no longer remember if I tried that, though I bet I have. As I indicated before, the utf-8 encoded scripts ran fine on XP (or my particular XP Perl), so...that's that. Maybe try again another time. Thanks for all the suggestions
  13. Thanks, guys, for the extra info. That injection statement looks weird but then I don't go around cracking people's sites. Since I did upgrade AG a few weeks ago to close off that hideous loophole, that's good. Hope everyone else here has done the upgrade, as well.
  14. Hi, Recently I've received two email apparently from my installation of Advanced Guestbook (2.3.1) telling me about SQL errors. My guestbook is apparently unaffected (so far?) but I'm concerned about some kind of script attack. I hope this is just some failed attempt at running a spam script rather than a security compromise. I looked up the IPs, one's from Germany, another Italy. My guess is they've been grabbed from anonymous proxies -- I don't really know. What do you think? Anyone with similar experience? Should I ignore this or...? Thanks. -------- Original M
  15. Hmm...I think my description of the problem with the unforwarded mail was possibly inaccurate: it seems that Mailman 2.1.2 (the current installed version) refuses to distribute mail whose subject line uses certain non-US ASCII encoding, for example Traditional Chinese (aka Big5). No warning is given and the guilty mail is apparently discarded. So possibly Spam Assassin is not responsible for the problem. Still, I'd appreciate any info as to whether our favorite assassin targets mail headed for list distribution -- thanks!
  16. Not that I know of. You can check size usage of each mail account, but that's all. If you're using that PHP function, I'd do as you say: track the usage programmatically.
  17. Hi, I've been happily tweaking Spam Assassin to handle the Junk Mail Phenomenon, and everything's worked out great -- thank you. Recently I activated a mailing list. I noticed mail sent through it have Spam Assassin headers, so apparently SA has been filtering the mail. I also noticed a couple e-mail I sent to the list have not shown up. I assume the e-mail has been designated as spam, but have no evidence whatsoever. Now, how can I receive the spam (real and false positive) sent through a Mailman-manged list? As the list is already running, this is somewhat urgent for me....
  18. What software are you using to make the RSS files? At my end I currently have Movable Type publish a few RSSs, and I also make a couple of my own from a database. I LOVE RSS! It's a simple way to export periodic contents for others to read, re-process, or otherwise display elsewhere. Rather than people going all over the Net to catch up on happenings, they'd load up their desktop or server-based RSS readers/webpages, and get everything they're interested in one go. (The only thing is, few of my buddies have the know-how to use it, which though it ain't their fault, does frustrate
  19. Has anyone had success executing a Perl CGI script written in Unicode (e.g. UTF-8)? On my XP development platform such scripts run just like any other. When I upload them to TCH servers, however, I get generic error messages. Converting the scripts to US ASCII (and sacrificing some features) allows the same scripts to run. Now I know Perl has great support for all sorts of Unicode features, which I've used reliably before, so I see no good reason why such a script wouldn't run. I've tried uploading in ASCII, binary to no avail. Any thoughts on why this might be the case??
  20. I just noticed a "backup" feature in my CPanel. This is a nice option for those who don't need automated backup (say via cron). One more reason to recommend TCH. Thumbs Up
  21. Google seems to send out kinder and gentler robots. In my experience, though, I've had to issue directives to other robots to keep them out of, say, a PHP calendar with a dynamic link for each day and many, many years. One robot downloaded 100 megs just traversing the calendar Mad!!!
  22. This is one of the major weaknesses of PHP: various internationalization and localization features are considered optional, and US-based servers tend not to have them compiled. BUT if you can afford to port your code to Perl (especially 5.8.x, which TCH has -- yeah!), the outlook is good. Rock Sign
  23. Honestly, www.subdomain.domain.org looks so ugly, I am inclined to show the user the friendly 404 (along with a site map and search box) Another thing is, rewrite rules make use of regular expressions, and bad expressions can slow down the server and delay page display.
  24. Hi, Jim: Thanks a lot! Worked like a charm. --Hank
  25. Hi, The cPanel Counter script has trouble logging a hit on the .dat file, so I got rid of ".dat". That works but now the problem is I am unable to reset it to a count from my previous life (I mean, host). Anyway to fix either of these problems? I still don't understand why the default "user.dat" file access does not work, though it seems to be a common enough problem to have been documented in the FAQ. If all fails, I will just have to install my own script. Thanks.
  • Create New...