|
Ä¿ÇÇÇâÀÌ ³ª´Â *NIX
Ä¿ÇǴнº
½Ã½ºÅÛ/³×Æ®¿÷/º¸¾ÈÀ» ´Ù·ç´Â °÷
|
|
|
|
ÀÌÀü ÁÖÁ¦ º¸±â :: ´ÙÀ½ ÁÖÁ¦ º¸±â |
±Û¾´ÀÌ |
¸Þ½ÃÁö |
truefeel Ä«Æä °ü¸®ÀÚ
°¡ÀÔ: 2003³â 7¿ù 24ÀÏ ¿Ã¸° ±Û: 1277 À§Ä¡: ´ëÇѹα¹
|
¿Ã·ÁÁü: 2006.4.11 È, 9:48 pm ÁÖÁ¦: ichiro(ÀÌÄ¡·Î) bot µé¶ô°Å¸²°ú Á¶Ä¡ |
|
|
User-Agent¸í : ichiro/2.0 (http://help.goo.ne.jp/door/crawler.html)
ÀϺ» °Ë»ö»çÀÌÆ®ÀÎ goo¿¡¼´Â ichiro/2.0 À̸§ÀÇ botÀ» »ç¿ëÇϰí ÀÖ½À´Ï´Ù.
ÀÌ ³ðÀÇ botÀÌ ½Ãµµ¶§µµ ¾øÀÌ µé¶ô°Å¸®°í ÀÖ½À´Ï´Ù. ¹¹~ NaverBotÀ̳ª ¾ßÈÄÀÇ Slurpµµ ¸¶Âù°¡ÁöÁö¸¸.
Slurp, msnbotÀº robots.txt¿¡ 'Crawl-delay: ÃÊ'¸¦ ¼³Á¤Çϸé ÁöÁ¤ÇÑ ½Ã°£¸¸Å delay¸¦ ÁÖ¸é¼ ¹æ¹®À» ÇÏÁö¸¸ ichiro´Â ¾ÈµÇ´Â °É·Î º¸À̳׿ä.
ichiro bot ÀÚü¸¦ ¿ÏÀüÈ÷ ¸·À»±î ÇÏ´Ù°¡ ÀÌ botÀÇ ¹ÝÀÀ »ìÆìº¸±â À§ÇØ
httpd.conf ¼³Á¤¿¡¼ ½Ã°£´ë Áß¿¡ 30~60ÃÊ »çÀÌ¿¡ Á¢¼ÓÀ» ÇÑ °æ¿ì(19:31:37¸é ÇØ´ç, 19:31:12¸é NO)¿¡´Â http://help.goo.ne.jp/door/crawler.html ·Î Æ÷¿öµùµÇµµ·Ï ÇØ¹ö·È½À´Ï´Ù.
ÄÚµå: |
RewriteEngine on
RewriteCond %{HTTP_USER_AGENT} ^ichiro
RewriteCond %{TIME_SEC} >30
RewriteCond %{TIME_SEC} <60
RewriteRule ^/(.*)$ http://help.goo.ne.jp/door/crawler.html [R]
|
ÀÌÇØÇϽðÚÁÒ?
1) ichiro botÀÌ¸é¼ 2) 30~60ÃÊ´ë »çÀÌ¿¡ Á¢¼ÓÀ» Çϸé 3) help.goo.ne.jp... ·Î Æ÷¿öµù
½½½½ ¹ÝÀÀÀ» ÁöÄѺÁ¾ß°Ú³×¿ä.
µ¡ºÙ¿©¼ Agent ¸ñ·ÏÀ» Á¦°øÇÏ´Â »çÀÌÆ® 2°÷À» ¼Ò°³ÇϰڽÀ´Ï´Ù.
1. List of User-Agents(Spiders, Robots, Crawler, Browser)
http://www.psychedelix.com/agents/index.shtml
¿©±ä Agent¸íÀ» A-Z¼øÀ¸·Î ³ª¿Çß½À´Ï´Ù. ¾öû³ ¾çÀÔ´Ï´Ù.
2. User Agent Database
http://www.tnl.net/ua/
À¯Çüº°(Browser, Proxy, Spider, RSS Reader...), OSº°·Î °Ë»öÇÒ ¼ö ÀÖ½À´Ï´Ù
robots.txt ¼³Á¤Àº http://www.robotstxt.org/ À» Âü°íÇϽðí,
robots.txt ÆÄÀÏÀ» ¸î°¡Áö Á¶°Ç¸¸ ¼±ÅÃÇϸé ÀÚµ¿ »ý¼ºÇØÁÖ´Â Robot Control Code Generation Tool »çÀÌÆ®µµ ÀÖ½À´Ï´Ù.
http://www.mcanerin.com/EN/search-engine/robots-txt.asp |
|
À§·Î |
|
 |
|
|
»õ·Î¿î ÁÖÁ¦¸¦ ¿Ã¸± ¼ö ¾ø½À´Ï´Ù ´ä±ÛÀ» ¿Ã¸± ¼ö ¾ø½À´Ï´Ù ÁÖÁ¦¸¦ ¼öÁ¤ÇÒ ¼ö ¾ø½À´Ï´Ù ¿Ã¸° ±ÛÀ» »èÁ¦ÇÒ ¼ö ¾ø½À´Ï´Ù ÅõÇ¥¸¦ ÇÒ ¼ö ¾ø½À´Ï´Ù
|
Powered by phpBB © 2001, 2005 phpBB Group
|