OpenAI crawler burning money for nothing
3 by babuskov | 0 comments on Hacker News.
I have a bunch of blog posts, with URLs like these: https://mywebsite/1-post-title https://mywebsite/2-post-title-second https://mywebsite/3-post-title-third https://mywebsite/4-etc For some reason, it tries every combination of numbers, so the requests look like this: https://mywebsite/1-post-title/2-post-title-second https://mywebsite/1-post-title/3-post-title-third etc. Since the blog engine simply discards everything after number (1,2,3...) and just serves the content for blog post #1, #2, #3,... the web server returns a valid page. However, all those pages are the same. The main problem here is that there is no website page that has such compound links like https://mywebsite/1-post-title/2-post-title-second So it's clearly some bug in the crawler. Maybe OpenAI is using AI code for their crawler because it has so dumb bugs you cannot believe any human would write it. They will make 90000 requests to load my small blog with 300 posts. Cannot imagine what happens with larger websites that have thousands of blog posts.
Don't forget to subscribe our youtube channel Click here:- http://www.youtube.com/c/techgk Product of the day
Post Top Ad
Responsive Ads Here
New ask Hacker News story: OpenAI crawler burning money for nothing
Share This
Subscribe to:
Post Comments (Atom)
Post Bottom Ad
Responsive Ads Here
Author Details
Templatesyard is a blogger resources site is a provider of high quality blogger template with premium looking layout and robust design. The main mission of templatesyard is to provide the best quality blogger templates which are professionally designed and perfectlly seo optimized to deliver best result for your blog.
No comments:
Post a Comment