
Java HTML Scraping 403 error

  • 09-02-2009 3:25pm
    #1
    Registered Users, Registered Users 2 Posts: 648 ✭✭✭


    Hi everyone,
    I'm trying to fetch the page source of a Google query and store it in a StringBuffer, but I keep getting an HTTP 403 error. Is there a way to set the user agent or something similar in Java so I can actually access the page source? The method works fine on normal websites, so I assume it's some sort of anti-bot measure. Is there a way to get around it?

    Thanks,

    Simon

    PS: I can post code if you need it, but I'm mainly posting to see whether this is possible in Java.


Comments

  • Registered Users, Registered Users 2 Posts: 648 ✭✭✭simonw


    Mods can close the thread now; I found a solution myself.


  • Registered Users, Registered Users 2 Posts: 917 ✭✭✭Páid


    I think you should post the solution that worked for you, if only for the benefit of others who may have the problem in the future.
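
For later readers: the thread does not record which fix was actually used. As a minimal sketch of one common approach, the request can carry an explicit User-Agent header set through `HttpURLConnection.setRequestProperty`. The class name `PageFetcher`, the helper names `open` and `fetch`, and the UTF-8 decoding are assumptions made for illustration, not the original poster's code, and a site may still refuse automated requests regardless of the header.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.InputStreamReader;
import java.net.HttpURLConnection;
import java.net.URI;
import java.nio.charset.StandardCharsets;

// Hypothetical helper class; the name and methods are illustrative only.
class PageFetcher {

    // Prepares a connection with an explicit User-Agent header.
    // No network traffic happens until the response is requested.
    static HttpURLConnection open(String address, String userAgent) throws IOException {
        HttpURLConnection conn =
                (HttpURLConnection) URI.create(address).toURL().openConnection();
        conn.setRequestProperty("User-Agent", userAgent);
        return conn;
    }

    // Downloads the page body as text, failing loudly on any non-200 status
    // (for example the 403 described in the question).
    static String fetch(String address, String userAgent) throws IOException {
        HttpURLConnection conn = open(address, userAgent);
        try {
            int status = conn.getResponseCode();
            if (status != HttpURLConnection.HTTP_OK) {
                throw new IOException("HTTP " + status + " for " + address);
            }
            StringBuilder page = new StringBuilder();
            // Assumption: the response is UTF-8; a real client would read the charset
            // from the Content-Type header instead.
            try (BufferedReader in = new BufferedReader(
                    new InputStreamReader(conn.getInputStream(), StandardCharsets.UTF_8))) {
                String line;
                while ((line = in.readLine()) != null) {
                    page.append(line).append('\n');
                }
            }
            return page.toString();
        } finally {
            conn.disconnect();
        }
    }
}
```

Calling `PageFetcher.fetch(address, userAgent)` with whatever user-agent string you choose returns the page source as a `String`. `StringBuilder` stands in for the `StringBuffer` mentioned in the question because no synchronisation is needed here.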

