Click here to Skip to main content
16,012,508 members
Please Sign up or sign in to vote.
0.00/5 (No votes)
See more:
I want to scrape (Screen Scrape) a website developed using Google Web Toolkit and the page which I am trying to scrape seems to be a flash page.

I use the following code.

HttpWebRequest request   = (HttpWebRequest)HttpWebRequest.Create("https://xxx);
request.Method            = "POST";
request.UserAgent         = "Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/40.0.2214.115 Safari/537.36";
request.Headers["Cookie"] = SessionID;
request.Referer           = "xxx";
request.Accept            = "*/*";

request.Headers["X-GWT-Permutation"] = "E81756AE355F23274CB68B43D62F0248";
request.Headers["X-GWT-Module-Base"] = "https://xxx";

byte[] buffer             = System.Text.Encoding.ASCII.GetBytes(encodeData("7|0|6|https://xxx"));  //);
Stream PostData           = request.GetRequestStream();

PostData.Write(buffer, 0, buffer.Length);
PostData.Close();

HttpWebResponse response = (HttpWebResponse)request.GetResponse();
Stream stream            = response.GetResponseStream();


I get an output throwing an error as below

/EX[2,1,["com.google.gwt.user.client.rpc.IncompatibleRemoteServiceException/3936916533","Parameter 0 of is of an unknown type 'java.lang.String%2F2004016611'"],0,7]
Posted
Comments
Richard MacCutchan 5-Mar-15 3:42am    
You need to ask the people who own the site.

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)



CodeProject, 20 Bay Street, 11th Floor Toronto, Ontario, Canada M5J 2N8 +1 (416) 849-8900