what is wrapper and how can uses in my application?

Question

1.00/5 (1 vote)

See more:

What's Wrapper?
I want to use the parser in a C # program. But this is Java parser. But there is a Wrapper for C #. How can I add it to the program.
It is best to use a parser?
link parser: http://htmlunit.sourceforge.net/

Posted 10-Feb-13 19:50pm

e.v.r

Add a Solution

Comments

Sergey Alexandrovich Kryukov 11-Feb-13 1:55am

I doubt this is a wrapper. Who told it so? In the page you referenced, also there are no mention of a wrapper.
What parser are you looking for? HTML? Is it well-formed XML or not?
—SA

e.v.r 11-Feb-13 2:08am

Thank you for comment.
this is a Htmlunit parser for java. but i want for c#. this parser have a wrapper.

Sergey Alexandrovich Kryukov 11-Feb-13 2:13am

Who told you it does? Where? How?!
You need a pure .NET parser, that's it. Please see my answer.
—SA

e.v.r 11-Feb-13 2:22am

The parser does not have the features that I want.
However, I noticed that the documentation. Someone did not tell me
I check the crawler and parser, but I did not get the result.
I need a good crawler that I felt would work better with the parser.
What do you think?

Sergey Alexandrovich Kryukov 11-Feb-13 2:36am

How can you mix up a crawler and parser. You need to explain your ultimate goal, otherwise the discussion makes no sense at all...
—SA

e.v.r 11-Feb-13 2:30am

In your opinion, how is the parser?
Do is the for asp? And I can not use?
http://www.beletsky.net/2010/09/crawling-web-sites-with-htmlagilitypack.html
http://blog.abodit.com/2010/03/a-simple-web-crawler-in-c-using-htmlagilitypack/

Sergey Alexandrovich Kryukov 11-Feb-13 2:36am

Please see my comment above...
—SA

2 solutions

Add a Solution

Add your solution here

Treat my content as plain text, not as HTML

Preview 0

…

Existing Members

Sign in to your account

...or Join us

Download, Vote, Comment, Publish.

Your Email
Password
Forgot your password?

Your Email
This email is in use. Do you need your password?
Optional Password

I have read and agree to the Terms of Service and Privacy Policy
Please subscribe me to the CodeProject newsletters

When answering a question please:

Read the question carefully.
Understand that English isn't everyone's first language so be lenient of bad spelling and grammar.
If a question is poorly phrased then either ask for clarification, ignore it, or edit the question and fix the problem. Insults are not welcome.
Don't tell someone to read the manual. Chances are they have and don't get it. Provide an answer or move on to the next question.

Let's work to help developers, not make them feel stupid.

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)

Sergey Alexandrovich Kryukov · Accepted Answer · 2013-02-10T20:02:00

Solution 1

Please see my comment to the question. I doubt you can simply use what you have referenced… You need to use something else.

If you want to parse HTML which is well-formed as XML (and only such HTMLs have a right to exist, but in real life… :-(), this is the best case, as .NET FCL has three different XML parsers (at least; I can reference/overview them if you want, but you will easily find them). If this is no case, you will need some HTML parser to do the dirty job. Try this one:
http://www.majestic12.co.uk/projects/html_parser.php[^].

[EDIT]

This is a problem of Web scraping: http://en.wikipedia.org/wiki/Web_scraping[^].

Please also see my past answers:
http://en.wikipedia.org/wiki/Web_scraping[^],
get specific data from web page[^],
How to get the data from another site[^].

—SA

Posted 10-Feb-13 20:02pm

Sergey Alexandrovich Kryukov

Updated 10-Feb-13 21:11pm

v2

Comments

Sergey Alexandrovich Kryukov 11-Feb-13 2:37am

[OP commented:]

Hello
Thank you for answer.
your link is good. but allow me that put check list for i need parser.
for example:

- The ability of a text file or an HTML DOM tree
- Ability to recognize the character Encoding
- Parse the JavaScript
- Interpretation and implementation of the Java
- Support HTTP and HTTPS protocols
- Support cookies
- Failing to identify whether the server response should be considered as an exception, or should be returned to a specific page (based on content)
- Support POST and GET methods
- Written in C #

Sergey Alexandrovich Kryukov 11-Feb-13 2:39am

First of all: please don't post you comment as "answer".

Now I see: you don't really have a clue, not at all... And your purpose is very questionable. So, 1) you need to learn how Web and HTTP work, the role of HTML, etc; 2) you need to explain your ultimate purpose (and, next time, always start with it), otherwise there is nothing to talk about...

—SA

e.v.r 11-Feb-13 2:57am

Thanks, sorry I am beginner in site.
I should explain what , so I mean it's so wide?

Sergey Alexandrovich Kryukov 11-Feb-13 2:59am

I advise to explain your ultimate purpose. The idea is to abstract out from your ideas on how you should approach your goals, as they can be wrong.
—SA

e.v.r 11-Feb-13 3:03am

I wrote a crawler that will list links to the site. But crawler can not find all the links correctly. I decided use the parser. But I could not found a proper parser, except that I gave the same link.
And also I can not use it because it was difficult for me to understand their code.
and i want to use this crawler in scanner.
ok?

Sergey Alexandrovich Kryukov 11-Feb-13 3:09am

The parser I reference is more then enough to solve the problem. Use also HttpWebBrowser.
—SA

Sergey Alexandrovich Kryukov 11-Feb-13 3:11am

Please see my updated answer, after [EDIT].
—SA

e.v.r 11-Feb-13 3:19am

Thanks for answer.
yes, I used this System.Net.WebRequest.WebRequest and ... and another your links but my crawler is not good.
Thank you

Sergey Alexandrovich Kryukov 11-Feb-13 3:21am

You are welcome.
When (and it) you see my advice makes sense, please accept the answer formally (green button) — thanks.
—SA

e.v.r 11-Feb-13 3:25am

Ok,
I have one question.
What are crawler between the parser? (Difference)

Sergey Alexandrovich Kryukov 11-Feb-13 3:34am

See Solution 2.
—SA

Sergey Alexandrovich Kryukov · Accepted Answer · 2013-02-10T21:33:00

e.v.r. wrote:
What are crawler between the parser? (Difference)

All "difference" questions are inherently incorrect. It only can be used as a figure of speech, if something is very similar. If some things have nothing to do one with another, you won't be able to define the notion of "difference". If you could, you would be able to answer "what's the difference between apple and Apple", but can you? :-)

And this is exactly the case. So, just learn what are they:

http://en.wikipedia.org/wiki/Parser[^],
http://en.wikipedia.org/wiki/WebCrawler[^].

The crawler has to use some parser, as its implementation detail, I guess. I depends though.

Clear enough, isn't it?

—SA