ansaurus

Question

What is the correct way in C# to download a file from the Internet and write it on the fly?

Answer 1

+2 A:

In the full framework, the simplest way to do this is to use the WebClient class's DownloadFile method, like this:

using(var wc = new WebClient()) {
    wc.DownloadFile(url, filePath);
}

EDIT: To report the download progress, call DownloadFileAsync and listen for the DownloadProgressChanged event. You can also cancel the download by calling the CancelAsync method.

SLaks 2010-01-08 16:13:51

Your suggestion won't help with these two requirements as stated in his question: "ability to report the percentage completed and resume the downloads"

David Stratton 2010-01-08 16:16:26

Technically (from what I remember) you should be able to do both of those things with the WebClient class... But I do not have that class available in the Compact Framework, which is why it has to be done with HttpWebRequest.

Ben 2010-01-08 16:19:20

I didn't notice that you were using the compact framework. I'll keep the answer for reference.

SLaks 2010-01-08 16:22:01

Sorry, I didn't mention that in the first place... I edited my thread. Thanks for the effort though.

Ben 2010-01-08 16:23:49

Answer 2

+2 A:

First, I would get rid of the finally clause and change the code to use "using" statements.

Anything that implements IDisposable should be handled that way so that its resources are released deterministically, as soon as the block exits, instead of whenever the garbage collector eventually runs.

For example:

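HttpWebRequest webReq = (HttpWebRequest)WebRequest.Create(_url);
// HttpWebRequest is not IDisposable; the response it returns is.
using (HttpWebResponse webResp = (HttpWebResponse)webReq.GetResponse()) {
    /* more code here... */
}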

Second, I wouldn't declare my variables at the top with null values (à la Pascal style). See the example above.

Third, the download should run in its own thread, which syncs with a callback function on the main thread to report status. Put the sync call in the middle of your while loop.

Chris Lively 2010-01-08 16:18:47
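
A minimal sketch of the read/write loop described in the third point, assuming the caller already runs it on a worker thread. The names DownloadSketch, CopyWithProgress, Download and onProgress are hypothetical and not taken from the original code. Marshalling the callback onto the main thread (for example with Control.Invoke) is left to the caller.

```csharp
using System;
using System.IO;
using System.Net;

public static class DownloadSketch
{
    // Copies source to destination in chunks and reports whole-percent
    // progress through onProgress. totalBytes <= 0 means "length unknown",
    // in which case no percentage is reported.
    public static long CopyWithProgress(Stream source, Stream destination,
                                        long totalBytes, Action<int> onProgress)
    {
        byte[] buffer = new byte[8192];
        long copied = 0;
        int lastPercent = -1;
        int read;
        while ((read = source.Read(buffer, 0, buffer.Length)) > 0)
        {
            destination.Write(buffer, 0, read);
            copied += read;
            if (totalBytes > 0 && onProgress != null)
            {
                int percent = (int)(copied * 100 / totalBytes);
                if (percent != lastPercent)   // report only when the value changes
                {
                    lastPercent = percent;
                    onProgress(percent);
                }
            }
        }
        return copied;
    }

    // Downloads url to filePath; every disposable is wrapped in a using block.
    public static void Download(string url, string filePath, Action<int> onProgress)
    {
        HttpWebRequest request = (HttpWebRequest)WebRequest.Create(url);
        using (HttpWebResponse response = (HttpWebResponse)request.GetResponse())
        using (Stream body = response.GetResponseStream())
        using (FileStream file = new FileStream(filePath, FileMode.Create, FileAccess.Write))
        {
            CopyWithProgress(body, file, response.ContentLength, onProgress);
        }
    }
}
```

Keeping the copy loop separate from the HTTP code means it can be exercised with in-memory streams, and the same loop works whether or not the server reports a content length.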

Hi there, thanks for the response. Yes, that method is executed on its own thread for each download. I was worried about memory leaks, so that's why I was being careful to close/dispose/set to null everything. I take it that I should still close the FileStream and Stream? Thanks.

Ben 2010-01-08 16:33:51

If it implements IDisposable, use the "using" statement. That will take care of what you need. Two other reasons: 1. Not all exceptions will actually fire your finally clause. Some will blow right past to the OS. 2. If anything were to blow up before you instantiated one of the objects you are trying to close, then you would have another exception thrown during the execution of the finally clause, which means something won't get closed or disposed of properly.

Chris Lively 2010-01-08 17:46:23

I thought that 'using' statements are converted into try/finally blocks by the compiler, so your item 1 will still apply for the few exceptions that don't get caught. Also, if an 'uncatchable' exception is thrown, it's likely that whatever is in that finally is the least of your worries at that point.

Matt 2010-01-12 02:24:19
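
The point about 'using' and try/finally can be checked with a small sketch. The types Tracked and UsingSketch are hypothetical helpers written for illustration. The second method is only an approximation of what the compiler generates for the first.

```csharp
using System;

public sealed class Tracked : IDisposable
{
    public bool Disposed { get; private set; }
    public void Dispose() { Disposed = true; }
}

public static class UsingSketch
{
    // Returns true if the resource was disposed even though the body threw.
    public static bool DisposedAfterThrow()
    {
        Tracked resource = new Tracked();
        try
        {
            using (resource)
            {
                throw new InvalidOperationException("simulated failure");
            }
        }
        catch (InvalidOperationException)
        {
            // The using block has already run Dispose() by this point.
        }
        return resource.Disposed;
    }

    // Roughly the try/finally form that the using block above stands for.
    public static bool ExpandedForm()
    {
        Tracked resource = new Tracked();
        try
        {
            try
            {
                throw new InvalidOperationException("simulated failure");
            }
            finally
            {
                if (resource != null) ((IDisposable)resource).Dispose();
            }
        }
        catch (InvalidOperationException)
        {
            // Swallowed so the method can report the disposal state.
        }
        return resource.Disposed;
    }
}
```

Both methods report that Dispose ran, so the two forms behave the same for ordinary exceptions. What 'using' adds is the null check and the guarantee that the variable was assigned before the block is entered.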

Answer 3

A:

You can get all the tips you need from existing code that's been published and is freely available.

Such as here: http://www.codeproject.com/KB/IP/MyDownloader.aspx

Better yet, take the existing code and modify it for your needs. (That's why the code gets posted there in the first place.)

David Stratton 2010-01-08 16:20:33

Answer 4

A:

If you need to track the progress, use WebClient.DownloadFileAsync along with the DownloadProgressChanged and DownloadFileCompleted events:

WebClient wc = new WebClient();
wc.DownloadProgressChanged += wc_DownloadProgressChanged;
wc.DownloadFileCompleted += wc_DownloadFileCompleted;
wc.DownloadFileAsync(sourceUri, localPath);

...

private void wc_DownloadProgressChanged(object sender, DownloadProgressChangedEventArgs e)
{
    ...
}

private void wc_DownloadFileCompleted(object sender, AsyncCompletedEventArgs e)
{
    ...
}

Thomas Levesque 2010-01-08 16:21:28

Answer 5

+1 A:

From a user experience perspective, you should be able to answer a lot of these questions by looking at an application like Internet Explorer or Firefox. For example:

In Internet Explorer, new data is reported every few kilobytes, up to the one megabyte mark. After that, it is reported in 100 kilobyte increments.
How often you write to the buffer depends on whether you're allowing recovery when the connection is dropped. If you're like IE and force the user to start from scratch, it doesn't really matter how often you save your buffer as long as you do it eventually. Set your saving based on "acceptable loss".
Your application should obviously not take 100% of the CPU, since that isn't good etiquette in the programming world. Have your threads at least sleep long enough not to dominate the CPU.
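
If recovery after a dropped connection is wanted, one possible approach is to ask the server for only the missing tail of the file. This is a sketch under assumptions: the server must support HTTP range requests, and the names ResumeSketch, GetResumeOffset and Resume are hypothetical. Whatever has already been flushed to disk is what survives a drop, which is where the "acceptable loss" setting comes in.

```csharp
using System;
using System.IO;
using System.Net;

public static class ResumeSketch
{
    // Number of bytes already on disk, or 0 if the file does not exist yet.
    public static long GetResumeOffset(string filePath)
    {
        return File.Exists(filePath) ? new FileInfo(filePath).Length : 0;
    }

    // Continues a partial download by requesting the bytes after the offset.
    public static void Resume(string url, string filePath)
    {
        long offset = GetResumeOffset(filePath);
        HttpWebRequest request = (HttpWebRequest)WebRequest.Create(url);
        if (offset > 0)
        {
            // Sends "Range: bytes=<offset>-". The int overload limits this
            // sketch to files smaller than 2 GB.
            request.AddRange((int)offset);
        }
        using (HttpWebResponse response = (HttpWebResponse)request.GetResponse())
        using (Stream body = response.GetResponseStream())
        {
            // 206 means the server honoured the range; any other success
            // status means it is sending the whole file, so start over.
            bool append = offset > 0
                && response.StatusCode == HttpStatusCode.PartialContent;
            using (FileStream file = new FileStream(filePath,
                       append ? FileMode.Append : FileMode.Create, FileAccess.Write))
            {
                byte[] buffer = new byte[8192];
                int read;
                while ((read = body.Read(buffer, 0, buffer.Length)) > 0)
                {
                    file.Write(buffer, 0, read);
                }
            }
        }
    }
}
```

Note that if the local file is already complete, a server will typically reject the range request and GetResponse will throw a WebException, so a real implementation would need to handle that case.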

Your code, generally speaking, is functional, though it could stand a lot of refactoring to make it a little cleaner/easier to read. You might consider using the WebClient class in .NET, too, but if this is a learning exercise, you're doing it the right way.

Good luck! You're on the right track.

Ed Altorfer 2010-01-08 16:21:34

Thank you, I have found this the most useful response thus far. I expect writing the buffer to file takes *relatively* a lot of CPU power compared to having a larger buffer and writing less often.

Ben 2010-01-08 16:30:46

Well, file IO is always going to be one of your largest bottlenecks, so if you don't HAVE to write to disk that often, you should avoid it. Like I said, though, figure out what your acceptable loss threshold is.

Ed Altorfer 2010-01-08 16:51:15

You might also consider either (a) figuring out which response is best and marking it as such or (b) compiling several of the responses into a new answer and marking it as correct.

Ed Altorfer 2010-01-08 16:52:09
