Need Help with Optimizing My C# Code for Better Performance
-
Hello everyone, I'm currently working on a C# project that processes a large dataset. While my code works correctly, it's running slower than expected, especially with bigger files. I'm looking for advice on optimizing the performance. Here's a brief overview:

- I'm reading data from a CSV file.
- Processing involves several nested loops and string manipulations.
- I'm writing the results to a new file.

I've tried using Parallel.For for the loops, but it didn't improve much. I suspect the string operations might be the bottleneck. Does anyone have suggestions on more efficient data handling or alternative approaches for processing large datasets in C#? Any tips on using memory more efficiently or reducing execution time would be greatly appreciated. Thank you in advance for your help. Best regards, Steve
-
Aside from posting your question in the correct forum - did you somehow miss the repeated "no programming questions" warnings at the top of the page? - the only thing we can suggest based on such a vague description is that you profile your code to find out where the actual bottleneck is. Anything else is going to be a wild stab in the dark, with very little chance of success.
"These people looked deep within my soul and assigned me a number based on the order in which I joined." - Homer
-
Profile.
CI/CD = Continuous Impediment/Continuous Despair
-
[A Fast CSV Reader](https://www.codeproject.com/Articles/9258/A-Fast-CSV-Reader) [GitHub - phatcher/CsvReader: Extended version of Sebastian Lorien's fast CSV Reader](https://github.com/phatcher/CsvReader) [NuGet Gallery | ETLBox.Csv 3.4.0](https://www.nuget.org/packages/ETLBox.Csv/)
Caveat Emptor. "Progress doesn't come from early risers – progress is made by lazy men looking for easier ways to do things." Lazarus Long
-
As has been mentioned, this is not the right place for programming queries - post your request here instead: C# Discussion Boards[^] But ... think about a few things first.

1) Remember that we can't see your screen, access your HDD, or read your mind - we only get exactly what you type to work with; we get no other context for your project. Imagine this: you go for a drive in the country, but you have a problem with the car. You call the garage, say "it broke", and turn off your phone. How long will you be waiting before the garage arrives with the right bits and tools to fix the car, given they don't know what make or model it is, who you are, what happened when it all went wrong, or even where you are? That's what you've done here. So stop typing as little as possible and try explaining things to people who have no way to access your project! Have a look at this: Asking questions is a skill[^] and think about what you need to know, and what you need to tell us, in order to get help.

2) Parallelism isn't a "magic bullet" which automagically speeds up your code: it is a complicated subject, and it's very easy to slow an app down if you don't understand what you are doing.

3) The first part of speeding up an app is identifying where bottlenecks occur: if you are guessing (and "I suspect ..." is a good clue that you are) then you are very likely to chase off down a blind alley, making tiny gains in performance while ignoring the genuinely slow code. Profile your app and find out what is actually taking time!

4) We can't tell you "do this and your code will be faster" without understanding your code - and nobody wants to wade through a whole app looking for performance gains: we are all volunteers and most of us have paying jobs to do! So show us the code fragments that we actually need to see - without those, we are just whistling in the dark!
"I have no idea what I did, but I'm taking full credit for it." - ThisOldTony "Common sense is so rare these days, it should be classified as a super power" - Random T-shirt AntiTwitter: @DalekDave is now a follower!
-
It sounds mildly interesting. More specifics would help.
-
The terms "nested loops" and "performance" do not belong in the same sentence. If you're nesting loops in what is supposed to be high-performance code, you're killing performance of that code.
Asking questions is a skill CodeProject Forum Guidelines Google: C# How to debug code Seriously, go read these articles. Dave Kreskowiak
-
A bit late to the party, but here are a few thoughts. String manipulations are, as you say, possibly the most likely bottleneck. Strings are immutable in C# (they don't change), so concatenating two strings involves copying the data from both into a new string. If you're doing this in a tight loop, there's going to be a lot of work (and garbage collection) involved. For building up strings, `StringBuilder` is the class of choice, and for substrings etc. you could look at `Span`s ([C# - All About Span: Exploring a New .NET Mainstay | Microsoft Learn](https://learn.microsoft.com/en-us/archive/msdn-magazine/2018/january/csharp-all-about-span-exploring-a-new-net-mainstay)). If your files are large, take a streaming approach. For instance, `File.ReadAllLines()` loads everything into memory, whereas `File.ReadLines()` returns an iterator so you only deal with the current line. Reading a massive file into memory often isn't a great idea. With those in place, the only other thing I can think of is pipelining: have one thread read the input file and present records via a queue to a processing stage, which processes them and then submits them via a queue to a writing stage. All these stages can run concurrently; TPL Dataflow is the thing for this. Finally, if you are using `Parallel` processing for nested loops, make sure you do it on the outer loop. There is overhead in parallelising work, and you get better performance that way. It's all rather difficult to say without seeing the code! Regards, Rob Philpott.
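To make those first two points concrete, here's a minimal sketch combining the `File.ReadLines()` streaming approach with a reused `StringBuilder`. The per-field transformation (trim and upper-case) is purely illustrative, standing in for whatever processing the original code does:

```csharp
using System;
using System.IO;
using System.Text;

static class CsvSketch
{
    // Normalize one CSV line: trim and upper-case each field.
    // (A hypothetical transformation, not from the original post.)
    public static string ProcessLine(string line, StringBuilder buffer)
    {
        buffer.Clear();                      // reuse one buffer: no per-line garbage
        var fields = line.Split(',');
        for (int i = 0; i < fields.Length; i++)
        {
            if (i > 0) buffer.Append(',');
            buffer.Append(fields[i].Trim().ToUpperInvariant());
        }
        return buffer.ToString();
    }

    // Stream lines from reader to writer one at a time, never holding
    // the whole file in memory - the File.ReadLines() idea.
    public static void Process(TextReader reader, TextWriter writer)
    {
        var buffer = new StringBuilder();
        string line;
        while ((line = reader.ReadLine()) != null)
            writer.WriteLine(ProcessLine(line, buffer));
    }
}
```

Hook it up to real files with something like `using var reader = File.OpenText("data.csv"); using var writer = new StreamWriter("out.csv"); CsvSketch.Process(reader, writer);` (file names are placeholders).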
-
Good stuff. A simple poor man's parallelization might be to spin off worker threads which each handle a different segment of the CSV file - e.g. a 1000-line file spins off 10 threads handling lines 0-99, 100-199, 200-299, and so on. How simple that turns out to be might be "not at all", though, if the parsing also needs to calculate things across ALL rows. Still totally possible/beneficial, just more difficult.
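A rough sketch of that segmenting idea, assuming the per-row work really is independent (method and variable names here are made up for illustration). `Partitioner.Create` hands each worker its own contiguous index range, much like the 0-99 / 100-199 / 200-299 threads described above:

```csharp
using System;
using System.Collections.Concurrent;
using System.Threading.Tasks;

static class SegmentedCsv
{
    // Process all rows in parallel, each worker owning a contiguous
    // slice of indices. Only safe when rows are independent!
    public static string[] ProcessRows(string[] rows, Func<string, string> work)
    {
        var results = new string[rows.Length];
        if (rows.Length == 0) return results;   // Partitioner needs a non-empty range

        Parallel.ForEach(Partitioner.Create(0, rows.Length), range =>
        {
            // range is [Item1, Item2): this worker's segment of the file
            for (int i = range.Item1; i < range.Item2; i++)
                results[i] = work(rows[i]);
        });
        return results;
    }
}
```

Note the trade-off: this needs all rows in memory up front (e.g. via `File.ReadAllLines`), which cuts against the streaming advice earlier in the thread - it fits when the file comfortably fits in RAM and the per-row work is the expensive part.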
-
Steves Smith wrote:
I suspect the string operations might be the bottleneck
Programmers are not good at guessing. That is why, as a different poster said, "profile". That involves instrumenting your code and then running it. The instrumentation measures the process, so there is no longer any guessing. Profiling involves two criteria: time and count. Both are important. A method that runs once and takes 5 minutes is different from a method that runs 300 times and takes one second each (which is also 5 minutes). You can instrument your code manually or find a tool (free or not) that does it for you.
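A minimal example of the manual-instrumentation route, capturing both criteria (total time and call count) per labelled section. The class and label names are illustrative, and this simple version is not thread-safe:

```csharp
using System;
using System.Collections.Generic;
using System.Diagnostics;

// Tiny manual profiler: records how many times each labelled section
// runs and how long it takes in total - the "time and count" criteria.
// Not thread-safe; wrap stats access in a lock for parallel code.
static class MiniProfiler
{
    private static readonly Dictionary<string, (long Ticks, int Count)> stats = new();

    public static T Measure<T>(string label, Func<T> body)
    {
        var sw = Stopwatch.StartNew();
        var result = body();
        sw.Stop();
        stats.TryGetValue(label, out var s);
        stats[label] = (s.Ticks + sw.ElapsedTicks, s.Count + 1);
        return result;
    }

    public static (double Ms, int Count) Report(string label)
    {
        var s = stats[label];
        // Stopwatch ticks -> milliseconds via Stopwatch.Frequency (ticks/second)
        return (s.Ticks * 1000.0 / Stopwatch.Frequency, s.Count);
    }
}
```

Usage would look like `var row = MiniProfiler.Measure("parse", () => ParseLine(line));` around each suspect call (where `ParseLine` is whatever your own parsing method is), then `MiniProfiler.Report("parse")` at the end tells you whether it's the once-for-5-minutes case or the 300-times-a-second case.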
-
A week has passed, and your rather provocative, categorical statement hasn't succeeded in provoking a single reaction :-) I smiled when I first saw it, not sure if it was intended as a pure joke or as a serious, but provocatively phrased, statement. Nested loops should of course raise your awareness level (and so should a single loop, although not necessarily as much). But some problems are by nature two- (or even multi-) dimensional, and lend themselves to nested loop implementations. The only alternative is to roll out one of the levels, which requires the iteration count to be fixed (and it is not always). That could reduce the code cache hit rate significantly, and in extreme cases even the virtual memory hit rate. So I hope no one takes your advice as Absolute Truth. It very strongly tells you to check twice what you put into the loops, whether simple or nested. But as single loops have their place, so have nested loops (though of course a smaller one) - even in high-performance code.
Religious freedom is the freedom to say that two plus two make five.
-
Sigh. WTF is it with you? Is it absolute gospel? NOOOOO! Should a nested loop be your first port of call for high-performance code? Again, NOOOOOO! Is it possible that you don't have a choice and have to use a nested loop in high-performance code? YESSSSSS! Happy now? Loops are only one specific cause of performance issues in code. I NEVER said they were the only reason for performance issues.
-
You had forgotten to flag your post as "Super sensitive matter! No critical remarks or alternate opinions, please!" If you do that next time, I will of course leave your post totally uncommented (and as soon as I discover the flag: unread).
-
Well, it's obvious my one small tip offended your sensibilities, so be offended all you want.