Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Collapse
Code Project
  1. Home
  2. The Lounge
  3. performance puzzle

performance puzzle

Scheduled Pinned Locked Moved The Lounge
questionc++performancecsharpcss
65 Posts 18 Posters 0 Views 1 Watching
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • M Mechanical

    Super Lloyd wrote:

    any C# performance tip is welcome!

    That is like looking for performance from Java: You won't get it. If you need (or want) performance, don't do retards' languages. I know this thread isn't meant to incite hatred against retards' languages.

    NULL

    D Offline
    D Offline
    Derek Viljoen
    wrote on last edited by
    #38

    Mechanical, I checked your profile. You've never authored an article, posted a snippet, entered a blog post, or otherwise shared any original thought, except to snipe at other people's comments on the message boards. In other words, you have yet to provide any empirical evidence for your worth to this site. No be a good boy and go away, please.

    M 1 Reply Last reply
    0
    • M Mechanical

      My statement was made because I had to maintain some Native code I wrote some years ago. I realized how retarded I had become doing retards' languages.

      peterchen wrote:

      Are you open to evidence for my statement

      I'm all ears (big, pointy ears).

      NULL

      P Offline
      P Offline
      peterchen
      wrote on last edited by
      #39

      The Raymond Chen vs. Rico Mariani competition comes to mind. Clickety[^] Clickety-Click[^] The final result, at first look, supports your statement: The C++ version completed before the .NET application even did load. However, when you look at the history and details, you see that the road to the superfast C++ application was a rocky one, introducing subtle bugs and requiring a very experienced developer to counter the rather simple improvements to the C# solution.


      With extensions like LINQ, Closures, Lambdas, Generics and some syntactic sugar, C# has become a rather complex and powerful language. It's complex and powerful in a different sense than C++, but still.

      Agh! Reality! My Archnemesis![^]
      | FoldWithUs! | sighist | WhoIncludes - Analyzing C++ include file hierarchy

      modified on Friday, November 5, 2010 11:17 AM

      1 Reply Last reply
      0
      • S Super Lloyd

        Well it's only tip #237, so I guess it's not the 1st thing I should rush to do! :)

        A train station is where the train stops. A bus station is where the bus stops. On my desk, I have a work station.... _________________________________________________________ My programs never have bugs, they just develop random features.

        P Offline
        P Offline
        peterchen
        wrote on last edited by
        #40

        Hey! Take stop reading my mind!

        Agh! Reality! My Archnemesis![^]
        | FoldWithUs! | sighist | WhoIncludes - Analyzing C++ include file hierarchy

        1 Reply Last reply
        0
        • D DaveAuld

          Nothing to do with the optimisation, but what about using the StopWatch class instead of datetime? Start the stop watch immediately before you enter the loop, and then stop it as soon as the loop exists.

          Dave Find Me On: Web|Facebook|Twitter|LinkedIn CPRepWatcher now available as Packaged Chrome Extension, visit my articles for link.

          L Offline
          L Offline
          Luc Pattyn
          wrote on last edited by
          #41

          I agree. On my machine 10 iterations take 0.5 milliseconds, which DateTime.Now can't catch at all. :)

          Luc Pattyn [Forum Guidelines] [Why QA sucks] [My Articles] Nil Volentibus Arduum

          Please use <PRE> tags for code snippets, they preserve indentation, and improve readability.

          1 Reply Last reply
          0
          • S Super Lloyd

            I already posted the source code: http://www.codeproject.com/Lounge.aspx?msg=3657380#xx3657380xx[^] As to the disassembly, err.... well, ok, but I have to cut the C++ version as it huge loads of stuff coming the #include(s), hopefully I'll cut it right ... disassembled JITted C#

            --- C:\Dev\Test\Performance\CSperf\Program.cs ----------------------------------
            if (args.Length != 1)
            00000000 push ebp
            00000001 mov ebp,esp
            00000003 push edi
            00000004 push esi
            00000005 sub esp,6Ch
            00000008 mov esi,ecx
            0000000a lea edi,[ebp-74h]
            0000000d mov ecx,1Bh
            00000012 xor eax,eax
            00000014 rep stos dword ptr es:[edi]
            00000016 mov ecx,esi
            00000018 mov dword ptr [ebp-0Ch],esp
            0000001b mov dword ptr [ebp-74h],2941BF9Bh
            00000022 mov dword ptr [ebp-60h],ecx
            00000025 cmp dword ptr ds:[001B7CB8h],0
            0000002c je 00000033
            0000002e call 62889E21
            00000033 xor edx,edx
            00000035 mov dword ptr [ebp-28h],edx
            00000038 xor edx,edx
            0000003a mov dword ptr [ebp-1Ch],edx
            0000003d xor edx,edx
            0000003f mov dword ptr [ebp-24h],edx
            00000042 xor edx,edx
            00000044 mov dword ptr [ebp-2Ch],edx
            00000047 xor edx,edx
            00000049 mov dword ptr [ebp-20h],edx
            0000004c xor edx,edx
            0000004e mov dword ptr [ebp-10h],edx
            00000051 mov eax,dword ptr [ebp-60h]
            00000054 cmp dword ptr [eax+4],1
            00000058 je 0000006B
            {
            Console.WriteLine("Usage:\tsieve [iterations]");
            0000005a mov ecx,dword ptr ds:[03832088h]
            00000060 call 62202DB4
            return;
            00000065 nop
            00000066 jmp 0000019A
            }

            		int NUM = int.Parse(args\[0\]);
            

            0000006b mov eax,dword ptr [ebp-60h]
            0000006e cmp dword ptr [eax+4],0
            00000072 ja 00000079
            00000074 call 6288BA5C
            00000079 mov ecx,dword ptr [eax+0Ch]
            0000007c call 6220EAC4
            00000081 mov dword ptr [ebp-38h],eax
            00000084 mov eax,dword ptr [ebp-38h]
            00000087 mov dword ptr [ebp-10h],eax
            var t0 = DateTime.Now;
            0000008a lea ecx,[ebp-40h]
            0000008d call 61D07768

            D Offline
            D Offline
            Daniel Grunwald
            wrote on last edited by
            #42

            You need to be careful, the JIT will turn off optimizations when running inside the Visual Studio debugger (even in release builds!). To get the optimized JITted assembly code, you need to run your program from the command line, then attach the debugger to the running process, and click Break.

            S 1 Reply Last reply
            0
            • S Super Lloyd

              yes, do post back please! :P

              A train station is where the train stops. A bus station is where the bus stops. On my desk, I have a work station.... _________________________________________________________ My programs never have bugs, they just develop random features.

              S Offline
              S Offline
              Simon P Stevens
              wrote on last edited by
              #43

              Well I've written a C# version that runs 15-20% faster that your original on my PC. It doesn't use unsafe either. But I don't think you'll like it.

              private static void SafeTest_Improved1(string[] args)
              {
              int NUM = int.Parse(args[0]);
              int end = 8193;
              var primes = new byte[8193]{
              1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
              1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
              1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
              1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
              1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
              1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
              1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
              1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
              1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
              1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
              1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
              1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
              1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
              1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
              1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
              1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
              1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
              1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
              1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
              1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,

                  1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1
              
              S 1 Reply Last reply
              0
              • D Daniel Grunwald

                You need to be careful, the JIT will turn off optimizations when running inside the Visual Studio debugger (even in release builds!). To get the optimized JITted assembly code, you need to run your program from the command line, then attach the debugger to the running process, and click Break.

                S Offline
                S Offline
                Super Lloyd
                wrote on last edited by
                #44

                To test the speed I was running each program from the command line! ^_^

                A train station is where the train stops. A bus station is where the bus stops. On my desk, I have a work station.... _________________________________________________________ My programs never have bugs, they just develop random features.

                D 1 Reply Last reply
                0
                • S Super Lloyd

                  To test the speed I was running each program from the command line! ^_^

                  A train station is where the train stops. A bus station is where the bus stops. On my desk, I have a work station.... _________________________________________________________ My programs never have bugs, they just develop random features.

                  D Offline
                  D Offline
                  Daniel Grunwald
                  wrote on last edited by
                  #45

                  Yes but where did you get the disassembled C# code from? It looks like all locals are on the stack (never in registers), which the JIT normally does only when a debugger is attached.

                  S 1 Reply Last reply
                  0
                  • L Lost User

                    Are you sure you got the optimized version of the C# code? I mean, yea it's supposed to be lame, but it's worse than I expected

                    S Offline
                    S Offline
                    Super Lloyd
                    wrote on last edited by
                    #46

                    Yes it! :( Is it that bad? I'm not that conversant in ASM, so I couldn't tell... But in a way that's good news, they have room for improvement! Now, what are they waiting for!?! :rolleyes:

                    A train station is where the train stops. A bus station is where the bus stops. On my desk, I have a work station.... _________________________________________________________ My programs never have bugs, they just develop random features.

                    1 Reply Last reply
                    0
                    • D Daniel Grunwald

                      Yes but where did you get the disassembled C# code from? It looks like all locals are on the stack (never in registers), which the JIT normally does only when a debugger is attached.

                      S Offline
                      S Offline
                      Super Lloyd
                      wrote on last edited by
                      #47

                      Ha, the disassembly I got it from VS! Not sure how to get it otherwise... Because on the disk it's not compiled, it's just MSIL, and even if I NGen I dunno where they store the compiled version! (BTW I just tried NGen and it was not faster!?!)

                      A train station is where the train stops. A bus station is where the bus stops. On my desk, I have a work station.... _________________________________________________________ My programs never have bugs, they just develop random features.

                      D 1 Reply Last reply
                      0
                      • M Mechanical

                        Aamir Butt wrote:

                        In some cases, C# can outperform C/C++

                        How about ALL cases ? People who talk about C# (or any other retards' language) performance are talking about void while engaged with other C# (or any other retards' language) performance addicts in group masturbation. The link in your post points to another post by a C# guru (read as: VB guru). It says: "Remember : Engineers are expensive and servers are not!". Look at that! I mean just look at how pathetic that retard is.

                        Aamir Butt wrote:

                        However, if you like to follow the sheep, go ahead.

                        I'll bet you are one of those that believe Java is the future of 3D.

                        NULL

                        A Offline
                        A Offline
                        Aamir Butt
                        wrote on last edited by
                        #48

                        Mechanical wrote:

                        I'll bet you are one of those that believe Java is the future of 3D.

                        Java... Hahahaha... well I can't stop laughing. I haven't worked in Java since school. And I bet you are the one who writes all his linked list, hashtable, search and sorting algorithm from scratch because he believes that he can do better than those retards.

                        Mechanical wrote:

                        How about ALL cases ?

                        Well, give me a case you are talking about but before that, give me the name of a NON-RETARD language as per your understanding.

                        1 Reply Last reply
                        0
                        • D DaveAuld

                          Nothing to do with the optimisation, but what about using the StopWatch class instead of datetime? Start the stop watch immediately before you enter the loop, and then stop it as soon as the loop exists.

                          Dave Find Me On: Web|Facebook|Twitter|LinkedIn CPRepWatcher now available as Packaged Chrome Extension, visit my articles for link.

                          S Offline
                          S Offline
                          Super Lloyd
                          wrote on last edited by
                          #49

                          well, I run 10000 thousand iteration to compare them, so DateTime seems precise enough! :) I did use StopWatch in the past but, I dunno, was never converted to it, I guess I never needed to make measurement precise enough!

                          A train station is where the train stops. A bus station is where the bus stops. On my desk, I have a work station.... _________________________________________________________ My programs never have bugs, they just develop random features.

                          1 Reply Last reply
                          0
                          • S Super Lloyd

                            Ha, the disassembly I got it from VS! Not sure how to get it otherwise... Because on the disk it's not compiled, it's just MSIL, and even if I NGen I dunno where they store the compiled version! (BTW I just tried NGen and it was not faster!?!)

                            A train station is where the train stops. A bus station is where the bus stops. On my desk, I have a work station.... _________________________________________________________ My programs never have bugs, they just develop random features.

                            D Offline
                            D Offline
                            Daniel Grunwald
                            wrote on last edited by
                            #50

                            Super Lloyd wrote:

                            Ha, the disassembly I got it from VS! Not sure how to get it otherwise...

                            To get the optimized JITted assembly code, you need to run your program from the command line, then attach the debugger to the running process, and click Break. Because your Main function was already running before you attached the debugger, it will still have the real native code. .NET doesn't re-jit anything when attaching a debugger.

                            S 1 Reply Last reply
                            0
                            • D Dan Neely

                              Aamir Butt wrote:

                              See here for a performance comparison between different languages[^]

                              The comments on that post were staggeringly stupid. :((

                              3x12=36 2x12=24 1x12=12 0x12=18

                              A Offline
                              A Offline
                              Aamir Butt
                              wrote on last edited by
                              #51

                              Dan Neely wrote:

                              The comments on that post were staggeringly stupid.

                              Yes, unfortunately. There are other performance comparisons as well but there is no point in posting the links here. As i said earlier, following the sheep..... :)

                              1 Reply Last reply
                              0
                              • S Simon P Stevens

                                Well I've written a C# version that runs 15-20% faster that your original on my PC. It doesn't use unsafe either. But I don't think you'll like it.

                                private static void SafeTest_Improved1(string[] args)
                                {
                                int NUM = int.Parse(args[0]);
                                int end = 8193;
                                var primes = new byte[8193]{
                                1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
                                1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
                                1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
                                1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
                                1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
                                1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
                                1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
                                1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
                                1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
                                1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
                                1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
                                1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
                                1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
                                1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
                                1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
                                1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
                                1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
                                1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
                                1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
                                1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,

                                    1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1
                                
                                S Offline
                                S Offline
                                Super Lloyd
                                wrote on last edited by
                                #52

                                :laugh: you could have moved the "for(...) prims[i] = 1" statement out of the loop for the same (and more condensed) result! But this is an unfair comparison with C++, you removed a whole set a "required" operation for the algorithm! Although... I wonder if an array copy from an initialized array would speed up things....

                                A train station is where the train stops. A bus station is where the bus stops. On my desk, I have a work station.... _________________________________________________________ My programs never have bugs, they just develop random features.

                                1 Reply Last reply
                                0
                                • D Derek Viljoen

                                  Mechanical, I checked your profile. You've never authored an article, posted a snippet, entered a blog post, or otherwise shared any original thought, except to snipe at other people's comments on the message boards. In other words, you have yet to provide any empirical evidence for your worth to this site. No be a good boy and go away, please.

                                  M Offline
                                  M Offline
                                  Mechanical
                                  wrote on last edited by
                                  #53

                                  Derek Viljoen wrote:

                                  In other words, you have yet to provide any empirical evidence for your worth to this site. No be a good boy and go away, please.

                                  You have much to learn, little one.

                                  NULL

                                  R 1 Reply Last reply
                                  0
                                  • L Lost User

                                    Post it?

                                    S Offline
                                    S Offline
                                    Super Lloyd
                                    wrote on last edited by
                                    #54

                                    okay, I ran it from the command line and attached a debugger, here is the release version, run and optimized at its best I guess. what do you think? (well, if you can read that! :~ ) (no inline C# comment this time!) ===============================================

                                    00000000 push ebp
                                    00000001 mov ebp,esp
                                    00000003 push edi
                                    00000004 push esi
                                    00000005 push ebx
                                    00000006 sub esp,48h
                                    00000009 mov esi,ecx
                                    0000000b lea edi,[ebp-54h]
                                    0000000e mov ecx,12h
                                    00000013 xor eax,eax
                                    00000015 rep stos dword ptr es:[edi]
                                    00000017 mov ecx,esi
                                    00000019 mov dword ptr [ebp-10h],esp
                                    0000001c mov dword ptr [ebp-54h],496CC9B0h
                                    00000023 cmp dword ptr [ecx+4],1
                                    00000027 je 00000043
                                    00000029 call 620CD310
                                    0000002e mov ecx,eax
                                    00000030 mov edx,dword ptr ds:[03522030h]
                                    00000036 mov eax,dword ptr [ecx]
                                    00000038 call dword ptr [eax+000000D8h]
                                    0000003e jmp 0000017B
                                    00000043 cmp dword ptr [ecx+4],0
                                    00000047 jbe 00000191
                                    0000004d mov esi,dword ptr [ecx+0Ch]
                                    00000050 call 620C0CC0
                                    00000055 push eax
                                    00000056 mov ecx,esi
                                    00000058 mov edx,7
                                    0000005d call 620CF2F0
                                    00000062 mov ebx,eax
                                    00000064 lea edi,[ebp-30h]
                                    00000067 xor eax,eax
                                    00000069 stos dword ptr es:[edi]
                                    0000006a stos dword ptr es:[edi]
                                    0000006b lea ecx,[ebp-30h]
                                    0000006e call 62069650
                                    00000073 call 62069730
                                    00000078 mov ecx,eax
                                    0000007a lea eax,[ebp-30h]
                                    0000007d push dword ptr [eax+4]
                                    00000080 push dword ptr [eax]
                                    00000082 lea edx,[ebp-18h]
                                    00000085 mov eax,dword ptr [ecx]
                                    00000087 call dword ptr [eax+48h]
                                    0000008a mov ecx,801h
                                    0000008f push 0
                                    00000091 dec ecx
                                    00000092 jne 0000008F
                                    00000094 mov ecx,esp
                                    00000096 mov dword ptr [ebp-10h],esp
                                    00000099 mov esi,ecx
                                    0000009b jmp 000000DA
                                    0000009d xor edx,edx
                                    0000009f mov byte ptr [esi+edx],1
                                    000000a3 inc edx
                                    000000a4 cmp edx,2001h
                                    000000aa jl 0000009F
                                    000000ac mov ecx,2
                                    000000b1 cmp byte ptr [esi+ecx],0
                                    000000b5 je 000000D1
                                    000000b7 mov edx,ecx
                                    000000b9 add edx,edx
                                    0000

                                    L D 2 Replies Last reply
                                    0
                                    • D Daniel Grunwald

                                      Super Lloyd wrote:

                                      Ha, the disassembly I got it from VS! Not sure how to get it otherwise...

                                      To get the optimized JITted assembly code, you need to run your program from the command line, then attach the debugger to the running process, and click Break. Because your Main function was already running before you attached the debugger, it will still have the real native code. .NET doesn't re-jit anything when attaching a debugger.

                                      S Offline
                                      S Offline
                                      Super Lloyd
                                      wrote on last edited by
                                      #55

                                      Thanks, done! Posted there: http://www.codeproject.com/Lounge.aspx?msg=3657593#xx3657593xx[^] However, I'm not conversant in ASM, so I can't tell much from it...

                                      A train station is where the train stops. A bus station is where the bus stops. On my desk, I have a work station.... _________________________________________________________ My programs never have bugs, they just develop random features.

                                      1 Reply Last reply
                                      0
                                      • S Super Lloyd

                                        okay, I ran it from the command line and attached a debugger, here is the release version, run and optimized at its best I guess. what do you think? (well, if you can read that! :~ ) (no inline C# comment this time!) ===============================================

                                        00000000 push ebp
                                        00000001 mov ebp,esp
                                        00000003 push edi
                                        00000004 push esi
                                        00000005 push ebx
                                        00000006 sub esp,48h
                                        00000009 mov esi,ecx
                                        0000000b lea edi,[ebp-54h]
                                        0000000e mov ecx,12h
                                        00000013 xor eax,eax
                                        00000015 rep stos dword ptr es:[edi]
                                        00000017 mov ecx,esi
                                        00000019 mov dword ptr [ebp-10h],esp
                                        0000001c mov dword ptr [ebp-54h],496CC9B0h
                                        00000023 cmp dword ptr [ecx+4],1
                                        00000027 je 00000043
                                        00000029 call 620CD310
                                        0000002e mov ecx,eax
                                        00000030 mov edx,dword ptr ds:[03522030h]
                                        00000036 mov eax,dword ptr [ecx]
                                        00000038 call dword ptr [eax+000000D8h]
                                        0000003e jmp 0000017B
                                        00000043 cmp dword ptr [ecx+4],0
                                        00000047 jbe 00000191
                                        0000004d mov esi,dword ptr [ecx+0Ch]
                                        00000050 call 620C0CC0
                                        00000055 push eax
                                        00000056 mov ecx,esi
                                        00000058 mov edx,7
                                        0000005d call 620CF2F0
                                        00000062 mov ebx,eax
                                        00000064 lea edi,[ebp-30h]
                                        00000067 xor eax,eax
                                        00000069 stos dword ptr es:[edi]
                                        0000006a stos dword ptr es:[edi]
                                        0000006b lea ecx,[ebp-30h]
                                        0000006e call 62069650
                                        00000073 call 62069730
                                        00000078 mov ecx,eax
                                        0000007a lea eax,[ebp-30h]
                                        0000007d push dword ptr [eax+4]
                                        00000080 push dword ptr [eax]
                                        00000082 lea edx,[ebp-18h]
                                        00000085 mov eax,dword ptr [ecx]
                                        00000087 call dword ptr [eax+48h]
                                        0000008a mov ecx,801h
                                        0000008f push 0
                                        00000091 dec ecx
                                        00000092 jne 0000008F
                                        00000094 mov ecx,esp
                                        00000096 mov dword ptr [ebp-10h],esp
                                        00000099 mov esi,ecx
                                        0000009b jmp 000000DA
                                        0000009d xor edx,edx
                                        0000009f mov byte ptr [esi+edx],1
                                        000000a3 inc edx
                                        000000a4 cmp edx,2001h
                                        000000aa jl 0000009F
                                        000000ac mov ecx,2
                                        000000b1 cmp byte ptr [esi+ecx],0
                                        000000b5 je 000000D1
                                        000000b7 mov edx,ecx
                                        000000b9 add edx,edx
                                        0000

                                        L Offline
                                        L Offline
                                        Lost User
                                        wrote on last edited by
                                        #56

                                        Is it a Debug build?

                                        S 1 Reply Last reply
                                        0
                                        • S Super Lloyd

                                          okay, I ran it from the command line and attached a debugger, here is the release version, run and optimized at its best I guess. what do you think? (well, if you can read that! :~ ) (no inline C# comment this time!) ===============================================

                                          00000000 push ebp
                                          00000001 mov ebp,esp
                                          00000003 push edi
                                          00000004 push esi
                                          00000005 push ebx
                                          00000006 sub esp,48h
                                          00000009 mov esi,ecx
                                          0000000b lea edi,[ebp-54h]
                                          0000000e mov ecx,12h
                                          00000013 xor eax,eax
                                          00000015 rep stos dword ptr es:[edi]
                                          00000017 mov ecx,esi
                                          00000019 mov dword ptr [ebp-10h],esp
                                          0000001c mov dword ptr [ebp-54h],496CC9B0h
                                          00000023 cmp dword ptr [ecx+4],1
                                          00000027 je 00000043
                                          00000029 call 620CD310
                                          0000002e mov ecx,eax
                                          00000030 mov edx,dword ptr ds:[03522030h]
                                          00000036 mov eax,dword ptr [ecx]
                                          00000038 call dword ptr [eax+000000D8h]
                                          0000003e jmp 0000017B
                                          00000043 cmp dword ptr [ecx+4],0
                                          00000047 jbe 00000191
                                          0000004d mov esi,dword ptr [ecx+0Ch]
                                          00000050 call 620C0CC0
                                          00000055 push eax
                                          00000056 mov ecx,esi
                                          00000058 mov edx,7
                                          0000005d call 620CF2F0
                                          00000062 mov ebx,eax
                                          00000064 lea edi,[ebp-30h]
                                          00000067 xor eax,eax
                                          00000069 stos dword ptr es:[edi]
                                          0000006a stos dword ptr es:[edi]
                                          0000006b lea ecx,[ebp-30h]
                                          0000006e call 62069650
                                          00000073 call 62069730
                                          00000078 mov ecx,eax
                                          0000007a lea eax,[ebp-30h]
                                          0000007d push dword ptr [eax+4]
                                          00000080 push dword ptr [eax]
                                          00000082 lea edx,[ebp-18h]
                                          00000085 mov eax,dword ptr [ecx]
                                          00000087 call dword ptr [eax+48h]
                                          0000008a mov ecx,801h
                                          0000008f push 0
                                          00000091 dec ecx
                                          00000092 jne 0000008F
                                          00000094 mov ecx,esp
                                          00000096 mov dword ptr [ebp-10h],esp
                                          00000099 mov esi,ecx
                                          0000009b jmp 000000DA
                                          0000009d xor edx,edx
                                          0000009f mov byte ptr [esi+edx],1
                                          000000a3 inc edx
                                          000000a4 cmp edx,2001h
                                          000000aa jl 0000009F
                                          000000ac mov ecx,2
                                          000000b1 cmp byte ptr [esi+ecx],0
                                          000000b5 je 000000D1
                                          000000b7 mov edx,ecx
                                          000000b9 add edx,edx
                                          0000

                                          D Offline
                                          D Offline
                                          Daniel Grunwald
                                          wrote on last edited by
                                          #57

                                          That looks quite similar to the C++ assembler code! Here, compare the main loop bodies. I've added annotations to the C# assembler code. C#:

                                          000000b1 cmp byte ptr [esi+ecx],0 // primes[i] != 0
                                          000000b5 je 000000D1
                                          // if (primes[i] != 0) {
                                          000000b7 mov edx,ecx
                                          000000b9 add edx,edx // int k = 2 * i
                                          000000bb cmp edx,2001h // k < end
                                          000000c1 jge 000000D1
                                          // for {
                                          000000c3 mov byte ptr [esi+edx],0 // primes[k] = 0;
                                          000000c7 add edx,ecx // k += i
                                          000000c9 cmp edx,2001h // k < end
                                          000000cf jl 000000C3
                                          // }
                                          // }
                                          000000d1 inc ecx // ++i
                                          000000d2 cmp ecx,2001h // i < end
                                          000000d8 jl 000000B1

                                          C++:

                                                              if (primes\[i\] != 0)
                                          

                                          002610A0 cmp byte ptr [esp+ecx+10h],bl
                                          002610A4 je wmain+0BDh (2610BDh)
                                          {
                                          int p = i; // using this extra variable speeds up C++!!! (and slow down C# if I do it)
                                          for (int k = i + p; k < end; k += p)
                                          002610A6 lea eax,[ecx+ecx]
                                          002610A9 cmp eax,2001h
                                          002610AE jge wmain+0BDh (2610BDh)
                                          primes[k] = 0;
                                          002610B0 mov byte ptr [esp+eax+10h],bl
                                          002610B4 add eax,ecx
                                          002610B6 cmp eax,2001h
                                          002610BB jl wmain+0B0h (2610B0h)

                                          	for (int i = begin; i < end; ++i)
                                          

                                          002610BD inc ecx
                                          002610BE cmp ecx,2001h
                                          002610C4 jl wmain+0A0h (2610A0h)

                                          The C++ compiler was a bit more clever in combining mov+add into lea, but other than that, the instructions are identical. C++ also lifted the 0 literal into a register (bl), but I'm not sure if that gives you a performance advantage over immediates. (this optimization does give you a code size advantage though, at least for data types larger than 1 byte) So I'm not sure if this small optimization (lea) is causing the difference in performance, or if the reason for the difference is outside this loop. Because there's a major difference in the 'reset to 1' loop: C# resets each byte individually, whereas C++ calls the much faster memset. You could try rewriting that C# loop to:

                                                      int\* primesInt = (int\*)primes;
                                          	for (int i = 0; i < ((end-1)/sizeof(int))+1; i++)
                                          		primesInt\[i\] = 0x01010101;
                                          

                                          Maybe even try if long is faster on your machine. In fact, I have the suspicion that the C++ memset might use SSE registers to set 16 ele

                                          S 1 Reply Last reply
                                          0
                                          Reply
                                          • Reply as topic
                                          Log in to reply
                                          • Oldest to Newest
                                          • Newest to Oldest
                                          • Most Votes


                                          • Login

                                          • Don't have an account? Register

                                          • Login or register to search.
                                          • First post
                                            Last post
                                          0
                                          • Categories
                                          • Recent
                                          • Tags
                                          • Popular
                                          • World
                                          • Users
                                          • Groups