Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Collapse
Code Project
  1. Home
  2. The Lounge
  3. Computer Construction Conundrum

Computer Construction Conundrum

Scheduled Pinned Locked Moved The Lounge
asp-netsysadminbusinessperformancequestion
42 Posts 17 Posters 4 Views 1 Watching
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • D Dan Neely

    *nod* I'm not quite sure what to be looking at there since intel/amd use different numbering schemes for their multi-socket chips than the single socket ones, but for my biggest use intel's cleaning amd's clock. As of late spring this year a ~$1000 s1366 system (to match my existing 1366 for troubleshooting ease) performed roughly on par with 2x$500 amd quad core systems. I know the general performance comparison is less lopsided although I'm not sure what's driving it. E@H is compiled using GCC and at least one of the two apps has hand optimized hot loops written by the guy who ~4 years ago got a 9x speedup vs the stock app by hacking the binary before the project hired him as a consultant to improve their stock apps.

    3x12=36 2x12=24 1x12=12 0x12=18

    N Offline
    N Offline
    Nicholas Butler
    wrote on last edited by
    #20

    I'm in a niche ( ATM ) market, as I work from home writing concurrent software. The equivalent Intel chips are the 7500 Beckton series. These have 8 cores and 16 threads which according to reports beat the 12-core AMD 6100s. However, I can't find any in stock and the retail prices that are published are prohibitive.

    Dan Neely wrote:

    the guy who ~4 years ago got a 9x speedup vs the stock app by hacking the binary

    Impressive :omg: Nick

    D 1 Reply Last reply
    0
    • C Chris Losinger

      Nicholas Butler wrote:

      What would you do with 48 cores?

      i'd wonder what was going to become of those two bags of apples.

      image processing toolkits | batch image processing

      L Offline
      L Offline
      Lost User
      wrote on last edited by
      #21

      Apparently that went over someone's head. :rolleyes: (5)

      L u n a t i c F r i n g e

      C 1 Reply Last reply
      0
      • N Nicholas Butler

        I'm in a niche ( ATM ) market, as I work from home writing concurrent software. The equivalent Intel chips are the 7500 Beckton series. These have 8 cores and 16 threads which according to reports beat the 12-core AMD 6100s. However, I can't find any in stock and the retail prices that are published are prohibitive.

        Dan Neely wrote:

        the guy who ~4 years ago got a 9x speedup vs the stock app by hacking the binary

        Impressive :omg: Nick

        D Offline
        D Offline
        Dan Neely
        wrote on last edited by
        #22

        Nicholas Butler wrote:

        Dan Neely wrote: the guy who ~4 years ago got a 9x speedup vs the stock app by hacking the binary Impressive OMG

        Yeah. It was really nice for a while since we were getting 9x as many points/day which helped a lot in the bragging rights contest. IIRC he got about ~4-5x from algorithmic changes that were incorporated in the next official app (moving stuff into lookup tables and tweaking working sets to fit inside L1 caches) and ~2x from writing better hot loops to eliminate pipeline stalls and incorporating SSE/SSE2 instructions. The original was pure x87, his replacements interleaved both x87 and SSE instructions to keep both of their pipelines fully loaded. It was eye opening after years of reading that a good compiler is almost impossible to beat these days. Granted that a good C/C++ programmer could have done most of the improvements in his code and used compiler options to get basic SSE1/2 versions, but IIRC we've had 30-50% speedups in new apps after they were regarded as stable enough to start mucking around with asm. Overkill for most uses probably, but at supercomputer work loads even a few percent can help; especially when the total problem set is large enough that probabilistic algorithms are needed to prune the data massively early on. Even with those at present signals need to be an order of magnitude larger than the sensors thresholds to be detected (below that the odds decline roughly linearly), the previous algorithm's sure detection threshold was +2 magnitudes. This is a major challenge when the strongest signal expected from a known pulsar is right at the theoretician detection threshold and is near a massive noise spike.

        3x12=36 2x12=24 1x12=12 0x12=18

        C 1 Reply Last reply
        0
        • R realJSOP

          I don't understand why you feel the need to build a new machine. Compiler tech certainly is just barely using what you have now. My current quad-core box is 3-4 years old, and I feel no need to upgrade the hardware.

          .45 ACP - because shooting twice is just silly
          -----
          "Why don't you tie a kerosene-soaked rag around your ankles so the ants won't climb up and eat your candy ass..." - Dale Earnhardt, 1997
          -----
          "The staggering layers of obscenity in your statement make it a work of art on so many levels." - J. Jystad, 2001

          D Offline
          D Offline
          Dan Neely
          wrote on last edited by
          #23

          From comments made elsewhere I don't think is using all zillion cores to compile on, he's writing apps that scale across a zillion cores.

          3x12=36 2x12=24 1x12=12 0x12=18

          E 1 Reply Last reply
          0
          • D Dan Neely

            Nicholas Butler wrote:

            Dan Neely wrote: the guy who ~4 years ago got a 9x speedup vs the stock app by hacking the binary Impressive OMG

            Yeah. It was really nice for a while since we were getting 9x as many points/day which helped a lot in the bragging rights contest. IIRC he got about ~4-5x from algorithmic changes that were incorporated in the next official app (moving stuff into lookup tables and tweaking working sets to fit inside L1 caches) and ~2x from writing better hot loops to eliminate pipeline stalls and incorporating SSE/SSE2 instructions. The original was pure x87, his replacements interleaved both x87 and SSE instructions to keep both of their pipelines fully loaded. It was eye opening after years of reading that a good compiler is almost impossible to beat these days. Granted that a good C/C++ programmer could have done most of the improvements in his code and used compiler options to get basic SSE1/2 versions, but IIRC we've had 30-50% speedups in new apps after they were regarded as stable enough to start mucking around with asm. Overkill for most uses probably, but at supercomputer work loads even a few percent can help; especially when the total problem set is large enough that probabilistic algorithms are needed to prune the data massively early on. Even with those at present signals need to be an order of magnitude larger than the sensors thresholds to be detected (below that the odds decline roughly linearly), the previous algorithm's sure detection threshold was +2 magnitudes. This is a major challenge when the strongest signal expected from a known pulsar is right at the theoretician detection threshold and is near a massive noise spike.

            3x12=36 2x12=24 1x12=12 0x12=18

            C Offline
            C Offline
            Chris Losinger
            wrote on last edited by
            #24

            Dan Neely wrote:

            It was eye opening after years of reading that a good compiler is almost impossible to beat these days

            yup. i beat it almost every time i try, and i'm a complete ASM novice. and the 'intrinsics' are nearly more trouble than they're worth. i was struggling this weekend to come up with x64 equivalents to some old _asm {... MMX stuff...} blocks in our code, so i tried the intrinsics. but they ended up being like 4x slower than my hand-written ASM because things were constantly being shoved in and out of the MMX registers instead of being held registers for reuse.

            image processing toolkits | batch image processing

            1 Reply Last reply
            0
            • N Nicholas Butler

              So, it's time to replace my main dev box. Request for advice &| experience please :) I built my current box for £1200 2.5 years ago. It's a dual quad-core Xeon and I've learnt just about all I can from it. It's been the best £1 per day I've ever spent, but hey! nothing lasts forever... So I've looked around a bit and AMD have released a new range "Magny-Cours" of 8-core and 12-core Opterons at sensible prices. They are basically two quad- or hex-core CPUs in one package for half the price. They have are NUMA ( albeit cache coherent ) as each half of each socket has it's own memory channel. And they all support 2-socket and 4-socket configurations. Very interesting! I'm sorely tempted to build a 48-core box just for fun, but I'm having difficulty convincing myself as my last client still only had up to 8-core boxes. On the other hand, I think that will change soon, and certainly over the next 2.5 years. BTW, I would be using this for writing concurrent business server software. What would you do with 48 cores? Cheers, Nick

              E Offline
              E Offline
              El Corazon
              wrote on last edited by
              #25

              Nicholas Butler wrote:

              What would you do with 48 cores?

              Fully describe the high power DE stray energy problem Recreate my college "Lana" (Lexical Analysis) and add AI Make some killer raytraces.... Increase my RF-LOS work even more.... Massive amounts of soft-body physics.... "wouldn't you love to get your hands on one of those gibsons...." :-D

              _________________________ John Andrew Holmes "It is well to remember that the entire universe, with one trifling exception, is composed of others." Shhhhh.... I am not really here. I am a figment of your imagination.... I am still in my cave so this must be an illusion....

              modified on Monday, September 20, 2010 3:09 PM

              C N 2 Replies Last reply
              0
              • C Chris Losinger

                Nicholas Butler wrote:

                What would you do with 48 cores?

                i'd wonder what was going to become of those two bags of apples.

                image processing toolkits | batch image processing

                E Offline
                E Offline
                El Corazon
                wrote on last edited by
                #26

                Chris Losinger wrote:

                i'd wonder what was going to become of those two bags of apples.

                Apple cobbler!

                _________________________ John Andrew Holmes "It is well to remember that the entire universe, with one trifling exception, is composed of others." Shhhhh.... I am not really here. I am a figment of your imagination.... I am still in my cave so this must be an illusion....

                C 1 Reply Last reply
                0
                • D Dan Neely

                  From comments made elsewhere I don't think is using all zillion cores to compile on, he's writing apps that scale across a zillion cores.

                  3x12=36 2x12=24 1x12=12 0x12=18

                  E Offline
                  E Offline
                  El Corazon
                  wrote on last edited by
                  #27

                  Dan Neely wrote:

                  he's writing apps that scale across a zillion cores.

                  The only way to fly!! :-D :-D

                  _________________________ John Andrew Holmes "It is well to remember that the entire universe, with one trifling exception, is composed of others." Shhhhh.... I am not really here. I am a figment of your imagination.... I am still in my cave so this must be an illusion....

                  1 Reply Last reply
                  0
                  • E El Corazon

                    Chris Losinger wrote:

                    i'd wonder what was going to become of those two bags of apples.

                    Apple cobbler!

                    _________________________ John Andrew Holmes "It is well to remember that the entire universe, with one trifling exception, is composed of others." Shhhhh.... I am not really here. I am a figment of your imagination.... I am still in my cave so this must be an illusion....

                    C Offline
                    C Offline
                    Chris Losinger
                    wrote on last edited by
                    #28

                    pie would be acceptable, also.

                    image processing toolkits | batch image processing

                    E 1 Reply Last reply
                    0
                    • L Lost User

                      Apparently that went over someone's head. :rolleyes: (5)

                      L u n a t i c F r i n g e

                      C Offline
                      C Offline
                      Chris Losinger
                      wrote on last edited by
                      #29

                      probably a fan of the banana.

                      image processing toolkits | batch image processing

                      1 Reply Last reply
                      0
                      • C Chris Losinger

                        pie would be acceptable, also.

                        image processing toolkits | batch image processing

                        E Offline
                        E Offline
                        El Corazon
                        wrote on last edited by
                        #30

                        Chris Losinger wrote:

                        pie would be acceptable, also.

                        true... but since he is considering cobbling together his own computer... I figured a cobbler was slightly more accurate. :)

                        _________________________ John Andrew Holmes "It is well to remember that the entire universe, with one trifling exception, is composed of others." Shhhhh.... I am not really here. I am a figment of your imagination.... I am still in my cave so this must be an illusion....

                        C 1 Reply Last reply
                        0
                        • E El Corazon

                          Chris Losinger wrote:

                          pie would be acceptable, also.

                          true... but since he is considering cobbling together his own computer... I figured a cobbler was slightly more accurate. :)

                          _________________________ John Andrew Holmes "It is well to remember that the entire universe, with one trifling exception, is composed of others." Shhhhh.... I am not really here. I am a figment of your imagination.... I am still in my cave so this must be an illusion....

                          C Offline
                          C Offline
                          Chris Losinger
                          wrote on last edited by
                          #31

                          good point. cobbler it is!

                          image processing toolkits | batch image processing

                          1 Reply Last reply
                          0
                          • E El Corazon

                            Nicholas Butler wrote:

                            What would you do with 48 cores?

                            Fully describe the high power DE stray energy problem Recreate my college "Lana" (Lexical Analysis) and add AI Make some killer raytraces.... Increase my RF-LOS work even more.... Massive amounts of soft-body physics.... "wouldn't you love to get your hands on one of those gibsons...." :-D

                            _________________________ John Andrew Holmes "It is well to remember that the entire universe, with one trifling exception, is composed of others." Shhhhh.... I am not really here. I am a figment of your imagination.... I am still in my cave so this must be an illusion....

                            modified on Monday, September 20, 2010 3:09 PM

                            C Offline
                            C Offline
                            Chris Losinger
                            wrote on last edited by
                            #32

                            El Corazon wrote:

                            Massive amounts of soft-body physics

                            your internet connection will have more of an effect on that than sheer processing power.

                            image processing toolkits | batch image processing

                            modified on Monday, September 20, 2010 4:31 PM

                            E 1 Reply Last reply
                            0
                            • C Chris Losinger

                              El Corazon wrote:

                              Massive amounts of soft-body physics

                              your internet connection will have more of an effect on that than sheer processing power.

                              image processing toolkits | batch image processing

                              modified on Monday, September 20, 2010 4:31 PM

                              E Offline
                              E Offline
                              El Corazon
                              wrote on last edited by
                              #33

                              Chris Losinger wrote:

                              your internet connection will have more of an effect on that then sheer processing power.

                              I wonder how many heads that will go over?? :laugh:

                              _________________________ John Andrew Holmes "It is well to remember that the entire universe, with one trifling exception, is composed of others." Shhhhh.... I am not really here. I am a figment of your imagination.... I am still in my cave so this must be an illusion....

                              1 Reply Last reply
                              0
                              • N Nicholas Butler

                                So, it's time to replace my main dev box. Request for advice &| experience please :) I built my current box for £1200 2.5 years ago. It's a dual quad-core Xeon and I've learnt just about all I can from it. It's been the best £1 per day I've ever spent, but hey! nothing lasts forever... So I've looked around a bit and AMD have released a new range "Magny-Cours" of 8-core and 12-core Opterons at sensible prices. They are basically two quad- or hex-core CPUs in one package for half the price. They have are NUMA ( albeit cache coherent ) as each half of each socket has it's own memory channel. And they all support 2-socket and 4-socket configurations. Very interesting! I'm sorely tempted to build a 48-core box just for fun, but I'm having difficulty convincing myself as my last client still only had up to 8-core boxes. On the other hand, I think that will change soon, and certainly over the next 2.5 years. BTW, I would be using this for writing concurrent business server software. What would you do with 48 cores? Cheers, Nick

                                P Offline
                                P Offline
                                peterchen
                                wrote on last edited by
                                #34

                                48 AMD cores? Boiling water for tea in 12 seconds!

                                Agh! Reality! My Archnemesis![^]
                                | FoldWithUs! | sighist | WhoIncludes - Analyzing C++ include file hierarchy

                                N 1 Reply Last reply
                                0
                                • N Nicholas Butler

                                  So, it's time to replace my main dev box. Request for advice &| experience please :) I built my current box for £1200 2.5 years ago. It's a dual quad-core Xeon and I've learnt just about all I can from it. It's been the best £1 per day I've ever spent, but hey! nothing lasts forever... So I've looked around a bit and AMD have released a new range "Magny-Cours" of 8-core and 12-core Opterons at sensible prices. They are basically two quad- or hex-core CPUs in one package for half the price. They have are NUMA ( albeit cache coherent ) as each half of each socket has it's own memory channel. And they all support 2-socket and 4-socket configurations. Very interesting! I'm sorely tempted to build a 48-core box just for fun, but I'm having difficulty convincing myself as my last client still only had up to 8-core boxes. On the other hand, I think that will change soon, and certainly over the next 2.5 years. BTW, I would be using this for writing concurrent business server software. What would you do with 48 cores? Cheers, Nick

                                  M Offline
                                  M Offline
                                  Member 96
                                  wrote on last edited by
                                  #35

                                  All I can contribute is that I and many other developers have long ago switched to developing on a virtual workstation using something like VMWare Workstation or Virtual Box so any and all hardware decisions I make always revolve around optimal performance for VMWare workstation machines. If you or other readers have not considered this you should. That being said my secondary consideration these days seems to be power consumption since almost any decent hardware is powerful enough for what I do so there is no reason not to consider power consumption pretty high on the list and the electric bill is not an insignificant business expense.


                                  “If you want to build a ship, don't drum up people together to collect wood and don't assign them tasks and work, but rather teach them to long for the endless immensity of the sea” - Antoine de Saint-Exupery

                                  N 1 Reply Last reply
                                  0
                                  • N Nicholas Butler

                                    So, it's time to replace my main dev box. Request for advice &| experience please :) I built my current box for £1200 2.5 years ago. It's a dual quad-core Xeon and I've learnt just about all I can from it. It's been the best £1 per day I've ever spent, but hey! nothing lasts forever... So I've looked around a bit and AMD have released a new range "Magny-Cours" of 8-core and 12-core Opterons at sensible prices. They are basically two quad- or hex-core CPUs in one package for half the price. They have are NUMA ( albeit cache coherent ) as each half of each socket has it's own memory channel. And they all support 2-socket and 4-socket configurations. Very interesting! I'm sorely tempted to build a 48-core box just for fun, but I'm having difficulty convincing myself as my last client still only had up to 8-core boxes. On the other hand, I think that will change soon, and certainly over the next 2.5 years. BTW, I would be using this for writing concurrent business server software. What would you do with 48 cores? Cheers, Nick

                                    P Offline
                                    P Offline
                                    PIEBALDconsult
                                    wrote on last edited by
                                    #36

                                    Fold faster.

                                    1 Reply Last reply
                                    0
                                    • E El Corazon

                                      Nicholas Butler wrote:

                                      What would you do with 48 cores?

                                      Fully describe the high power DE stray energy problem Recreate my college "Lana" (Lexical Analysis) and add AI Make some killer raytraces.... Increase my RF-LOS work even more.... Massive amounts of soft-body physics.... "wouldn't you love to get your hands on one of those gibsons...." :-D

                                      _________________________ John Andrew Holmes "It is well to remember that the entire universe, with one trifling exception, is composed of others." Shhhhh.... I am not really here. I am a figment of your imagination.... I am still in my cave so this must be an illusion....

                                      modified on Monday, September 20, 2010 3:09 PM

                                      N Offline
                                      N Offline
                                      Nicholas Butler
                                      wrote on last edited by
                                      #37

                                      El Corazon wrote:

                                      "wouldn't you love to get your hands on one of those gibsons...."

                                      Actually, there were some 48-core supercomputers in 1995 ;) Thanks for your suggestions, but wouldn't GPU(s) be better for those applications? How would a 48-core computer perform against a couple of the new Tesla C2050 cards, for example? I am more interested in C# LOB apps that use concurrency. Which, as far as I can tell at the moment, is mostly server software. I was hoping that someone was actually using the cores available on client machines. Nick

                                      E 1 Reply Last reply
                                      0
                                      • P peterchen

                                        48 AMD cores? Boiling water for tea in 12 seconds!

                                        Agh! Reality! My Archnemesis![^]
                                        | FoldWithUs! | sighist | WhoIncludes - Analyzing C++ include file hierarchy

                                        N Offline
                                        N Offline
                                        Nicholas Butler
                                        wrote on last edited by
                                        #38

                                        My current Intel box can already do that :) Nick

                                        P 1 Reply Last reply
                                        0
                                        • M Member 96

                                          All I can contribute is that I and many other developers have long ago switched to developing on a virtual workstation using something like VMWare Workstation or Virtual Box so any and all hardware decisions I make always revolve around optimal performance for VMWare workstation machines. If you or other readers have not considered this you should. That being said my secondary consideration these days seems to be power consumption since almost any decent hardware is powerful enough for what I do so there is no reason not to consider power consumption pretty high on the list and the electric bill is not an insignificant business expense.


                                          “If you want to build a ship, don't drum up people together to collect wood and don't assign them tasks and work, but rather teach them to long for the endless immensity of the sea” - Antoine de Saint-Exupery

                                          N Offline
                                          N Offline
                                          Nicholas Butler
                                          wrote on last edited by
                                          #39

                                          Interesting John, thanks :) I presume all modern CPUs ( especially server ones ) fully support virtualization, no? I do use VMWare Workstation, but not for my main dev box as it is limited to 4-cores [edit: just checked - it now supports up to 8. I'm sure it was 4 last time I looked ] My last client was using 8-core boxes, and part of my job is to be ahead of the curve a bit. So it's not really an option for me, although I do agree they are a good choice for normal development. Nick

                                          1 Reply Last reply
                                          0
                                          Reply
                                          • Reply as topic
                                          Log in to reply
                                          • Oldest to Newest
                                          • Newest to Oldest
                                          • Most Votes


                                          • Login

                                          • Don't have an account? Register

                                          • Login or register to search.
                                          • First post
                                            Last post
                                          0
                                          • Categories
                                          • Recent
                                          • Tags
                                          • Popular
                                          • World
                                          • Users
                                          • Groups