Register a SA Forums Account here!
JOINING THE SA FORUMS WILL REMOVE THIS BIG AD, THE ANNOYING UNDERLINED ADS, AND STUPID INTERSTITIAL ADS!!!

You can: log in, read the tech support FAQ, or request your lost password. This dumb message (and those ads) will appear on every screen until you register! Get rid of this crap by registering your own SA Forums Account and joining roughly 150,000 Goons, for the one-time price of $9.95! We charge money because it costs us money per month for bills, and since we don't believe in showing ads to our users, we try to make the money back through forum registrations.
 
  • Post
  • Reply
Potato Salad
Oct 23, 2014

nobody cares


gently caress Moore's law, great integrated graphics for less than $90 is where it's loving at

Gordon Moore can suck it, we're a society of web browsing weebs now, give us our integrated auxiliary acceleration resources

I want to watch scott manley send green men to space in 4K 60FPS and I want to do it on a machine that costs less than an Ikea mattress

Potato Salad fucked around with this message at 19:09 on May 5, 2018

Adbot
ADBOT LOVES YOU

Subjunctive
Sep 12, 2006

✨sparkle and shine✨

Moore just cares about transistor count. We know there are more factors in modern performance that just that count. It’s really not a big deal.

PerrineClostermann
Dec 15, 2012

by FactsAreUseless
Just give me the hardware to run VRchat and I'm happy

EmpyreanFlux
Mar 1, 2013

The AUDACITY! The IMPUDENCE! The unabated NERVE!

SourKraut posted:

I'm sure the answer is "We don't know", but is there any sense of how much of an improvement Ryzen 2 might be over Ryzen+? The R7 2700X is enticing...

Likely a significant improvement TBH, Pinnacle Ridge is merely what AMD wanted Summit Ridge to be, Matisse looks to be a major overhaul.

Deuce
Jun 18, 2004
Mile High Club

SourKraut posted:

I'm sure the answer is "We don't know", but is there any sense of how much of an improvement Ryzen 2 might be over Ryzen+? The R7 2700X is enticing...

We don’t know.

Rastor
Jun 2, 2001

Subjunctive posted:

Moore just cares about transistor count. We know there are more factors in modern performance that just that count. It’s really not a big deal.

That's the best kind of correct, but the practical effect of doubling transistor count was doubling performance.

The world of processors now lines up with the world of batteries: single digit percent gains instead of 200% gains

Subjunctive
Sep 12, 2006

✨sparkle and shine✨

Rastor posted:

That's the best kind of correct, but the practical effect of doubling transistor count was doubling performance.

The world of processors now lines up with the world of batteries: single digit percent gains instead of 200% gains

In Moore’s paper, he talks about doubling every year in the context of chip complexity as it relates to yield and ultimately cost. The double-perf-every-18-months comes from a guy at Intel who was talking about the combination of transistor count and performance per transistor. (Moore’s paper is fun to read, though shallow. Recommend!)

We see double-digit-or-more gains in performance between generations in GPUs (500-1100% on DL tasks between Pascal and Volta) and mobile SoCs (25-50% for A7-A8). A large portion of the world’s processor applications are still seeing excellent gains.

shrike82
Jun 11, 2005

Subjunctive posted:

500-1100% on DL tasks between Pascal and Volta)

LMAO, I'm curious what "DL tasks" these are.

Most benchmarks show a 10-30% difference for training/inferencing between the Titan V and the 1080 Ti.

shrike82 fucked around with this message at 01:17 on May 6, 2018

Subjunctive
Sep 12, 2006

✨sparkle and shine✨

shrike82 posted:

LMAO, I'm curious what "DL tasks" these are.

Looks like ResNet-50, though I can’t find the same numbers. Still up to 3x, though.

https://www.nvidia.com/en-us/data-center/tensorcore/

shrike82
Jun 11, 2005

yeah that's bullshit marketing which isn't borne out in actual benchmarks.

We've actually looked at switching from 1080 Tis to the Titan Vs for our Dev ML servers, the latter aren't much faster.

A big part is the fact that Nvidia crippled FP16 throughput for consumer cards - but that's not used in actual training/inferencing.

Subjunctive
Sep 12, 2006

✨sparkle and shine✨

Do you mean that the ResNet results aren’t going to be reproducible, or that they aren’t representative (possibly because most workloads don’t code to the tensor cores)?

shrike82
Jun 11, 2005

Well I'm guessing you're not a technical guy -

1. Don't take marketing "benchmarks" at face value especially when the X-axis is "time to solution" (presumably convergence) and that can be manipulated
2. They compare the V100 to the P100 when the 1080 Ti is faster than the P100 and cheaper from a TCO standpoint if you're doing dev ML work
3. This is an example of a real-world benchmark

4. ResNet-50 is a pretty small (i.e., toy) image architecture by modern standards

Subjunctive
Sep 12, 2006

✨sparkle and shine✨

Thanks. I hadn’t thought about manipulation of convergence. Should that affect the inference results? I only know it in terms of establishing weights, but my ML experience is years old and poorly remembered.

The point of comparing P100 to V100 was to isolate the inter-generation gap, but I suppose the numbering doesn’t really ensure that. Maxwell-vs-Pascal might have been better because I could have compared the resulting top end of both lines.

MaxxBot
Oct 6, 2003

you could have clapped

you should have clapped!!

Rastor posted:

That's the best kind of correct, but the practical effect of doubling transistor count was doubling performance.

The world of processors now lines up with the world of batteries: single digit percent gains instead of 200% gains


The reason CPU performance has stalled isn't because transistor counts are stalled, as Subjunctive pointed out other areas like GPUs are still seeing big gains every generation. If you could just keep adding more cores like you can with a GPU we'd still be seeing big gains, but that approach doesn't work for CPUs because you need to balance single and multi-threaded performance. CPU performance gains have stalled because clock speed improvements have stalled and IPC gains are not even close to being linear with increased transistor counts of the individual cores. Basically it comes down to the fact that further increases in single-threaded IPC at this point are really difficult.

MaxxBot fucked around with this message at 07:43 on May 6, 2018

Rastor
Jun 2, 2001

MaxxBot posted:

The reason CPU performance has stalled isn't because transistor counts are stalled, as Subjunctive pointed out other areas like GPUs are still seeing big gains every generation. If you could just keep adding more cores like you can with a GPU we'd still be seeing big gains, but that approach doesn't work for CPUs because you need to balance single and multi-threaded performance. CPU performance gains have stalled because clock speed improvements have stalled and IPC gains are not even close to being linear with increased transistor counts of the individual cores. Basically it comes down to the fact that further increases in single-threaded IPC at this point are really difficult.

In other words: Moore's Law is dead.

Bloody Antlers
Mar 27, 2010

by Jeffrey of YOSPOS
I wonder what the Intel engineers that designed the 8086 back in 1976 would say if they could see how far past the point of diminishing returns we've carried their baby.

GRINDCORE MEGGIDO
Feb 28, 1985


Probably quite confused about meltdown.

Avalanche
Feb 2, 2007
This is probably a dumb question but anyone have any idea why fully modular motherboard designs like this are not becoming more of a thing yet? Or is it just due to the motherboard and cpu cartel and/or engineering difficulty?



Seems like a great way to make everything backwards/forward compatible, significantly reduce costs, and reduce waste as you could just keep adding on stuff as tech progresses. But, I'm guessing getting an actual frankenstein computer to work would be a true nightmare in terms of ram frequency differences, somehow getting to completely different cpu sockets to work together, and everything else.

KKKLIP ART
Sep 3, 2004

I would think that a fully modular motherboard like that would be both an engineering and software nightmare due to the mishmash of connection types, PCB traces, and hardware variables.

fishmech
Jul 16, 2006

by VideoGames
Salad Prong

Avalanche posted:

This is probably a dumb question but anyone have any idea why fully modular motherboard designs like this are not becoming more of a thing yet? Or is it just due to the motherboard and cpu cartel and/or engineering difficulty?



Seems like a great way to make everything backwards/forward compatible, significantly reduce costs, and reduce waste as you could just keep adding on stuff as tech progresses. But, I'm guessing getting an actual frankenstein computer to work would be a true nightmare in terms of ram frequency differences, somehow getting to completely different cpu sockets to work together, and everything else.

It can only last as many CPU upgrade cycles as a normal motherboard can, since you'd need to upgrade most of the parts to not be wasting a CPU upgrade anyway. Plus you'd have to pay more for all the parts from the start, broken out as they are.

Really, things like that are designed to support goals like "fit into odd shaped case for a particular task" or "provide easy component replacement for ease of repair".

Theris
Oct 9, 2007

Bloody Antlers posted:

I wonder what the Intel engineers that designed the 8086 back in 1976 would say if they could see how far past the point of diminishing returns we've carried their baby.

It was only 40 years ago. Stephen Morse, Bill Pohlman, and Bruce Ravenel are still alive. Morse did an interview with PC World for the 30th anniversary. He doesn't talk much about modern CPUs but did mention that if they had known that x86 was going to stick around rather than just be a stopgap until the 432 could be released, they'd have done some things differently.

Subjunctive
Sep 12, 2006

✨sparkle and shine✨

Theris posted:

It was only 40 years ago. Stephen Morse, Bill Pohlman, and Bruce Ravenel are still alive. Morse did an interview with PC World for the 30th anniversary. He doesn't talk much about modern CPUs but did mention that if they had known that x86 was going to stick around rather than just be a stopgap until the 432 could be released, they'd have done some things differently.

I worked with Steve Morse post-Intel (mid/late 90s). He’s much better at hardware than software!

redeyes
Sep 14, 2002

by Fluffdaddy

Avalanche posted:

This is probably a dumb question but anyone have any idea why fully modular motherboard designs like this are not becoming more of a thing yet? Or is it just due to the motherboard and cpu cartel and/or engineering difficulty?



Seems like a great way to make everything backwards/forward compatible, significantly reduce costs, and reduce waste as you could just keep adding on stuff as tech progresses. But, I'm guessing getting an actual frankenstein computer to work would be a true nightmare in terms of ram frequency differences, somehow getting to completely different cpu sockets to work together, and everything else.

Most mobo manufactures have done this for years. Nothing new bro. Only difference is the manufacturer does it, not you.

repiv
Aug 13, 2009

Rastor posted:

It's just now moving into higher power situations (Examples: Qualcomm Centriq, Cavium ThunderX2), which may be part of why Intel feels threatened enough to bring in Jim Keller.

Example # xxx why one of the dumbest decisions Intel ever made was selling off their ARM division instead of putting R&D money into it.

This was a few pages back, but uhhh

Qualcomm Plans Exit From Server Chips

Rastor
Jun 2, 2001

repiv posted:

This was a few pages back, but uhhh

Qualcomm Plans Exit From Server Chips

Interesting, maybe Carlyle Group would be interested in adding it to their Ampere Computing startup.


Yesterday Qualcomm announced a new smartwatch SoC, so I'm guessing they see the low-power market as their strength and decided to focus on it.

Cygni
Nov 12, 2005

raring to post

Relevant to the ARM server discussion, STH just released their testing of a ThunderX2 system and it's actually.... really good. Beats both Epyc and Scalable in perf/$ and perf/watt in a lot of tests. Kinda didn't expect that, but thats my own x86 bias.

quote:

The Cavium ThunderX2 that we see today has its origins in the Broadcom Project Vulcan and so many of the features we saw in Cavium ThunderX, such as 40GbE ports, are not present. Instead, we have an Arm chip that can go toe-to-toe with Intel and AMD and come out ahead in some cases. Best of all, the list price of the 32 core top-bin CN9980 part is $1795 about half of the competitive Intel and AMD chips.

https://www.servethehome.com/cavium-thunderx2-review-benchmarks-real-arm-server-option/

But since I'm just a nerd and not in this for a living, baby, i wanna see them glamor shots. Huge rear end socket oh baby yeah:


Devian666
Aug 20, 2008

Take some advice Chris.

Fun Shoe
So how are we supposed to benefit from the 128 pcie lanes on Epyc? What sort of performance could be squeezed out of the hardware?

Kazinsal
Dec 13, 2011



A gargantuan number of PCIe SSDs. :rice:

GRINDCORE MEGGIDO
Feb 28, 1985


Split out the pcie lanes and mine with 100+ cards at once, this is a fantastic idea.

Devian666
Aug 20, 2008

Take some advice Chris.

Fun Shoe

GRINDCORE MEGGIDO posted:

Split out the pcie lanes and mine with 100+ cards at once, this is a fantastic idea.

I could see 128 threads in Epyc being used to produce bulk dried strawberries.

EmpyreanFlux
Mar 1, 2013

The AUDACITY! The IMPUDENCE! The unabated NERVE!

Cygni posted:

Relevant to the ARM server discussion, STH just released their testing of a ThunderX2 system and it's actually.... really good. Beats both Epyc and Scalable in perf/$ and perf/watt in a lot of tests. Kinda didn't expect that, but thats my own x86 bias.


https://www.servethehome.com/cavium-thunderx2-review-benchmarks-real-arm-server-option/

But since I'm just a nerd and not in this for a living, baby, i wanna see them glamor shots. Huge rear end socket oh baby yeah:




Kind buried an important part here.

quote:

We were told that our power consumption results may be 10-15% higher than shipping parts.

quote:

Our Gigabyte/ Cavium ThunderX2 Saber development platform hit a peak of 823W at 100% load.

That's dual socket, what's the known peak draw for Intel and EPYC in dual socket? I feel like it's about 500W, but don't know. So while cheap and effective, it seems insanely hungry.

Also, those single thread results are pretty good considering it's clock deficiency.

https://www.servethehome.com/amd-epyc-7601-dual-socket-early-power-consumption-observations/ - 483W for EPYC dual socket 7601.

I'm assuming Intel is going to be comparable or better with an 8160 or 8180 Platinum.

EmpyreanFlux fucked around with this message at 07:09 on May 10, 2018

SwissArmyDruid
Feb 14, 2014

by sebmojo

Devian666 posted:

I could see 128 threads in Epyc being used to produce bulk dried strawberries.

You joke, but.

https://www.youtube.com/watch?v=1WLIm4XLPAE

EmpyreanFlux
Mar 1, 2013

The AUDACITY! The IMPUDENCE! The unabated NERVE!
Possible 2950X in the wild http://ranker.sisoftware.net/show_run.php?q=c2ffc9ef8eefd2ebdfecdaeec8ba87b791f491ac9cbac9f4cc&l=en

3.89Ghz, 16C/32T, 117W :eyepop:

wargames
Mar 16, 2008

official yospos cat censor

Inter-Core Latency 179.4ns

That seems high even for amd with its inter ccx hops.

eames
May 9, 2009

Pretty normal values for TR looking at their 1950X benchmarks, it kind of is a rebadged server product after all. The target audience won't mind the increased latency and there's the option of disabling half the cores to keep latency sensitive applications on one die.

Methylethylaldehyde
Oct 23, 2004

BAKA BAKA

Kazinsal posted:

A gargantuan number of PCIe SSDs. :rice:

That's legit one of the best uses of the thing, a ridiculous cache box that sits right at the provider edge, with a 1TB of ram, 32 Nvme drives, and enough processing power to flood the twin 40GbE ports. 400k+ IOPS, 57+ GB/sec of aggregate throughput at 128KB, 7 million IOPS and 53 GB/sec at 4k random.

Seamonster
Apr 30, 2007

IMMER SIEGREICH

So what freq would that be at the full beans (180W)?

EmpyreanFlux
Mar 1, 2013

The AUDACITY! The IMPUDENCE! The unabated NERVE!

Seamonster posted:

So what freq would that be at the full beans (180W)?

Probably 4.4Ghz on like 8 cores.

Eletriarnation
Apr 6, 2005

People don't appreciate the substance of things...
objects in space.


Oven Wrangler
I don't think there's an easy formula for something like that, but if you compare the 1950X to 1800X and then look at how much improvement there was from 1800X to 2700X I wouldn't be surprised to see a 2950X that can turbo to 4.5.

Adbot
ADBOT LOVES YOU

Devian666
Aug 20, 2008

Take some advice Chris.

Fun Shoe

I watched that last night. From the building work that I do those old radiators are mostly rust with a little bit of metal. I wasn't surprised to see the state of the water after heating that storage room.

  • 1
  • 2
  • 3
  • 4
  • 5
  • Post
  • Reply