Register a SA Forums Account here!
JOINING THE SA FORUMS WILL REMOVE THIS BIG AD, THE ANNOYING UNDERLINED ADS, AND STUPID INTERSTITIAL ADS!!!

You can: log in, read the tech support FAQ, or request your lost password. This dumb message (and those ads) will appear on every screen until you register! Get rid of this crap by registering your own SA Forums Account and joining roughly 150,000 Goons, for the one-time price of $9.95! We charge money because it costs us money per month for bills, and since we don't believe in showing ads to our users, we try to make the money back through forum registrations.
 
  • Post
  • Reply
distortion park
Apr 25, 2011


we replaced a vendors "ml" process with an afternoon writing regexes and got better result. seems like a rip off op, I wouldn't buy again.

Adbot
ADBOT LOVES YOU

distortion park
Apr 25, 2011


poty posted:

i read like 50% of the op before work today will get to the rest soon

can you use ml to predict where the s&p 500 is going to be in 3 months and let me know? would like to buy some options thanks

no but if you say you can convincingly enough you'll make a lot of money regardless.

distortion park
Apr 25, 2011


to do ml you need lots of data. there are a few types of data:
* open source standard image recognition etc data sets. only use this if you are confident your technique is better than the previous attempts.
* random free data you can find to download on the internet. this is normally boring stuff like poverty rates in different postal areas or how polluted a river is at different points along its course. skip this stuff unless you're doing a thesis for uni
* "free" data which isn't sold as such, but you can scrape from the web. this might how much a phone charger costs on amazon. more interesting but you will need to have some proxies and some dev time to get the data. the cheapest proxies are bot nets or these guys who totally have their endpoints consent https://luminati.io/
* financial data vendors will happily sell you lots of data for lots of money, but everything about the experience will suck. they will all handle edge cases in different ways. there will be bullshit restrictions on how you can use the data (only on one machine, can't make a graph and show it to a client etc) but you can ignore those
* the good stuff is the personal behaviour data. this is geolocation (from phones), internet searches and traffic (from free "antivirus"), emails (from email apps and spam removers), credit card transactions (idk exactly how they get this i think it's apps as well). this stuff is all expensive and the vendors are rubbish at delivery. occaisionally they'll gently caress up and send you some non-anonymised data (as though you couldn't work out who spent 8 hours at a school and then 3 at a strip club was if you really wanted to) at which point they'll email you and tell you to delete it and import a new version. might be tricky to get some of this past your ethics committee at uni but it's all legal, even in the EU as it is anonymised.

for supervised learning you'll need a training set. outsource this to india for best results.

distortion park fucked around with this message at 14:43 on May 31, 2019

distortion park
Apr 25, 2011


Pinterest Mom posted:

is the reason "the outcome you're looking for (presence of cancer) is not the same as outcome you're training on (diagnosis of cancer among women tested)"

The outcome you're looking for is improved quality adjusted life years*, and the relationship between that and seeing a tumour in a screen is so complex that you have to measure that, not just if you can find a tumour.

*Sometimes just a cost reduction is ok too.

distortion park
Apr 25, 2011


I had to read that like 5 times before I realised he meant they used the full date and not like day of week or something

distortion park
Apr 25, 2011


https://twitter.com/osazuwa/status/1274444300894572546?s=19

I had sort of glossed over this the first time but it really does make everyone white lol

distortion park
Apr 25, 2011


https://twitter.com/JoshuaDummer/status/1280877750245453828?s=19


finally a good use for racist ai,,,,

distortion park
Apr 25, 2011


https://www.jcwi.org.uk/news/we-won-home-office-to-stop-using-racist-visa-algorithm

The Home Office hosed up here by having and "expert" systems type thing which was obviously racist, rather than some black box ml model which was also obviously racist as it used a history of home office decisions to train on, but you could maybe convince a court otherwise

distortion park
Apr 25, 2011



The replies to the Simpsons ones where people have done colour,head swaps are kind of interesting. I'd be surprised (not very, but a little) if Twitter had released an algorithm which is as obviously racist as the headline examples make it seem

distortion park
Apr 25, 2011



lmao

Adbot
ADBOT LOVES YOU

distortion park
Apr 25, 2011


https://www.wsj.com/articles/when-the-machines-learn-to-price-gouge-11601281879?redirect=amp#click=https://t.co/BDhFcxs8fs


quote:

In such markets where both stations appeared to adopt algorithmic software, as estimated by sudden changes in the size and rapidity of price changes, margins increased by an average of almost 30%. Without pricing software at both stations, margins were unchanged.

Notably this would be illegal of people were doing it - the computers presumably manage some sort of weird pseudo-communication through their price signals and end up forming a cartel!

  • 1
  • 2
  • 3
  • 4
  • 5
  • Post
  • Reply