View Single Post
  #6  
Old 08-11-2023, 07:20 PM
DeathsSilkyMist DeathsSilkyMist is offline
Planar Protector

DeathsSilkyMist's Avatar

Join Date: Jan 2014
Posts: 8,174
Default

Quote:
Originally Posted by Troxx [You must be logged in to view images. Log in or Register.]
If that will help you sleep better tonight, you go along thinking that sugar-bear.
I am not sure why you think you have provided anything of substance in this thread. So far you have given us a lot of gifs, bad troll attempts, and a parse that you deemed to be invalid. You have contributed nothing, but expect victory to be handed to you because other people agree with you. If you want to let the thread die, just let it die. But you haven't done anything here to support your position.

Quote:
Originally Posted by bcbrown [You must be logged in to view images. Log in or Register.]
Optimization needs:
  1. Some outcome you want to minimize (or maximize), and a way to measure it
  2. A set of parameters to fiddle with

Multi-armed bandits: Imagine you're at a casino with a row of slot machines, which are sometimes called "one-armed bandits". Each slot machine has a different unknown payout rate. How should you choose which machine to play (exploit), and when should you choose to stick with the current machine versus move to another machine (explore)? Epsilon-greedy is one strategy for how to choose when to explore and when to exploit.

Recursive gradient descent: Imagine the space of possible outcomes is a valley, and you're somewhere on the side of that valley. Play an iterated game: from where you are currently, figure out which direction is "most downhill"; take a step in that direction, then repeat.

Simulated annealing: imagine there's a small pond or low spot or "false valley" on the hillside of the valley. To avoid getting stuck there and never finding the actual valley, change the size of your step, and each time add a small random jump. That way, you have a chance of getting out of the false valley and eventually finding the real valley. The metaphor is with metallurgy, where the final crystal structure is a function of the rate of cooling of the hot metal; you add a "temperature" parameter that automatically lowers over the course of the iterations, so that you take smaller and smaller steps with less random jumps. Instead of metallurgy you can also think about tempering chocolate when cooking.

Banana problem: Say you want to recommend some food item to a shopper on your grocery site. You might start by saying, based on the current item the shopper is looking at, what is the item that has been historically bought most often by people who bought that item? This is the "people who bought this also bought that" set of recommendations on Amazon. The problem is, bananas are so popular that no matter what item you're looking at, it's likely that the item most commonly paired with that item is bananas. So you need to adjust the ranking to account for the overall popularity of each item. So the new question is: given this item, what items are bought along with it at a rate above the base rate.

Collaborative recommendation systems: We will recommend something for you based on the past behavior of other shoppers.
Thanks for taking the time to post this!
Reply With Quote