Please use this thread to discuss design and development of stat libraries for SXSW Action...
Hi Aaron.
There is a pretty "normal" (i think) wiki -- an overview of the project is here. From that page there is a link to another wiki page, wherein I was hoping maybe people would help build out a description of alternative approaches. (ie, maybe people who don't have time to code, might at least comment on general directions.)
"We're just trying to visualize the logs of an evolutionary model optimizer? Why? Is there real hope that you can make a better model by presenting a bunch of unlabeled parameters and looking for input from "human intuition"?
Yes, I think I've seen all the pages. When you click "comment" from the project description page, it says this:
"You've followed a link to a page that doesn't exist yet. To create the page, start typingg in the box below. If you are here by mistake, just click your browser's back button."
Does that just mean that there aren't any comments yet?
And I've seen the page for possible approaches. I have two main thoughts:
1) It will be easier for people (at least for me) to think about possible approaches if the overall goal of the project is more clear. What are we trying to do? What would a successful approach lead to? (See my earlier post.)
2) Can we get a complete data set?
Cool, thanks for your responses! I'd just feel better looking at a complete data set, so that I don't feel like I could be chasing things that aren't really there.
More questions:
* The data set has parameters numbered 3, 4, 5, 7, 8, 9, 10, 11, 12, 13, 14, 15, 17, 18, 19, 20, 21, 24, 25, 26, 27, 28, 29, and 30. What's going on? Is this purely a strange numbering, or did someone already decide that parameters 1 and 2, for example, were no good?
* Are the iterations numbers accurately representing ordered runs by the evolutionary algorithm? In many cases (like parameter_5) the choices seem to be actually getting more wide-spread over time, rather than narrowing in on an ideal value. Or is this just a hint that this parameter is not particularly important?
* I haven't read the linked papers - do they describe the particular model being evaluated here? It might be helpful to have some idea of how the parameters interact in the model.
Aaron -- seems our posts were more or less simultaneous; let me know if the above gives you better insight into what we're trying to accomplish.
But to describe in one other way:
We aim to create a set of computer games. And the idea of the games is to present information to users sufficient for them to make a good "guess" about optimal model parameters. And to your point above, the thought is that people will be able to do this better than an evolutionary algorithm alone.
(*In large part we think people will do it better because we can't afford to let the evolutionary algorthm churn though billions of iterations-- that's too expensive, even taking into account the fact that there are tens of thousands of PCs contributing to the effort.... So the objective is to bring some human intelligence into the mix, to guide the process a bit...)
Let me see if I can get Nick, who knows more about the specifics of the model and data to comment here on your last post
(And then let's remember to put any details he can provide into the data documentation here.)
Aaron,
Thanks for your intersest and questions. I've quoted all that seems relevant from your posts below and put my answers in between.
The objective is to improve convergence rate of the optimization process. We hypothesize that a human-aided approach would help with this. If an improved algorithm achieves the same goal (we tried various alternatives), this would also be OK.
2) Can we get a complete data set? (The comment I had wanted to put up was about the available data set being just 2074 records, when it says there are tens of thousands available. It seems like the whole thing could easily be released.)
Yes, these ids are ordered by sampling order.I don't have a good answer to the second question, just to say that the current algorithm does no not vary the sampling range of parameter values as a function of iteration number.
* I haven't read the linked papers - do they describe the particular model being evaluated here? It might be helpful to have some idea of how the parameters interact in the model. The data set has parameters numbered 3, 4, 5, 7, 8, 9, 10, 11, 12, 13, 14, 15, 17, 18, 19, 20, 21, 24, 25, 26, 27, 28, 29, and 30. What's going on? Is this purely a strange numbering, or did someone already decide that parameters 1 and 2, for example, were no good?
This publication (http://www.ajtmh.org/content/75/2_suppl/1.full) gives an overview of the baseline model, and Table 1 lists the parameters to be estimated. I can provide a partial mapping of the numbers in the dataset to these parameters if this would be helpful (some new parameters have been added in the model variants we're currently fitting, and others are not relevant for the new models)
Hope this helps,
Nick
I went ahead and made a bunch of graphs, available here:
https://plus.google.com/photos/112658546306232777448/albums/5708466103108764705
There are a lot of them, and they show what we already knew, which is that the relationships between the model parameters and the loss function are pretty complicated.
The one exception seems to be parameter 21, where it seems that all the really awful loss function values come from having that parameter set on the low end of its scale.
More patterns may come out of looking at just the better parameter combinations (not including really high loss function value rows). Perhaps I'll do that too.
From a theoretical visualization standpoint, I really don't know how to show with a graphic more complex interactions between three or more parameters, in any convenient way... I made all the two-parameter graphs, which is already too many to have to visually inspect, really. (I mean I looked at them all, but I don't know how much I got from it.) I think there may be an interesting problem here, and I'm not sure if I'm missing good existing work on it. I'd like to see some. Not sure if I'll be able to do much myself right now.