Commit c4fbdce

Update README.md
1 parent 5481d9b commit c4fbdce

1 file changed: +47 -0 lines changed

README.md

Lines changed: 47 additions & 0 deletions
@@ -1 +1,48 @@
[Documentation etc.](http://www.davidchudzicki.com/predcomps/)

Interested in getting involved? Here are some ways to help:

- Try out the package on examples and write about the results. Does it help you better understand your complicated models? If not, what should be different? You can add examples to the documentation here via pull requests, or write in your own space.
- Clarify the language in the documentation. What can be made clearer?
- Work on any of the items under "Future Work" below.

## Future Work

(The first item is definitely the most important. I'm unsure about the priority of the rest.)

### Explicit model for p(u|v)

Currently, [the way we assign weights](http://0.0.0.0:4000/more-pairs-and-weights.html) to sample from p(u|v) (roughly as described in the paper) requires some hand-tweaking to work well in each individual example. It is also hard to generalize to categorical inputs. This may be the biggest barrier to widespread adoption.
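
To make the idea of weight-based sampling concrete, here is a minimal sketch in Python (not the package's actual R code). It assumes a weight of the form 1 / (1 + squared distance between the v values), and the names `pair_weights`, `sample_u_given_v`, and the `scale` parameter are hypothetical, chosen only for illustration. The `scale` knob is the kind of setting that would need hand-tweaking per example.

```python
import random


def pair_weights(v_rows, i, scale=1.0):
    """Weight of every row j as a donor for row i, decaying with distance in v.

    Assumed form (an illustration, not the package's code):
    w_ij = 1 / (1 + squared scaled distance between v_i and v_j).
    """
    weights = []
    for v_j in v_rows:
        sq_dist = sum(((a - b) / scale) ** 2 for a, b in zip(v_rows[i], v_j))
        weights.append(1.0 / (1.0 + sq_dist))
    return weights


def sample_u_given_v(u_values, v_rows, i, rng, scale=1.0):
    """Approximate a draw from p(u | v = v_i) by picking a row's u
    with probability proportional to its weight."""
    weights = pair_weights(v_rows, i, scale)
    return rng.choices(u_values, weights=weights, k=1)[0]


# Toy data: rows 0 and 1 have similar v; row 2 is far away.
v_rows = [(0.0,), (0.1,), (5.0,)]
u_values = [10.0, 11.0, 50.0]

weights = pair_weights(v_rows, 0)
rng = random.Random(0)
draws = [sample_u_given_v(u_values, v_rows, 0, rng) for _ in range(1000)]
```

In this toy example, the row with the distant v gets a much smaller weight, so its u value is rarely drawn. Because the distance is numeric, the sketch also shows why categorical inputs do not fit this scheme directly.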

As an alternative, perhaps we can explicitly build a model for the desired conditional distribution, e.g. by using something like [BART](https://github.com/kapelner/bartMachine).

Todo:

- implement
- see how well it works

### Categorical inputs

Once we've done "Explicit model for p(u|v)", allowing categorical inputs should be much easier, but it will still require some thought.

### Sensitivity Analysis

Some of the examples show how to do sensitivity analysis in the spirit of this package, but it'd be great to have the package do it for you.

### "Variable Importance"

Implement something like [conditional variable importance](http://www.biomedcentral.com/1471-2105/9/307) in the spirit of this package. (This is like "permutation importance", but instead of permuting the input, you'd sample it from the conditional distribution p(u|v).)
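
As a rough illustration of the idea, the following Python sketch measures how much a model's squared error grows when u is replaced by draws from an approximate p(u|v). It is a sketch under stated assumptions, not an implementation from this package or from the linked paper: the function name `conditional_importance`, the distance-based weights 1 / (1 + squared distance in v), and the `scale` and `n_draws` parameters are all hypothetical.

```python
import random


def conditional_importance(predict, u_values, v_rows, y_values, rng,
                           scale=1.0, n_draws=20):
    """Mean increase in squared error when u is redrawn from an approximate p(u|v).

    The conditional distribution is approximated by resampling u from other
    rows, weighted by closeness in v (an assumption made for this sketch).
    """
    n = len(u_values)
    base = sum((predict(u_values[i], v_rows[i]) - y_values[i]) ** 2
               for i in range(n)) / n
    total = 0.0
    for i in range(n):
        # Rows whose v is close to row i's v are more likely donors of u.
        weights = [
            1.0 / (1.0 + sum(((a - b) / scale) ** 2
                             for a, b in zip(v_rows[i], v_j)))
            for v_j in v_rows
        ]
        for u_new in rng.choices(u_values, weights=weights, k=n_draws):
            total += (predict(u_new, v_rows[i]) - y_values[i]) ** 2
    return total / (n * n_draws) - base


# Toy data where the outcome is y = u + v.
data_rng = random.Random(1)
u_values = [data_rng.gauss(0, 1) for _ in range(50)]
v_rows = [(data_rng.gauss(0, 1),) for _ in range(50)]
y_values = [u + v[0] for u, v in zip(u_values, v_rows)]

# A model that uses u, and one that ignores it.
imp_used = conditional_importance(
    lambda u, v: u + v[0], u_values, v_rows, y_values, random.Random(2))
imp_ignored = conditional_importance(
    lambda u, v: v[0], u_values, v_rows, y_values, random.Random(2))
```

The model that depends on u gets a positive importance, while the model that ignores u gets an importance of zero up to floating-point rounding, since redrawing u cannot change its predictions.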

### Other tools/methods for understanding complicated models

I'd like to compile a list of other work in this direction, maybe comparing each with this package.

I should add a page discussing other methods people have used to get at somewhat the same idea:

- [conditional variable importance](http://www.biomedcentral.com/1471-2105/9/307) - out of everything I've seen, this is the most similar in spirit to this package
- randomForest package in R (partial plots, variable importance)
- earth package in R (variable importance)
