Maximal Methodisch - das Methodenseminar MAXIMAL METHODISCH im INTRAnet: tmikpoh.charite.de/methods.
Randomly Sampling Maximal Itemsetspoloclub.gatech.edu/idea2013//papers/IDEA_RMIS.pdf · DEMO TIME ....
Transcript of Randomly Sampling Maximal Itemsetspoloclub.gatech.edu/idea2013//papers/IDEA_RMIS.pdf · DEMO TIME ....
![Page 1: Randomly Sampling Maximal Itemsetspoloclub.gatech.edu/idea2013//papers/IDEA_RMIS.pdf · DEMO TIME . Author: Sandy Moens Created Date: 8/11/2013 11:44:08 AM ...](https://reader035.fdocuments.us/reader035/viewer/2022063006/5fb6e4173326c73cb302a3c7/html5/thumbnails/1.jpg)
Randomly Sampling Maximal Itemsets Sandy Moens and Bart Goethals
![Page 2: Randomly Sampling Maximal Itemsetspoloclub.gatech.edu/idea2013//papers/IDEA_RMIS.pdf · DEMO TIME . Author: Sandy Moens Created Date: 8/11/2013 11:44:08 AM ...](https://reader035.fdocuments.us/reader035/viewer/2022063006/5fb6e4173326c73cb302a3c7/html5/thumbnails/2.jpg)
2
Frequent Itemset Mining
• Finding interesting patterns by e.g. support
• Problems: - Much redundancy - Many, many patterns
![Page 3: Randomly Sampling Maximal Itemsetspoloclub.gatech.edu/idea2013//papers/IDEA_RMIS.pdf · DEMO TIME . Author: Sandy Moens Created Date: 8/11/2013 11:44:08 AM ...](https://reader035.fdocuments.us/reader035/viewer/2022063006/5fb6e4173326c73cb302a3c7/html5/thumbnails/3.jpg)
3
Frequent Itemset Mining
• Finding interesting patterns by support
• Problems: - Much redundancy - Many, many patterns
![Page 4: Randomly Sampling Maximal Itemsetspoloclub.gatech.edu/idea2013//papers/IDEA_RMIS.pdf · DEMO TIME . Author: Sandy Moens Created Date: 8/11/2013 11:44:08 AM ...](https://reader035.fdocuments.us/reader035/viewer/2022063006/5fb6e4173326c73cb302a3c7/html5/thumbnails/4.jpg)
4
Frequent Itemset Mining
• Finding interesting patterns by support
• Problems: - Much redundancy - Many, many patterns
![Page 5: Randomly Sampling Maximal Itemsetspoloclub.gatech.edu/idea2013//papers/IDEA_RMIS.pdf · DEMO TIME . Author: Sandy Moens Created Date: 8/11/2013 11:44:08 AM ...](https://reader035.fdocuments.us/reader035/viewer/2022063006/5fb6e4173326c73cb302a3c7/html5/thumbnails/5.jpg)
5
Pattern Set Mining
• Less redundancy • Less patterns • But: large enumeration space!
![Page 6: Randomly Sampling Maximal Itemsetspoloclub.gatech.edu/idea2013//papers/IDEA_RMIS.pdf · DEMO TIME . Author: Sandy Moens Created Date: 8/11/2013 11:44:08 AM ...](https://reader035.fdocuments.us/reader035/viewer/2022063006/5fb6e4173326c73cb302a3c7/html5/thumbnails/6.jpg)
6
Pattern Set Mining
• Less redundancy • Less patterns • But: large enumeration space!
Step 1: Enumerate
![Page 7: Randomly Sampling Maximal Itemsetspoloclub.gatech.edu/idea2013//papers/IDEA_RMIS.pdf · DEMO TIME . Author: Sandy Moens Created Date: 8/11/2013 11:44:08 AM ...](https://reader035.fdocuments.us/reader035/viewer/2022063006/5fb6e4173326c73cb302a3c7/html5/thumbnails/7.jpg)
7
Pattern Set Mining
• Less redundancy • Less patterns • But: large enumeration space!
Step 1: Enumerate Step 2: Filter
![Page 8: Randomly Sampling Maximal Itemsetspoloclub.gatech.edu/idea2013//papers/IDEA_RMIS.pdf · DEMO TIME . Author: Sandy Moens Created Date: 8/11/2013 11:44:08 AM ...](https://reader035.fdocuments.us/reader035/viewer/2022063006/5fb6e4173326c73cb302a3c7/html5/thumbnails/8.jpg)
8
Output Space Sampling
• No explicit enumeration
![Page 9: Randomly Sampling Maximal Itemsetspoloclub.gatech.edu/idea2013//papers/IDEA_RMIS.pdf · DEMO TIME . Author: Sandy Moens Created Date: 8/11/2013 11:44:08 AM ...](https://reader035.fdocuments.us/reader035/viewer/2022063006/5fb6e4173326c73cb302a3c7/html5/thumbnails/9.jpg)
9
Output Space Sampling
• No explicit enumeration
![Page 10: Randomly Sampling Maximal Itemsetspoloclub.gatech.edu/idea2013//papers/IDEA_RMIS.pdf · DEMO TIME . Author: Sandy Moens Created Date: 8/11/2013 11:44:08 AM ...](https://reader035.fdocuments.us/reader035/viewer/2022063006/5fb6e4173326c73cb302a3c7/html5/thumbnails/10.jpg)
10
Random Maximal Itemset Sampling
• Long patterns with low support - E.g. microarray data, recommendation
• Simple random walk over extensions - Quality measure q - Approximation measure p
![Page 11: Randomly Sampling Maximal Itemsetspoloclub.gatech.edu/idea2013//papers/IDEA_RMIS.pdf · DEMO TIME . Author: Sandy Moens Created Date: 8/11/2013 11:44:08 AM ...](https://reader035.fdocuments.us/reader035/viewer/2022063006/5fb6e4173326c73cb302a3c7/html5/thumbnails/11.jpg)
11
Random Walk
![Page 12: Randomly Sampling Maximal Itemsetspoloclub.gatech.edu/idea2013//papers/IDEA_RMIS.pdf · DEMO TIME . Author: Sandy Moens Created Date: 8/11/2013 11:44:08 AM ...](https://reader035.fdocuments.us/reader035/viewer/2022063006/5fb6e4173326c73cb302a3c7/html5/thumbnails/12.jpg)
12
Random Walk
![Page 13: Randomly Sampling Maximal Itemsetspoloclub.gatech.edu/idea2013//papers/IDEA_RMIS.pdf · DEMO TIME . Author: Sandy Moens Created Date: 8/11/2013 11:44:08 AM ...](https://reader035.fdocuments.us/reader035/viewer/2022063006/5fb6e4173326c73cb302a3c7/html5/thumbnails/13.jpg)
13
Random Walk
![Page 14: Randomly Sampling Maximal Itemsetspoloclub.gatech.edu/idea2013//papers/IDEA_RMIS.pdf · DEMO TIME . Author: Sandy Moens Created Date: 8/11/2013 11:44:08 AM ...](https://reader035.fdocuments.us/reader035/viewer/2022063006/5fb6e4173326c73cb302a3c7/html5/thumbnails/14.jpg)
14
Random Walk
![Page 15: Randomly Sampling Maximal Itemsetspoloclub.gatech.edu/idea2013//papers/IDEA_RMIS.pdf · DEMO TIME . Author: Sandy Moens Created Date: 8/11/2013 11:44:08 AM ...](https://reader035.fdocuments.us/reader035/viewer/2022063006/5fb6e4173326c73cb302a3c7/html5/thumbnails/15.jpg)
15
Spreading the Search
• Uniform Metropolis-Hastings - E.g. Hasan and Zaki, Musk: Uniform sampling of
k-maximal patterns (SDM’09)
• Weight approximation score - Additive - Multiplicative - Adaptive
![Page 16: Randomly Sampling Maximal Itemsetspoloclub.gatech.edu/idea2013//papers/IDEA_RMIS.pdf · DEMO TIME . Author: Sandy Moens Created Date: 8/11/2013 11:44:08 AM ...](https://reader035.fdocuments.us/reader035/viewer/2022063006/5fb6e4173326c73cb302a3c7/html5/thumbnails/16.jpg)
16
DEMO TIME