Exploratory Analysis in Cube Space : Exploratory Analysis in Cube Space Raghu Ramakrishnan
ramakris@yahoo-inc.com
Yahoo! Research
Databases and Data Mining : Databases and Data Mining What can database systems offer in the grand challenge of understanding and learning from the flood of data we’ve unleashed?
The plumbing
Scalability
Databases and Data Mining : Databases and Data Mining What can database systems offer in the grand challenge of understanding and learning from the flood of data we’ve unleashed?
The plumbing
Scalability
Ideas!
Declarativeness
Compositionality
Ways to conceptualize your data
About this Talk : About this Talk Joint work with many people
Common theme—multidimensional view of the data:
Helps handle imprecision
Analyzing imprecise and aggregated data
Defines candidate space of subsets for exploratory mining
Forecasting query results over “future data”
Using predictive models as summaries
Restricting candidate clusters
Potentially, space of “mining experiments”?
Driving Applications : Driving Applications Business Intelligence of combined text and relational data (Joint with IBM)
Burdick, Deshpande, Jayram, Vaithyanathan
Analyzing mass spectra from ATOFMS (NSF ITR project with environmental chemists at UW and Carleton College)
Chen, Chen, Huang, Musicant, Grossman, Schauer
Goal-oriented anonymization of cancer data (NSF CyberTrust project)
Chen, LeFevre, DeWitt, Shavlik, Hanrahan (Chief Epidemiologist, Wisconsin), Trentham-Dietz
Analyzing network traffic data
Chen, Yegneswaran, Barford
Background: The Multidimensional Data ModelCube Space : Background: The Multidimensional Data Model Cube Space
Star Schema : Star Schema SERVICE
pid
timeid
locid
repair PRODUCT
pid
pname
Category
Model TIME
timeid
date
week
year LOCATION
locid
country
region
state “FACT” TABLE DIMENSION TABLES
Dimension Hierarchies : Dimension Hierarchies For each dimension, the set of values can be organized in a hierarchy: PRODUCT TIME LOCATION category week month region model date state year automobile quarter country
Multidimensional Data Model : Multidimensional Data Model One fact table D=(X,M)
X=X1, X2, ... Dimension attributes
M=M1, M2,… Measure attributes
Domain hierarchy for each dimension attribute:
Collection of domains Hier(Xi)= (Di(1),..., Di(k))
The extended domain: EXi = 1≤k≤t DXi(k)
Value mapping function: γD1D2(x)
e.g., γmonthyear(12/2005) = 2005
Form the value hierarchy graph
Stored as dimension table attribute (e.g., week for a time value) or conversion functions (e.g., month, quarter)
Slide10 : MA NY TX CA West East ALL LOCATION Civic Sierra F150 Camry Truck Sedan ALL Automobile Model Category Region State ALL ALL 1 3 2 2 1 3 Multidimensional Data p3 p1 p4 p2 DIMENSION
ATTRIBUTES
Cube Space : Cube Space Cube space: C = EX1EX2…EXd
Region: Hyper rectangle in cube space
c = (v1,v2,…,vd) , vi EXi
Region granularity:
gran(c) = (d1, d2, ..., dd), di = Domain(c.vi)
Region coverage:
coverage(c) = all facts in c
Region set: All regions with same granularity
OLAP Over Imprecise Datawith Doug Burdick, Prasad Deshpande, T.S. Jayram, and Shiv VaithyanathanIn VLDB 05, 06 joint work with IBM Almaden : OLAP Over Imprecise Data with Doug Burdick, Prasad Deshpande, T.S. Jayram, and Shiv Vaithyanathan In VLDB 05, 06 joint work with IBM Almaden
Slide13 : MA NY TX CA West East ALL LOCATION Civic Sierra F150 Camry Truck Sedan ALL Automobile Model Category Region State ALL ALL 1 3 2 2 1 3 p5 Imprecise Data p3 p1 p4 p2
Querying Imprecise Facts : Querying Imprecise Facts p3 p1 p4 p2 p5 MA NY Sierra F150 Truck East Auto = F150
Loc = MA
SUM(Repair) = ??? How do we treat p5?
Allocation (1) : p3 p1 p4 p2 p5 MA NY Truck East Allocation (1)
Allocation (2) : p3 p1 p4 p2 MA NY Truck East Allocation (2) p5 p5 (Huh? Why 0.5 / 0.5?
- Hold on to that thought)
Allocation (3) : p3 p1 p4 p2 MA NY Truck East Allocation (3) p5 p5 Auto = F150
Loc = MA
SUM(Repair) = 150 Query the Extended Data Model!
Allocation Policies : Allocation Policies Procedure for assigning allocation weights is referred to as an allocation policy
Each allocation policy uses different information to assign allocation weight
Key contributions:
Appropriate characterization of the large space of allocation policies (VLDB 05)
Designing efficient algorithms for allocation policies that take into account the correlations in the data (VLDB 06)
Motivating Example : Sierra F150 Truck MA NY East p5 Motivating Example Query: COUNT
Desideratum I: Consistency : Desideratum I: Consistency Consistency specifies the relationship between answers to related queries on a fixed data set
Sierra F150 Truck MA NY East p1 p3 p5 p2
Desideratum II: Faithfulness : Desideratum II: Faithfulness Faithfulness specifies the relationship between answers to a fixed query on related data sets Sierra F150 MA NY Data Set 1 Data Set 2 Data Set 3
Slide22 : p1 p2 p4 p1 p3 p5 p2 p1 p3 p4 p5 p2 p4 p1 p3 p5 p2 w1 w2 w3 w4 Imprecise facts
lead to many
possible worlds
[Kripke63, …]
Query Semantics : Query Semantics Given all possible worlds together with their probabilities, queries are easily answered using expected values
But number of possible worlds is exponential!
Allocation gives facts weighted assignments to possible completions, leading to an extended version of the data
Size increase is linear in number of (completions of) imprecise facts
Queries operate over this extended version
Storing Allocations using the Extended Data Model : Storing Allocations using the Extended Data Model p3 p1 p4 p2 p5 Truck East
Allocation Policy: Count : p3 p1 p4 p2 MA NY Sierra F150 Truck East Allocation Policy: Count p5 p5 p6 c1 c2
Allocation Policy: Measure : p3 p1 p4 p2 MA NY Sierra F150 Truck East Allocation Policy: Measure p5 p5 p6 c1 c2
Allocation Policy Template : Allocation Policy Template
Allocation Graph : Allocation Graph
Example Processing of Allocation Graph : Example Processing of Allocation Graph Cell(NY,F150) Cell(NY,Sierra) Cell(MA,F150) Cell(MA,Sierra) Precise Cells Imprecise Facts 1) Compute Qsum(r) 2) Compute pc,r 2 1 3 2 / 3 1 / 3 Cell(MA,Civic)
Processing Allocation Graph : Processing Allocation Graph p6 p7 p8 p9 p10 p11 p12 p13 p14 Cell(MA,Civic) Cell(MA,Sierra) Cell(NY,F150) Cell(CA,Civic) Cell(CA,Sierra) c1 c2 c3 c4 c5 What if precise cells and imprecise facts do not fit into memory?
Need to scan precise cells twice for each imprecise fact Identify groups of imprecise facts that can be processed in same scan
Algorithm will process these groups
Summary : Summary Consistency and faithfulness
Desiderata for designing query semantics for imprecise data
Allocation is the key to our framework
Aggregation operators with appropriate guarantees of consistency and faithfulness
Efficient algorithms for allocation policies
Lots of recent work on uncertainty and probabilistic data processing
Sensor data, errors, Bayesian inference … VLDB 05 (semantics), 06 (implementation)
Bellwether Analysis:Global Aggregates from Local Regionswith Beechun Chen, Jude Shavlik, and Pradeep TammaIn VLDB 06 : Bellwether Analysis: Global Aggregates from Local Regions with Beechun Chen, Jude Shavlik, and Pradeep Tamma In VLDB 06
Motivating Example : Motivating Example A company wants to predict the first year worldwide profit of a new item (e.g., a new movie)
By looking at features and profits of previous (similar) movies, we predict expected total profit (1-year US sales) for new movie
Wait a year and write a query! If you can’t wait, stay awake …
The most predictive “features” may be based on sales data gathered by releasing the new movie in many “regions” (different locations over different time periods).
Example “region-based” features: 1st week sales in Peoria, week-to-week sales growth in Wisconsin, etc.
Gathering this data has a cost (e.g., marketing expenses, waiting time)
Problem statement: Find the most predictive region features that can be obtained within a given “cost budget”
Key Ideas : Key Ideas Large datasets are rarely labeled with the targets that we wish to learn to predict
But for the tasks we address, we can readily use OLAP queries to generate features (e.g., 1st week sales in Peoria) and even targets (e.g., profit) for mining
We use data-mining models as building blocks in the mining process, rather than thinking of them as the end result
The central problem is to find data subsets (“bellwether regions”) that lead to predictive features which can be gathered at low cost for a new case
Motivating Example : Motivating Example A company wants to predict the first year’s worldwide profit for a new item, by using its historical database
Database Schema: The combination of the underlined attributes forms a key
A Straightforward Approach : A Straightforward Approach Build a regression model to predict item profit
There is much room for accuracy improvement! By joining and aggregating tables in the historical database
we can create a training set: Item-table features Target An Example regression model:
Profit = 0 + 1 Laptop + 2 Desktop +
3 RdExpense
Using Regional Features : Using Regional Features Example region: [1st week, HK]
Regional features:
Regional Profit: The 1st week profit in HK
Regional Ad Expense: The 1st week ad expense in HK
A possibly more accurate model:
Profit[1yr, All] = 0 + 1 Laptop + 2 Desktop + 3 RdExpense +
4 Profit[1wk, HK] + 5 AdExpense[1wk, HK]
Problem: Which region should we use?
The smallest region that improves the accuracy the most
We give each candidate region a cost
The most “cost-effective” region is the bellwether region
Basic Bellwether Problem : Basic Bellwether Problem
Basic Bellwether Problem : Basic Bellwether Problem Historical database: DB
Training item set: I
Candidate region set: R
E.g., { [1-n week, Location] }
Target generation query:i(DB) returns the target value of item i I
E.g., sum(Profit) i, [1-52, All] ProfitTable
Feature generation query: i,r(DB), i Ir and r R
Ir: The set of items in region r
E.g., [ Categoryi, RdExpensei, Profiti, [1-n, Loc], AdExpensei, [1-n, Loc] ]
Cost query: r(DB), r R, the cost of collecting data from r
Predictive model: hr(x), r R, trained on {(i,r(DB), i(DB)) : i Ir}
E.g., linear regression model Location domain hierarchy
Basic Bellwether Problem : Basic Bellwether Problem Aggregate over data records
in region r = [1-2, USA] Features i,r(DB) Target i(DB) Total Profit
in [1-52, All] For each region r, build a predictive model hr(x); and then choose bellwether region:
Coverage(r) fraction of all items in region minimum coverage support
Cost(r, DB) cost threshold
Error(hr) is minimized r
Experiment on a Mail Order Dataset : Experiment on a Mail Order Dataset Bel Err: The error of the bellwether region found using a given budget
Avg Err: The average error of all the cube regions with costs under a given budget
Smp Err: The error of a set of randomly sampled (non-cube) regions with costs under a given budget [1-8 month, MD] Error-vs-Budget Plot (RMSE: Root Mean Square Error)
Experiment on a Mail Order Dataset : Experiment on a Mail Order Dataset Uniqueness Plot Y-axis: Fraction of regions that are as good as the bellwether region
The fraction of regions that satisfy the constraints and have errors within the 99% confidence interval of the error of the bellwether region
We have 99% confidence that that [1-8 month, MD] is a quite unusual bellwether region [1-8 month, MD]
Basic Bellwether Computation : Basic Bellwether Computation OLAP-style bellwether analysis
Candidate regions: Regions in a data cube
Queries: OLAP-style aggregate queries
E.g., Sum(Profit) over a region
Efficient computation:
Use iceberg cube techniques to prune infeasible regions (Beyer-Ramakrishnan, ICDE 99; Han-Pei-Dong-Wang SIGMOD 01)
Infeasible regions: Regions with cost > B or coverage < C
Share computation by generating the features and target values for all the feasible regions all together
Exploit distributive and algebraic aggregate functions
Simultaneously generating all the features and target values reduces DB scans and repeated aggregate computation
Subset Bellwether Problem : Subset Bellwether Problem
Subset-Based Bellwether Prediction : Subset-Based Bellwether Prediction Motivation: Different subsets of items may have different bellwether regions
E.g., The bellwether region for laptops may be different from the bellwether region for clothes
Two approaches: Bellwether Tree Bellwether Cube R&D Expenses Category
Bellwether Tree : Bellwether Tree How to build a bellwether tree
Similar to regression tree construction
Starting from the root node, recursively split the current leaf node using the “best split criterion”
A split criterion partitions a set of items into disjoint subsets
Pick the split that reduces the error the most
Stop splitting when the number of items in the current leaf node falls under a threshold value
Prune the tree to avoid overfitting 1 2 7 3 4 8 9 5 6
Bellwether Tree : Bellwether Tree How to split a node
Split criterion:
Numeric split: Ak
Categorical split: Ak
(Ak is an item-table feature)
Pick the best split criterion
Best split: The split that can reduce the error the most Find bellwether region for S
h: Bellwether model for S Find bellwether region for Sp
hp: Bellwether model for Sp (S is the set of items at the parent node, and Sp is the set of items at the pth child node) Total parent error Total child error
Problem of Naïve Tree Construction : Problem of Naïve Tree Construction A naïve bellwether tree construction algorithm will scan the dataset nm times
n is the number of nodes
m is the number of candidate split criteria
Idea: Extending the RainForest framework [Gehrke et al., 98] 1 2 7 3 4 8 9 5 6 For each node:
Try all candidate split criteria to find the best one
It needs to scan the dataset m times
Efficient Tree Construction : Efficient Tree Construction Idea: Extending the RainForest framework [Gehrke et al., 98]
Build the tree level by level
Scan the entire dataset once per level and keep small sufficient statistics in memory (size: O(nsc))
Sufficient Statistics for a split criterion
|Sp| and Error(hp | Sp),
for p = 1 to # of children
Split all the nodes at that level
after the scan based on the sufficient statistics
Further improved by a hybrid algorithm 1 2 3 4 5 6 7 8 9 1st scan 2nd scan 3rd scan 4th scan
Bellwether Cube : Bellwether Cube R&D Expenses R&D Expenses Category Category The number in a cell is the error of the bellwether region for that subset of items Rollup Drilldown
Problem of Naïve Cube Construction : Problem of Naïve Cube Construction A naïve bellwether cube construction algorithm will conduct a basic bellwether search for the subset of items in each cell
A basic bellwether search involves building a model for each candidate region For each cell:
Build a model for each
candidate region
Efficient Cube Construction : Efficient Cube Construction Idea: Transform model construction into computation of distributive or algebraic aggregate functions
Let S1, …, Sn partition S
S = S1 … Sn and Si Sj =
Distributive function: (S) = F({(S1), …, (Sn)})
E.g., Count(S) = Sum({Count(S1), …, Count(Sn)})
Algebraic function: (S) = F({G(S1), …, G(Sn)})
G(Si) returns a length-fixed vector of values
E.g., Avg(S) = F({G(S1), …, G(Sn)})
G(Si) = [Sum(Si), Count(Si)]
F({[a1, b1], …, [an, bn]}) = Sum({ai}) / Sum({bi})
Efficient Cube Construction : Efficient Cube Construction Build models for each finest-grained cells
For higher-level cells, use data cube computation techniques to compute the aggregate functions For each finest-grained cell:
Build models to find the
bellwether region
For each higher-level cell:
Compute aggregate functions
to find the bellwether region
Efficient Cube Construction : Efficient Cube Construction Classification models:
Use the prediction cube [Chen et al., 05] execution framework
Regression models: (Weighted linear regression model; builds on work in Chen-Dong-Han-Wah-Wang VLDB 02)
Having the sum of squared error (SSE) for each candidate region is sufficient to find the bellwether region
SSE(S) is an algebraic function, where S is a set of item
SSE(S) = q( { g(Sk) : k = 1, …, n } )
S1, …, Sn partition S
g(Sk) = YkWkYk, XkWkXk, XkWkYk
q({Ak, Bk, Ck : k = 1, …, n}) = k Ak (k Ck)(k Bk)1(k Ck) Yk is the vector of target values for set Sk of items
Xk is the matrix of features for set Sk of items
Wk is the weight matrix for set Sk of items where
Experimental Results : Experimental Results
Experimental Results: Summary : Experimental Results: Summary We have shown the existence of bellwether regions on a real mail-order dataset
We characterize the behavior of bellwether trees and bellwether cubes using synthetic datasets
We show our computation techniques improve efficiency by orders of magnitude
We show our computation techniques scale linearly in the size of the dataset
Characteristics of Bellwether Trees & Cubes : Characteristics of Bellwether Trees & Cubes Dataset generation:
Use random tree to generate
different bellwether regions
for different subset of items
Parameters:
Noise
Concept complexity: # of tree nodes
Result:
Bellwether trees & cubes have better accuracy than basic bellwether search
Increase noise increase error
Increase complexity increase error 15 nodes Noise level: 0.5
Efficiency Comparison : Efficiency Comparison Naïve computation
methods Our computation
techniques
Scalability : Scalability
Exploratory Mining:Prediction Cubeswith Beechun Chen, Lei Chen, and Yi LinIn VLDB 05; EDAM Project : Exploratory Mining: Prediction Cubes with Beechun Chen, Lei Chen, and Yi Lin In VLDB 05; EDAM Project
The Idea : The Idea Build OLAP data cubes in which cell values represent decision/prediction behavior
In effect, build a tree for each cell/region in the cube—observe that this is not the same as a collection of trees used in an ensemble method!
The idea is simple, but it leads to promising data mining tools
Ultimate objective: Exploratory analysis of the entire space of “data mining choices”
Choice of algorithms, data conditioning parameters …
Example (1/7): Regular OLAP : Example (1/7): Regular OLAP Goal: Look for patterns of unusually
high numbers of applications: Z: Dimensions Y: Measure
Example (2/7): Regular OLAP : Example (2/7): Regular OLAP Goal: Look for patterns of unusually
high numbers of applications: Z: Dimensions Y: Measure Finer regions
Example (3/7): Decision Analysis : Example (3/7): Decision Analysis Goal: Analyze a bank’s loan decision process w.r.t. two dimensions: Location and Time Z: Dimensions X: Predictors Y: Class Fact table D
Example (3/7): Decision Analysis : Example (3/7): Decision Analysis Are there branches (and time windows) where approvals were closely tied to sensitive attributes (e.g., race)?
Suppose you partitioned the training data by location and time, chose the partition for a given branch and time window, and built a classifier. You could then ask, “Are the predictions of this classifier closely correlated with race?”
Are there branches and times with decision making reminiscent of 1950s Alabama?
Requires comparison of classifiers trained using different subsets of data.
Example (4/7): Prediction Cubes : Example (4/7): Prediction Cubes Build a model using data from USA in Dec., 1985
Evaluate that model Measure in a cell:
Accuracy of the model
Predictiveness of Race
measured based on that
model
Similarity between that
model and a given model
Example (5/7): Model-Similarity : Example (5/7): Model-Similarity Given:
- Data table D
- Target model h0(X)
- Test set w/o labels The loan decision process in USA during Dec 04
was similar to a discriminatory decision model
Example (6/7): Predictiveness : Example (6/7): Predictiveness Given:
- Data table D
- Attributes V
- Test set w/o labels Data table D Test set Level: [Country, Month] Predictiveness of V Race was an important predictor of loan approval decision in USA during Dec 04 Build models h(X) h(XV) Yes
No
.
.
No Yes
No
.
.
Yes
Example (7/7): Prediction Cube : Example (7/7): Prediction Cube Cell value: Predictiveness of Race
Efficient Computation : Efficient Computation Reduce prediction cube computation to data cube computation
Represent a data-mining model as a distributive or algebraic (bottom-up computable) aggregate function, so that data-cube techniques can be directly applied
Bottom-Up Data Cube Computation : Bottom-Up Data Cube Computation Cell Values: Numbers of loan applications
Functions on Sets : Functions on Sets Bottom-up computable functions: Functions that can be computed using only summary information
Distributive function: (X) = F({(X1), …, (Xn)})
X = X1 … Xn and Xi Xj =
E.g., Count(X) = Sum({Count(X1), …, Count(Xn)})
Algebraic function: (X) = F({G(X1), …, G(Xn)})
G(Xi) returns a length-fixed vector of values
E.g., Avg(X) = F({G(X1), …, G(Xn)})
G(Xi) = [Sum(Xi), Count(Xi)]
F({[s1, c1], …, [sn, cn]}) = Sum({si}) / Sum({ci})
Scoring Function : Scoring Function Represent a model as a function of sets
Conceptually, a machine-learning model h(X; Z(D)) is a scoring function Score(y, x; Z(D)) that gives each class y a score on test example x
h(x; Z(D)) = argmax y Score(y, x; Z(D))
Score(y, x; Z(D)) p(y | x, Z(D))
Z(D): The set of training examples (a cube subset of D)
Bottom-up Score Computation : Bottom-up Score Computation Key observations:
Observation 1: Score(y, x; Z(D)) is a function of cube subset Z(D); if it is distributive or algebraic, bottom-up data cube computation techniques can be directly applied
Observation 2: Having the scores for all the test examples and all the cells is sufficient to compute a prediction cube
Scores predictions cell values
Details depend on what each cell means (i.e., type of prediction cubes); but straightforward
Machine-Learning Models : Machine-Learning Models Naïve Bayes:
Scoring function: algebraic
Kernel-density-based classifier:
Scoring function: distributive
Decision tree, random forest:
Neither distributive, nor algebraic
PBE: Probability-based ensemble (new)
To make any machine-learning model distributive
Approximation
Probability-Based Ensemble : Probability-Based Ensemble Decision trees built on the lowest-level cells Decision tree on [WA, 85] PBE version of decision tree on [WA, 85]
Probability-Based Ensemble : Probability-Based Ensemble Scoring function:
h(y | x; bi(D)): Model h’s estimation of p(y | x, bi(D))
g(bi | x): A model that predicts the probability that x belongs to base subset bi(D)
Outline : Outline Motivating example
Definition of prediction cubes
Efficient prediction cube materialization
Experimental results
Conclusion
Experiments : Experiments Quality of PBE on 8 UCI datasets
The quality of the PBE version of a model is slightly worse (0 ~ 6%) than the quality of the model trained directly on the whole training data.
Efficiency of the bottom-up score computation technique
Case study on demographic data vs. PBE
Efficiency of Bottom-up Score Computation : Efficiency of Bottom-up Score Computation Machine-learning models:
J48: J48 decision tree
RF: Random forest
NB: Naïve Bayes
KDC: Kernel-density-based classifier
Bottom-up method vs. Exhaustive method
PBE-J48
PBE-RF
NB
KDC J48ex
RFex
NBex
KDCex
Synthetic Dataset : Synthetic Dataset Dimensions: Z1, Z2 and Z3.
Decision rule: Z1 and Z2 Z3
Efficiency Comparison : Efficiency Comparison Using exhaustive
method Using bottom-up
score computation # of Records Execution Time (sec)
Conclusions : Conclusions
Related Work: Building models on OLAP Results : Related Work: Building models on OLAP Results Multi-dimensional regression [Chen, VLDB 02]
Goal: Detect changes of trends
Build linear regression models for cube cells
Step-by-step regression in stream cubes [Liu, PAKDD 03]
Loglinear-based quasi cubes [Barbara, J. IIS 01]
Use loglinear model to approximately compress dense regions of a data cube
NetCube [Margaritis, VLDB 01]
Build Bayes Net on the entire dataset of approximate answer count queries
Related Work (Contd.) : Related Work (Contd.) Cubegrades [Imielinski, J. DMKD 02]
Extend cubes with ideas from association rules
How does the measure change when we rollup or drill down?
Constrained gradients [Dong, VLDB 01]
Find pairs of similar cell characteristics associated with big changes in measure
User-cognizant multidimensional analysis [Sarawagi, VLDBJ 01]
Help users find the most informative unvisited regions in a data cube using max entropy principle
Multi-Structural DBs [Fagin et al., PODS 05, VLDB 05]
Take-Home Messages : Take-Home Messages Promising exploratory data analysis paradigm:
Can use models to identify interesting subsets
Concentrate only on subsets in cube space
Those are meaningful subsets, tractable
Precompute results and provide the users with an interactive tool
A simple way to plug “something” into cube-style analysis:
Try to describe/approximate “something” by a distributive or algebraic function
Big Picture : Big Picture Why stop with decision behavior? Can apply to other kinds of analyses too
Why stop at browsing? Can mine prediction cubes in their own right
Exploratory analysis of mining space:
Dimension attributes can be parameters related to algorithm, data conditioning, etc.
Tractable evaluation is a challenge:
Large number of “dimensions”, real-valued dimension attributes, difficulties in compositional evaluation
Active learning for experiment design, extending compositional methods