logging in or signing up shen 1 Mahugani Download Post to : URL : Related Presentations : Share Add to Flag Embed Email Send to Blogs and Networks Add to Channel Uploaded from authorPOINTLite Insert YouTube videos in PowerPont slides with aS Desktop Copy embed code: (To copy code, click on the text box) Embed: URL: Thumbnail: WordPress Embed Customize Embed The presentation is successfully added In Your Favorites. Views: 41 Category: Entertainment License: All Rights Reserved Like it (0) Dislike it (0) Added: October 12, 2007 This Presentation is Public Favorites: 0 Presentation Description No description available. Comments Posting comment... Premium member Presentation Transcript A Maximum Entropy-based Model for Answer Extraction: A Maximum Entropy-based Model for Answer Extraction Dan Shen IGK, Saarland University Supervisors: Prof. Dietrich Klakow Dr. ir. Geert-Jan M. KruijffPart I -- Introduction: Part I -- Introduction Answer Extraction Module in QA Statistical Method for Answer Extraction Motivation FrameworkAnswer Extraction Module in QA: Answer Extraction Module in QA Open-Domain factoid Question Answering Basic modules Information Retrieval Module a set of relevant sentences / paragraphs Answer Extraction (AE) Module the appropriate answer phrase Q: What is the capital of Japan ? A: Tokyo Q: How far is it from Earth to Mars ? A: 249 million miles Techniques and Resources for AE: Techniques and Resources for AE How to incorporate them ? Pipeline structure Mathematical frameworkMotivation – Use Statistical Methods ?: Motivation – Use Statistical Methods ? Flexibility Integrating various techniques / resources Easy to extend to span more in the future Effectiveness Research Issues: Research Issues Answer Candidate Selection Which constituent is regarded as an AC ? Methods classification / ranking / … Features Part II – ME-based model: Part II – ME-based model Method Features Experiments and ResultsPart II – ME-based model: Part II – ME-based model Method Features Experiments and ResultsMaximum Entropy Formulation I: Maximum Entropy Formulation I Given a set of answer candidates Model the probability Define Features Functions Decision Rule Maximum Entropy Formulation II: Maximum Entropy Formulation II Given a set of answer candidates Model the probability Define Features Functions Decision Rule Some Considerations: Some Considerations Model I Judge whether each candidate is a correct answer √ Can find more than one correct answer in a sentence ? Is the probability comparable ? × Suffer from the unbalanced data set (1Pos / >20Neg) Model II Find the best answer among the candidates × In a sentence, it just find one correct answer √ Directly make the probabilities of the candidates comparable Experiment Model II outperform Model I by about 5%Part II – ME-based model: Part II – ME-based model Method Features Experiments and ResultsQuestion Analysis: Question Analysis Q: What US biochemists won the Nobel Prize in medicine in 1992 ? Question Word -- what Target Word – biochemist Subject Word -- Nobel Prize / medicine / 1992 Verb – win Q: What is the name of the highest mountain in Africa ? Question Word -- what Target Word -- mountain Subject Words -- highest / Africa Verb -- be PERSON LOCATIONAnswer Candidate Selection: Answer Candidate Selection Preprocessing Named Entity Recognition Parsing [Collins Parser] To dependency tree Answer Candidate Selection Base noun phrase Named entities Leaf nodes Answer Candidate Coverage 11876 / 14039 = 84.6 % Missing some sentences to consider all of the nodes ?Features – Syntactic / POS Tag Features: Features – Syntactic / POS Tag Features Observation For who / where Question, answers = Proper Noun For how / when Question, answers = CD Question Word × Syntactic tag / Pos tag QWord = “how” & SynTag = “CD” QWord = “who” & SynTag = “NNP” QWord = “when” & SynTag = “NNP” QWord = “when” & SynTag = “CD” …Features – Surface Word Features: Features – Surface Word Features Word formations Length / Capitalized / Digits, … Question Word × Word formations QWord = “who” & word is capitalized QWord = “who” & word length < 3 Words co-occurrence between Q and A Observation -- Answer aren’t a subsequence of questionFeatures – Named Entity Features: Features – Named Entity Features Question Type × NE type QType = Person & NE type = Person QType = Date & NE type = Date QType = how much & NE type = Money … Useful for who, where, when … Question But for What / Which / How questions ? Many expected answer types not belong to a defined NE type Q1: What language is most commonly used in Bombay ? Q2: What city is … Q3: Which movie win ….Features – TWord Relation for WHAT I: Features – TWord Relation for WHAT I TWord is a hypernym of answer TWord is the head of answer Q: What is the name of the airport in Dallas Ft. Worth ? A: Wednesday morning , the low temperature at the Dallas-Fort Worth International Airport was 81 degrees . Q: What city is Disneyland in ? A: Not bad for a struggling actor who was working at Tokyo Disneyland just a few years ago . Features – TWord Relation for WHAT II: Features – TWord Relation for WHAT II TWord is the Appositive of answer Feature Function QWord = what & TWord is hypernym of answer candidate … Q: What book did Rachel Carson write in 1962 ? A1: In her 1962 book Silent Spring , Rachel Carson , a marine biologist , chronicled DDT 's poisonous effects , …. A2: In 1962 , former U.S. Fish and Wildlife Service biologist Rachel Carson shocked the nation with her landmark book , Silent Spring . Features – Tword Relation for HOW: Features – Tword Relation for HOW How many / much + NN … How long / far / tall / fast … How long … year / day / month / … How tall … feet / inch / mile / … How fast … per day / per hour / … Use some trigger word features Q: How many time zones are there in the world ? A: The world is divided into 24 time zones . Features – Subject Word Relations I: Features – Subject Word Relations I Q: Who invented the paper clip ? S1: The paper clip , weighing a desk-crushing 1320 pounds , is a faithful copy of Norwegian Johan Vaaler ‘s 1899 invention, said … S2: “ Like the guy who invented the safety pin , or the guy who invented the paper clip “ , David says . ×Features – Subject Word Relations II: Features – Subject Word Relations II Match subject word in the answer sentence Minimal Edit Distance Dependency Relationship Matching Observation – answer are close to SWord in Dependency Tree answer and SWord have some relation Answer candidate is a subject word Answer candidate is the parent / child / brother of SWord The path from the answer candidate to SWord Q: What is the name of the airport in Dallas Ft. Worth ? A: Wednesday morning , the low temperature at the Dallas-Fort Worth International Airport was 81 degreesPart II – ME-based model: Part II – ME-based model Method Features Experiments and Results Experiment Settings: Experiment Settings Training Data TREC 1999, TREC 2000, TREC 2002 Total Number of Questions: 1108 Total Number of Sentences: 11331 Test Data TREC 2003 Total Number of Questions: 362 (remove NIL question) Total Number of Sentences: 2708Question Word Distribution: Question Word DistributionOverall Performance: Overall Performance MRR – Mean Reciprocal Rank return five answers for each question Contribution of Different Features: Contribution of Different FeaturesFeatures – Syntactic / POS Tag Features: Features – Syntactic / POS Tag FeaturesFeatures – + Surface Word Features: Features – + Surface Word FeaturesFeatures – + Named Entity Features: Features – + Named Entity FeaturesFeatures – + TWord Relations for WHAT: Features – + TWord Relations for WHATFeatures – + TWord Relations for HOW: Features – + TWord Relations for HOWFeatures – + Subject Word Relations: Features – + Subject Word RelationsError Analysis – I: Error Analysis – I Target Word Concept Unresolved Q: What is the traditional dish served at Wimbledon? √A: And she said she wasn't wild about Wimbledon 's famed strawberries and cream . ×A: And she said she wasn't wild about Wimbledon 's famed strawberries and cream . Choosing the Wrong Entity Q: What actress has received the most Oscar nominations? √A: Oscar perennial Meryl Streep is up for best actress for the film , tying Katharine Hepburn for most acting nominations with 12 . ×A: Oscar perennial Meryl Streep is up for best actress for the film , tying Katharine Hepburn for most acting nominations with 12 . Error Analysis – II: Error Analysis – II Answer Candidate Granularity Q: What city is Disneyland in? √A: Not bad for a struggling actor who was working at Tokyo Disneyland just a few years ago . ×A: Not bad for a struggling actor who was working at Tokyo Disneyland just a few years ago . Repeated Target Word in Answer Q: How many grams in an ounce? √A: NOTE : 30 grams is about 1 ounce . ×A: NOTE : 30 grams is about 1 ounce . Misc.Future Work: Future Work Extract answer from Web Evaluate on other data sets Knowledge Master Corpus How to deal with NIL question ? Incorporate more linguistic-motivated features The End: The End You do not have the permission to view this presentation. In order to view it, please contact the author of the presentation.
shen 1 Mahugani Download Post to : URL : Related Presentations : Share Add to Flag Embed Email Send to Blogs and Networks Add to Channel Uploaded from authorPOINTLite Insert YouTube videos in PowerPont slides with aS Desktop Copy embed code: (To copy code, click on the text box) Embed: URL: Thumbnail: WordPress Embed Customize Embed The presentation is successfully added In Your Favorites. Views: 41 Category: Entertainment License: All Rights Reserved Like it (0) Dislike it (0) Added: October 12, 2007 This Presentation is Public Favorites: 0 Presentation Description No description available. Comments Posting comment... Premium member Presentation Transcript A Maximum Entropy-based Model for Answer Extraction: A Maximum Entropy-based Model for Answer Extraction Dan Shen IGK, Saarland University Supervisors: Prof. Dietrich Klakow Dr. ir. Geert-Jan M. KruijffPart I -- Introduction: Part I -- Introduction Answer Extraction Module in QA Statistical Method for Answer Extraction Motivation FrameworkAnswer Extraction Module in QA: Answer Extraction Module in QA Open-Domain factoid Question Answering Basic modules Information Retrieval Module a set of relevant sentences / paragraphs Answer Extraction (AE) Module the appropriate answer phrase Q: What is the capital of Japan ? A: Tokyo Q: How far is it from Earth to Mars ? A: 249 million miles Techniques and Resources for AE: Techniques and Resources for AE How to incorporate them ? Pipeline structure Mathematical frameworkMotivation – Use Statistical Methods ?: Motivation – Use Statistical Methods ? Flexibility Integrating various techniques / resources Easy to extend to span more in the future Effectiveness Research Issues: Research Issues Answer Candidate Selection Which constituent is regarded as an AC ? Methods classification / ranking / … Features Part II – ME-based model: Part II – ME-based model Method Features Experiments and ResultsPart II – ME-based model: Part II – ME-based model Method Features Experiments and ResultsMaximum Entropy Formulation I: Maximum Entropy Formulation I Given a set of answer candidates Model the probability Define Features Functions Decision Rule Maximum Entropy Formulation II: Maximum Entropy Formulation II Given a set of answer candidates Model the probability Define Features Functions Decision Rule Some Considerations: Some Considerations Model I Judge whether each candidate is a correct answer √ Can find more than one correct answer in a sentence ? Is the probability comparable ? × Suffer from the unbalanced data set (1Pos / >20Neg) Model II Find the best answer among the candidates × In a sentence, it just find one correct answer √ Directly make the probabilities of the candidates comparable Experiment Model II outperform Model I by about 5%Part II – ME-based model: Part II – ME-based model Method Features Experiments and ResultsQuestion Analysis: Question Analysis Q: What US biochemists won the Nobel Prize in medicine in 1992 ? Question Word -- what Target Word – biochemist Subject Word -- Nobel Prize / medicine / 1992 Verb – win Q: What is the name of the highest mountain in Africa ? Question Word -- what Target Word -- mountain Subject Words -- highest / Africa Verb -- be PERSON LOCATIONAnswer Candidate Selection: Answer Candidate Selection Preprocessing Named Entity Recognition Parsing [Collins Parser] To dependency tree Answer Candidate Selection Base noun phrase Named entities Leaf nodes Answer Candidate Coverage 11876 / 14039 = 84.6 % Missing some sentences to consider all of the nodes ?Features – Syntactic / POS Tag Features: Features – Syntactic / POS Tag Features Observation For who / where Question, answers = Proper Noun For how / when Question, answers = CD Question Word × Syntactic tag / Pos tag QWord = “how” & SynTag = “CD” QWord = “who” & SynTag = “NNP” QWord = “when” & SynTag = “NNP” QWord = “when” & SynTag = “CD” …Features – Surface Word Features: Features – Surface Word Features Word formations Length / Capitalized / Digits, … Question Word × Word formations QWord = “who” & word is capitalized QWord = “who” & word length < 3 Words co-occurrence between Q and A Observation -- Answer aren’t a subsequence of questionFeatures – Named Entity Features: Features – Named Entity Features Question Type × NE type QType = Person & NE type = Person QType = Date & NE type = Date QType = how much & NE type = Money … Useful for who, where, when … Question But for What / Which / How questions ? Many expected answer types not belong to a defined NE type Q1: What language is most commonly used in Bombay ? Q2: What city is … Q3: Which movie win ….Features – TWord Relation for WHAT I: Features – TWord Relation for WHAT I TWord is a hypernym of answer TWord is the head of answer Q: What is the name of the airport in Dallas Ft. Worth ? A: Wednesday morning , the low temperature at the Dallas-Fort Worth International Airport was 81 degrees . Q: What city is Disneyland in ? A: Not bad for a struggling actor who was working at Tokyo Disneyland just a few years ago . Features – TWord Relation for WHAT II: Features – TWord Relation for WHAT II TWord is the Appositive of answer Feature Function QWord = what & TWord is hypernym of answer candidate … Q: What book did Rachel Carson write in 1962 ? A1: In her 1962 book Silent Spring , Rachel Carson , a marine biologist , chronicled DDT 's poisonous effects , …. A2: In 1962 , former U.S. Fish and Wildlife Service biologist Rachel Carson shocked the nation with her landmark book , Silent Spring . Features – Tword Relation for HOW: Features – Tword Relation for HOW How many / much + NN … How long / far / tall / fast … How long … year / day / month / … How tall … feet / inch / mile / … How fast … per day / per hour / … Use some trigger word features Q: How many time zones are there in the world ? A: The world is divided into 24 time zones . Features – Subject Word Relations I: Features – Subject Word Relations I Q: Who invented the paper clip ? S1: The paper clip , weighing a desk-crushing 1320 pounds , is a faithful copy of Norwegian Johan Vaaler ‘s 1899 invention, said … S2: “ Like the guy who invented the safety pin , or the guy who invented the paper clip “ , David says . ×Features – Subject Word Relations II: Features – Subject Word Relations II Match subject word in the answer sentence Minimal Edit Distance Dependency Relationship Matching Observation – answer are close to SWord in Dependency Tree answer and SWord have some relation Answer candidate is a subject word Answer candidate is the parent / child / brother of SWord The path from the answer candidate to SWord Q: What is the name of the airport in Dallas Ft. Worth ? A: Wednesday morning , the low temperature at the Dallas-Fort Worth International Airport was 81 degreesPart II – ME-based model: Part II – ME-based model Method Features Experiments and Results Experiment Settings: Experiment Settings Training Data TREC 1999, TREC 2000, TREC 2002 Total Number of Questions: 1108 Total Number of Sentences: 11331 Test Data TREC 2003 Total Number of Questions: 362 (remove NIL question) Total Number of Sentences: 2708Question Word Distribution: Question Word DistributionOverall Performance: Overall Performance MRR – Mean Reciprocal Rank return five answers for each question Contribution of Different Features: Contribution of Different FeaturesFeatures – Syntactic / POS Tag Features: Features – Syntactic / POS Tag FeaturesFeatures – + Surface Word Features: Features – + Surface Word FeaturesFeatures – + Named Entity Features: Features – + Named Entity FeaturesFeatures – + TWord Relations for WHAT: Features – + TWord Relations for WHATFeatures – + TWord Relations for HOW: Features – + TWord Relations for HOWFeatures – + Subject Word Relations: Features – + Subject Word RelationsError Analysis – I: Error Analysis – I Target Word Concept Unresolved Q: What is the traditional dish served at Wimbledon? √A: And she said she wasn't wild about Wimbledon 's famed strawberries and cream . ×A: And she said she wasn't wild about Wimbledon 's famed strawberries and cream . Choosing the Wrong Entity Q: What actress has received the most Oscar nominations? √A: Oscar perennial Meryl Streep is up for best actress for the film , tying Katharine Hepburn for most acting nominations with 12 . ×A: Oscar perennial Meryl Streep is up for best actress for the film , tying Katharine Hepburn for most acting nominations with 12 . Error Analysis – II: Error Analysis – II Answer Candidate Granularity Q: What city is Disneyland in? √A: Not bad for a struggling actor who was working at Tokyo Disneyland just a few years ago . ×A: Not bad for a struggling actor who was working at Tokyo Disneyland just a few years ago . Repeated Target Word in Answer Q: How many grams in an ounce? √A: NOTE : 30 grams is about 1 ounce . ×A: NOTE : 30 grams is about 1 ounce . Misc.Future Work: Future Work Extract answer from Web Evaluate on other data sets Knowledge Master Corpus How to deal with NIL question ? Incorporate more linguistic-motivated features The End: The End