CBSE | DEPARTMENT OF SKILL EDUCATION
ARTIFICIAL INTELLIGENCE (SUBJECT CODE 417)
CLASS X (SESSION 2021-2022)
SAMPLE QUESTION PAPER FOR TERM - II

Max. Time Allowed: 1 Hour (60 min)
Max. Marks: 25

General Instructions:
1. Please read the instructions carefully.
2. This Question Paper is divided into 03 sections, viz., Section A, Section B and Section C.
3. Section A is of 05 marks and has 06 questions on Employability Skills.
   a) Question numbers 1 to 4 are one-mark questions. Attempt any three questions.
   b) Question numbers 5 and 6 are two-mark questions. Attempt any one question.
4. Section B is of 12 marks and has 12 questions on Subject Specific Skills.
   a) Question numbers 7 to 12 are one-mark questions. Attempt any four questions.
   b) Question numbers 13 to 18 are two-mark questions. Attempt any four questions.
5. Section C is of 08 marks and has 03 competency-based questions.
   a) Question numbers 19 to 21 are four-mark questions. Attempt any two questions.
6. Do as per the instructions given in the respective sections.
7. Marks allotted are mentioned against each section/question.

SECTION A (3 + 2 = 5 marks)

Answer any 3 questions out of the given 4 questions. Each question is of 1 mark. (1 x 3 = 3)

Q.1 Write any two qualities of a good entrepreneur. [1 mark]
Ans:
Any two of the following points:
• They are confident. They believe in themselves and their abilities.
• They keep trying new ideas in their business.
• They are patient.
• They are creative and think differently about business ideas.
• They take responsibility for their actions.
• They make decisions after thinking about them.
• They work hard.
• They do not give up when they face a difficulty.
(½ mark for each point; ½ x 2 = 1)

Q.2 What is sustainable development? [1 mark]
Ans:
Sustainable development is development that satisfies the needs of the present without compromising the capacity of future generations, guaranteeing a balance between economic growth, care for the environment and social well-being.
(1 mark for correct answer/explanation)

Q.3 Entrepreneurship has a positive impact on society. Write down any two such impacts. [1 mark]
Ans:
1. Some entrepreneurs work towards saving the environment.
2. Some entrepreneurs give money to build schools and hospitals.
(½ mark for each point; ½ x 2 = 1)

Q.4 How many sustainable development goals were formulated by the United Nations? [1 mark]
Ans:
There are 17 sustainable development goals formulated by the United Nations.
(1 mark for correct answer)

417-X-MS-Term II (2021-2022)
Answer any 1 question out of the given 2 questions. Each question is of 2 marks. (2 x 1 = 2)

Q.5 “Entrepreneurs are born, not made.” Do you agree with this statement? Justify your answer. [2 marks]
Ans:
No, this is a myth/misconception about entrepreneurship.
Being an entrepreneur starts with a way of thinking. One must believe that anything is possible and that it can be achieved. It starts with thinking of an idea that you want to work on and making it different.
(1 mark for the option (No); 1 mark for correct explanation)

Q.6 Enlist any 2 SDGs which are formulated to address problems related to water. [2 marks]
Ans:
• Clean water and sanitation
• Life below water
• Responsible consumption and production
(any 2 SDGs related to water; 1 mark for each SDG)

SECTION B (4 + 8 = 12 marks)

Answer any 04 questions out of the given 06 questions. (1 x 4 = 4)

Q.7 What will be the output for the word “studies” if we do the following: [1 mark]
a. Lemmatization
b. Stemming
Ans:
The output of the word after lemmatization will be study.
The output of the word after stemming will be studi.
(½ mark for lemmatization, ½ mark for stemming)

Q.8 How many tokens are there in the sentence given below? [1 mark]
Traffic Jams have become a common part of our lives nowadays. Living in an urban area means you have to face traffic each and every time you get out on the road. Mostly, school students opt for buses to go to school.
Ans:
There are 46 tokens in the given sentence (42 words and 4 punctuation marks).
(1 mark for correct answer)

Q.9 What is a corpus? [1 mark]
Ans:
The whole textual data from all the documents taken together is known as a corpus.
(1 mark for any correct explanation)

Q.10 Identify any 2 stopwords in the given sentence: [1 mark]
Pollution is the introduction of contaminants into the natural environment that cause adverse change. The three types of pollution are air pollution, water pollution and land pollution.
Ans:
Stopwords in the given sentence are: is, the, of, that, into, are, and
(any two correct answers; ½ mark each)

Q.11 Why should we avoid using the training data for evaluation? [1 mark]
Ans:
This is because our model will simply remember the whole training set, and will therefore always predict the correct label for any point in the training set.
(1 mark for any correct explanation)

Q.12 What should be the value of the F1 score if the model needs to have 100% accuracy? [1 mark]
Ans:
The model will have an F1 score of 1 if it has to be 100% accurate.
(1 mark for correct answer)
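The token count in Q.8 can be checked with a short script. This is only a minimal sketch: it assumes that every word and every punctuation mark counts as one token, and the `tokenize` helper is an illustrative name, not part of any prescribed library.

```python
import re

# The passage from Q.8.
text = (
    "Traffic Jams have become a common part of our lives nowadays. "
    "Living in an urban area means you have to face traffic each and "
    "every time you get out on the road. "
    "Mostly, school students opt for buses to go to school."
)


def tokenize(sentence):
    """Split text into tokens: runs of word characters or single punctuation marks."""
    return re.findall(r"\w+|[^\w\s]", sentence)


tokens = tokenize(text)
words = [t for t in tokens if t[0].isalnum()]
punctuation = [t for t in tokens if not t[0].isalnum()]

# Total tokens, then the split into words and punctuation marks.
print(len(tokens), len(words), len(punctuation))
```

Under a different tokenization rule (for example, ignoring punctuation) the count would differ, so the rule being used should be stated alongside the answer.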
Answer any 04 questions out of the given 06 questions. (2 x 4 = 8)

Q.13 “Automatic summarization is used in NLP applications”. Is the given statement correct? Justify your answer with an example. [2 marks]
Ans:
Yes, the given statement is correct. Automatic summarization is relevant not only for summarizing the meaning of documents and information, but also for understanding the emotional meanings within the information, such as when collecting data from social media. Automatic summarization is especially relevant when used to provide an overview of a news item or blog post, while avoiding redundancy from multiple sources and maximizing the diversity of content obtained.
(1 mark for explanation, 1 mark for example)

Q.14 Give an example of a situation wherein a false positive would have a high cost associated with it. [2 marks]
Ans:
Let us consider a model that predicts whether a mail is spam or not. If the model always predicts that the mail is spam, people would not look at it and might eventually lose important information. Here the False Positive condition (predicting the mail as spam while the mail is not spam) would have a high cost.
(2 marks for any correct example with explanation; 1 mark can be given if only explanation is written without example)

Q.15 Write any two applications of TFIDF. [2 marks]
Ans:
1. Document Classification: helps in classifying the type and genre of a document.
2. Topic Modelling: helps in predicting the topic for a corpus.
3. Information Retrieval System: extracts the important information out of a corpus.
4. Stop word filtering: helps in removing the unnecessary words from a text body.
(1 mark for each application name/explanation)

Q.16 Write down the steps to implement the bag of words algorithm. [2 marks]
Ans:
The steps to implement the bag of words algorithm are as follows:
1. Text Normalisation: Collect data and pre-process it.
2. Create Dictionary: Make a list of all the unique words occurring in the corpus (Vocabulary).
3. Create document vectors: For each document in the corpus, find out how many times each word from the unique list of words has occurred.
4. Create document vectors for all the documents.
(½ mark for each step)

Q.17 What is a confusion matrix? What is it used for? [2 marks]
Ans:
The confusion matrix is used to store the results of the comparison between the prediction and reality. From the confusion matrix, we can calculate parameters like recall, precision and F1 score, which are used to evaluate the performance of an AI model.
(1 mark for definition, 1 mark for use)
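The bag of words steps listed in Q.16 can be sketched in a few lines of Python. This is an illustrative sketch only: the sample corpus is the four documents from Q.19, normalisation is assumed to mean lowercasing and dropping punctuation, and the helper names (`normalise`, `build_dictionary`, `document_vector`) are made up for this example.

```python
import re
from collections import Counter

# Sample corpus: the four documents used in Q.19.
corpus = [
    "Johny Johny, Yes Papa,",
    "Eating sugar? No Papa",
    "Telling lies? No Papa",
    "Open your mouth, Ha! Ha! Ha!",
]


def normalise(document):
    """Step 1: lowercase the text and keep only the words."""
    return re.findall(r"[a-z]+", document.lower())


def build_dictionary(tokenised_docs):
    """Step 2: list every unique word, in order of first appearance."""
    vocabulary = []
    for tokens in tokenised_docs:
        for token in tokens:
            if token not in vocabulary:
                vocabulary.append(token)
    return vocabulary


def document_vector(tokens, vocabulary):
    """Step 3: count how often each vocabulary word occurs in one document."""
    counts = Counter(tokens)
    return [counts[word] for word in vocabulary]


tokenised = [normalise(doc) for doc in corpus]
vocabulary = build_dictionary(tokenised)

# Step 4: repeat step 3 for every document in the corpus.
vectors = [document_vector(tokens, vocabulary) for tokens in tokenised]

print(vocabulary)
for vector in vectors:
    print(vector)
```

Each printed vector corresponds to one row of a term frequency table for the corpus.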
Q.18 Explain, from the given graph, how the value and occurrence of a word are related in a corpus. [2 marks]
Ans:
As shown in the graph, the occurrence and value of a word are inversely proportional. The words which occur most (like stop words) have negligible value. As the occurrence of words drops, the value of such words rises. These words are termed rare or valuable words. They occur the least but add the most value to the corpus.
(complete explanation 2 marks)

SECTION C (2 x 4 = 8 marks)
(COMPETENCY-BASED QUESTIONS)

Answer any 02 questions out of the given 03 questions.

Q.19 Through a step-by-step process, calculate TFIDF for the given corpus. [4 marks]
Document 1: Johny Johny, Yes Papa,
Document 2: Eating sugar? No Papa
Document 3: Telling lies? No Papa
Document 4: Open your mouth, Ha! Ha! Ha!
Ans:
1. Create document vectors for the given documents (Term Frequency Table).

| | Johny | Yes | Papa | Eating | Sugar | No | Telling | Lies | Open | your | Mouth | Ha |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Document 1 | 2 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Document 2 | 0 | 0 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 |
| Document 3 | 0 | 0 | 1 | 0 | 0 | 1 | 1 | 1 | 0 | 0 | 0 | 0 |
| Document 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 3 |

2. Record the number of documents in which each word occurs, using the term frequency table (Document Frequency Table).

| Johny | Yes | Papa | Eating | Sugar | No | Telling | Lies | Open | your | Mouth | Ha |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 1 | 3 | 1 | 1 | 2 | 1 | 1 | 1 | 1 | 1 | 1 |

3. Draw the inverse document frequency table, wherein we put the document frequency in the denominator and the total number of documents in the numerator. Here, the total number of documents is 4, hence the inverse document frequency becomes:
| Johny | Yes | Papa | Eating | Sugar | No | Telling | Lies | Open | your | Mouth | Ha |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 4/1 | 4/1 | 4/3 | 4/1 | 4/1 | 4/2 | 4/1 | 4/1 | 4/1 | 4/1 | 4/1 | 4/1 |

4. The formula of TFIDF for any word W becomes: TFIDF(W) = TF(W) * log(IDF(W))

| | Johny | Yes | Papa | Eating | Sugar | No | Telling | Lies | Open | your | Mouth | Ha |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Document 1 | 2*log(4/1) | 1*log(4/1) | 1*log(4/3) | 0*log(4/1) | 0*log(4/1) | 0*log(4/2) | 0*log(4/1) | 0*log(4/1) | 0*log(4/1) | 0*log(4/1) | 0*log(4/1) | 0*log(4/1) |
| Document 2 | 0*log(4/1) | 0*log(4/1) | 1*log(4/3) | 1*log(4/1) | 1*log(4/1) | 1*log(4/2) | 0*log(4/1) | 0*log(4/1) | 0*log(4/1) | 0*log(4/1) | 0*log(4/1) | 0*log(4/1) |
| Document 3 | 0*log(4/1) | 0*log(4/1) | 1*log(4/3) | 0*log(4/1) | 0*log(4/1) | 1*log(4/2) | 1*log(4/1) | 1*log(4/1) | 0*log(4/1) | 0*log(4/1) | 0*log(4/1) | 0*log(4/1) |
| Document 4 | 0*log(4/1) | 0*log(4/1) | 0*log(4/3) | 0*log(4/1) | 0*log(4/1) | 0*log(4/2) | 0*log(4/1) | 0*log(4/1) | 1*log(4/1) | 1*log(4/1) | 1*log(4/1) | 3*log(4/1) |

(1 mark for each correct table)

Q.20 The world is competitive nowadays. People face competition in even the tiniest tasks and are expected to give their best at every point in time. When people are unable to meet these expectations, they get stressed and could even go into depression. We get to hear a lot of cases where people are depressed due to reasons like peer pressure, studies, family issues, relationships, etc. and they eventually get into something that is bad for them as well as for others. So, to overcome this, Cognitive Behavioural Therapy (CBT) is considered to be one of the best methods to address stress as it is easy to implement on people and also gives good results. This therapy includes understanding the behaviour and mindset of a person in their normal life. With the help of CBT, therapists help people overcome their stress and live a happy life. [4 marks]
For the situation given above,
1. Write the problem statement template.
2. List any two sources from which data can be collected.
3. How do we explore the data?
Ans:
1. The problem statement template for the given scenario would be:

| Our | people undergoing stress | Who? |
| have a problem that | they are not able to share their feelings | What? |
| while | they need help to vent out their emotions | Where? |
| An ideal solution would be | to provide a platform to share their thoughts anonymously and suggest help whenever required | Why? |

2. Data can be collected from any of the following sources:
   a. surveys
   b. observing therapists' sessions
   c. databases available on the internet
   d. interviews
3. Once the textual data has been collected, it needs to be processed and cleaned so that a simpler version can be sent to the machine. Thus, the text is normalised through various steps and reduced to a minimum vocabulary, since the machine does not require grammatically correct statements but only the essence of the text.
(2 marks for problem statement template; ½ mark for each data source; 1 mark for correct explanation of data exploration)
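Returning to Q.19, the four TFIDF steps can be reproduced programmatically. The sketch below makes some assumptions that the marking scheme does not state: the logarithm is taken as base 10, text is lowercased with punctuation dropped, and all variable names are illustrative.

```python
import math
import re
from collections import Counter

# The four documents from Q.19.
corpus = [
    "Johny Johny, Yes Papa,",
    "Eating sugar? No Papa",
    "Telling lies? No Papa",
    "Open your mouth, Ha! Ha! Ha!",
]

# Lowercase each document and keep only the words.
tokenised = [re.findall(r"[a-z]+", doc.lower()) for doc in corpus]

# Vocabulary in order of first appearance (dict keys keep insertion order).
vocabulary = list(dict.fromkeys(word for doc in tokenised for word in doc))

# Step 1: term frequency table, one row per document.
tf = []
for doc in tokenised:
    counts = Counter(doc)
    tf.append([counts[word] for word in vocabulary])

# Step 2: document frequency, the number of documents containing each word.
df = [sum(1 for row in tf if row[i] > 0) for i in range(len(vocabulary))]

# Step 3: inverse document frequency ratio, total documents / document frequency.
n_docs = len(corpus)
idf = [n_docs / d for d in df]

# Step 4: TFIDF(W) = TF(W) * log(IDF(W)); base 10 is an assumption here.
tfidf = [
    [count * math.log10(ratio) for count, ratio in zip(row, idf)]
    for row in tf
]

for row in tfidf:
    print([round(value, 3) for value in row])
```

Words that occur in many documents, such as "papa", end up with a low score, while a word repeated within a single document, such as "ha", scores highest.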
Q.21 Take a look at the confusion matrix: [4 marks]

The Confusion Matrix

| | Reality: Yes | Reality: No |
|---|---|---|
| Prediction: Yes | True Positive (TP) | False Positive (FP) |
| Prediction: No | False Negative (FN) | True Negative (TN) |

How do you calculate F1 score?
Ans:
We begin the calculation by first using the formula for Precision. Precision is defined as the percentage of true positive cases out of all the cases where the prediction is positive. That is, it takes into account the True Positives and False Positives.

$$\text{Precision} = \frac{\text{True Positive}}{\text{All Predicted Positives}} \times 100\%$$

$$\text{Precision} = \frac{TP}{TP + FP} \times 100\%$$

Next, we calculate Recall as the fraction of positive cases that are correctly identified.

$$\text{Recall} = \frac{\text{True Positive}}{\text{True Positive} + \text{False Negative}}$$

$$\text{Recall} = \frac{TP}{TP + FN}$$

Finally, we calculate the F1 score as the measure of balance between precision and recall.

$$\text{F1 score} = 2 \times \frac{\text{Precision} \times \text{Recall}}{\text{Precision} + \text{Recall}}$$

(1 mark for precision formula; 1 mark for recall formula; 1 mark for F1 score formula; 1 mark for explanation)
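The three formulas from Q.21 can be tried out on numbers with a small sketch. The counts used here are hypothetical and chosen only for illustration. Precision and recall are kept as fractions between 0 and 1 rather than percentages, and returning 0.0 when a denominator is zero is an assumed convention, not something the marking scheme specifies.

```python
def precision(tp, fp):
    """Fraction of predicted positives that are really positive: TP / (TP + FP)."""
    return tp / (tp + fp) if (tp + fp) else 0.0


def recall(tp, fn):
    """Fraction of real positives that were predicted positive: TP / (TP + FN)."""
    return tp / (tp + fn) if (tp + fn) else 0.0


def f1_score(tp, fp, fn):
    """Balance between precision and recall: 2 * P * R / (P + R)."""
    p = precision(tp, fp)
    r = recall(tp, fn)
    return 2 * p * r / (p + r) if (p + r) else 0.0


# Hypothetical confusion-matrix counts, for illustration only.
tp, fp, fn, tn = 40, 10, 20, 30

print("Precision:", round(precision(tp, fp), 3))
print("Recall:", round(recall(tp, fn), 3))
print("F1 score:", round(f1_score(tp, fp, fn), 3))

# A model with no false positives and no false negatives has an F1 score of 1.
print("Perfect model F1:", f1_score(tp=50, fp=0, fn=0))
```

True Negatives do not appear in any of the three formulas, which is why `tn` is never passed to the functions.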