Chapter Six

  1. Web mining is the discovery of useful patterns on the Web.
  2. A grouping of characters into a word, a group of words, or a complete number is called a record.
  3. Every record in a file should contain at least one key field.
  4. Both Oracle and Microsoft Access use SQL to retrieve information from the database.
  5. DBMS simplify how end users work with databases by separating the logical and physical views of the data.
  6. Most back-end databases are able to interpret HTML commands.
  7. Hadoop can process large quantities of any type of data, including video and Twitter feeds.
  8. The two key services of Hadoop are HDFS and in-memory computing.
  9. A data warehouse may include information from legacy systems.
  10. A data warehouse is typically comprised of several smaller data marts.
  11. OLAP is a key tool of BI.
  12. OLAP is used to find hidden patterns and relationships in large databases, and infer rules from these to infer future behavior.
  13. A foreign key is a field that links to a separate table.
  14. In-memory computing relies on a computer's main memory for storing data.
  15. Implementing a Web interface for a database requires changes to the database itself.
  16. Common Gateway Interface is a specification for processing data on a Web server.
  17. In linking databases to the Web, the role of the application server is to host the DBMS.
  18. A firm's information policy lays out who is responsible for updating and maintaining the information in a database system.
  19. The use of different terms for identifying data in a firm's various information systems is an example of redundant data.
  20. Data cleansing is the same process as data scrubbing.
  21. MongoDB and SimpleDB are both examples of:
    NoSQL databases.
  22. An example of a pre-digital database is a:
    library's card-catalog.
  23. What is the first step you should take in managing data for a firm?
    Identify the data needed to run the business
  24. Which of the following best illustrates the relationship between entities and attributes?
    The entity CUSTOMER with the attribute ADDRESS
  25. The type of logical database model that treats data as if they were stored in two-dimensional tables is the:
    relational DBMS.
  26. What are the relationships that the relational database is named for?
    Relationships between entities
  27. A characteristic or quality describing an entity is called a(n):
  28. The most basic business database is comprised of:
    ) five tables: for customers, employees, suppliers, parts, and sales.
  29. In a table for customers, the information about a single customer would reside in a single:
  30. In a relational database, a record is also called a(n):
  31. A field identified in a table as holding the unique identifier of the table's records is called the:
    primary key.
  32. A field identified in a record as holding the unique identifier for that record is called the:
    key field.
  33. A schematic of the entire database that describes the relationships in a database is called a(n):
    entity-relationship diagram.
  34. A one-to-one relationship between two entities is symbolized in a diagram by a line that ends:
    in two short marks.
  35. A one-to-many relationship between two entities is symbolized in a diagram by a line that ends:
    with a crow's foot preceded by a short mark.
  36. A table that links two tables that have a many-to-many relationship is often called a(n):
    intersection relation.
  37. The process of streamlining data to minimize redundancy and awkward many-to-many relationships is called:
  38. A DBMS makes the:
    physical database available for different logical views.
  39. The logical view of a database:
    presents data as they would be perceived by end users.
  40. Which of the following is not a typical feature of DBMS?
    Query wizard tool
  41. Access is a:
    DBMS for desktop PC systems.
  42. In clustering, a data mining tool will:
    find new groupings within data.
  43. MySQL is a:
    popular open-source DBMS.
  44. Data mining is more ________ than OLAP.
    discovery driven
  45. In a relational database, the three basic operations used to develop useful sets of data are:
    select, project, and join.
  46. The select operation:
    creates a subset consisting of all records in the file that meet stated criteria.
  47. ) All of the following are tools or technologies for extracting information from unstructured data sets except:
    SQL queries.
  48. The project operation:
    creates a subset consisting of columns in a table.
  49. Which of the following best describes the importance of creating an accurate data model for your business's database?
    Critical, as without one, your data may be inaccurate, incomplete, or difficult to retrieve
  50. An automated or manual file that stores information about data elements and data characteristics such as usage, physical representation, ownership, authorization, and security is the:
    data dictionary.
  51. You are working in the IT department of a small paper supply company and planning a new database that monitors employee benefits and productivity. What would be the relationship you need to establish between Employee_ID in the Employee table and Parking_spot_number in the Perks table?
  52. The most prominent data manipulation language today is:
  53. DBMS typically include report-generating tools in order to:
    display data in an easier-to-read format.
  54. What is the purpose of a DBMS's data definition function?
    Structuring the database
  55. Which of the following database types would be best suited for storing multimedia?
    Non-relational DBMS
  56. Pre-configured hardware-software systems that use both relational and non-relational technology optimized for analyzing large datasets are referred to as:
    Analytic platforms.
  57. Which of the following statements about data warehouses is not true?
    Data warehouse systems provide easy-to-use tools for managers to easily update data.
  58. A data mart usually can be constructed more rapidly and at lower cost than a data warehouse because:
    it typically focuses on a single subject area or line of business.
  59. Tools for consolidating, analyzing, and providing access to vast amounts of data to help users make better business decisions are known as:
    business intelligence.
  60. The tool that enables users to view the same data in different ways using multiple dimensions is:
  61. OLAP is a tool for enabling:
    users to obtain online answers to ad hoc questions in a rapid amount of time.
  62. Data mining is a tool for allowing users to:
    find hidden relationships in data.
  63. In terms of the data relationships found by data mining, associations refers to:
    occurrences linked to a single event.
  64. In terms of the data relationships found by data mining, sequences refers to
    events linked over time.
  65. Which of the following would you use to find patterns in user interaction data recorded by Web servers?
    Web usage mining
  66. Which of the following would you use to find out which Web sites with content related to database design were the most often linked to by other Web sites?
    Web structure mining
  67. Within a corporate information services department, the task of creating the physical database and its logical relations are responsibilities of the ________ function.
    database administration
  68. Businesses use ________ tools to search and analyze unstructured data sets, such as e-mails and memos.
    text mining
  69. The confusion created by ________ makes it difficult for companies to create customer relationship management, supply chain management, or enterprise systems that integrate data from different sources.
    data redundancy
  70. Detecting and correcting data in a database or file that are incorrect, incomplete, improperly formatted, or redundant is called:
    data scrubbing.
  71. Relational database systems use ________ to ensure that relationships between coupled tables remain consistent.
    referential integrity rules
  72. A DBMS is special software for creating, storing, organizing, and accessing data from a database.
  73. A physical view shows data as it is organized on the storage media.
  74. End users use a data manipulation language to manipulate data in the database.
  75. Hadoop is a(n):
    open-source software framework designed for distributing data-processing over inexpensive computers.
  76. A data ________ stores current and historical data of potential interest to decision makers throughout the company.
  77. A(n) ________ is a dedicated computer in a client/server environment that hosts a DBMS.
    database server
  78. An information policy specifies the procedures and rules for defining the structure and content of corporate databases.
  79. Data administration is a special organizational function that manages the policies and procedures through which data can be managed as an organizational resource.
  80. A(n) ________ is a structured survey of the accuracy and level of completeness of the data in an information system.
    data quality audit
Card Set
Chapter Six