We consider the problem of maintaining a warehouse of sampled data that "Shadows" a full-scale data warehouse, in order to support quick approximate analytics and metadata discovery. The full-scale warehouse comprises many "Data sets," where a data set is a bag of values; the data sets can vary enormously in...
We were recently asked to evaluate a large data set for data value anomalies as part of an overall data quality assessment that we hoped would establish the business case for senior management's investment in a data quality program. The particular data set we were examining contained a table with...
This paper describes the construction of a panel data set from the U.S. patent data that contains measures of inventors' life-cycle R&D (Research & Development) productivity - patents and patent citations. It matches the data set to information on the U.S. pharmaceutical and semiconductor firms for whom they work. This...
With advances in x-ray microtomography, it is now possible to obtain three-dimensional representations of a material's microstructure with a voxel size of less than one micrometer. The Visible Cement Data Set represents a collection of 3-D data sets obtained using the European Synchrotron Radiation Facility in Grenoble, France in September...
A data warehouse consists of a set of materialized views defined over a number of data source, collects copies of data from remote, distributed, autonomous and heterogeneous data sources into a central repository to enable analysis and mining of the integrated information. Data Warehousing and On-Line Analytical Processing OLAP are...
Chip set simplifies node design and network maintenance for FDDI LAN The Fiber-Distributed-Data-Interface FDDI chip set offers you two ways to use fiber optics for data communications. You can use the set to create a LAN node compliant with the pr Chip set simplifies node...
This paper proposes a novel way of automatically developing data warehouse configuration in rule-based CRM systems. Rule-based CRM systems assume that marketing activities are represented as a set of IF-THEN rules. Currently, to provide good quality CRM functionalities, CRM systems seek to combine conventional CRM methodologies with data warehousing technology....
Microsoft Excel data validation lets you define what type of data you want entered in a cell. This white paper describes how to set up data validation, including the types of data you can validate and the messages you can display, and provides a workbook that you can download to...
A set of engineering quality audits showed a consistent level of error in engineering data tested across aerospace, automotive, consumer products and electronics industries. Each audit took approximately two weeks and included a review of multiple sets of engineering data from organizations with as few as...
The paper conducts a research to develop a music information retrieval system which retrieves music based on user preferences. In order to conduct evaluation experiments, it is necessary to accumulate an experiment set of music data, and collect user ratings of each music data included in this data set. Two...
Having an enormous amount of information at your disposal to help you make decisions is a wonderful experience. Yet, people often complain of the overload problem that comes from having too much data. Some researchers believe a partial solution may be found in metadata--essentially data about data that describe...
Many users at remote locations can work on the same set of data. NASA's jet Propulsion Laboratory, Pasadena, California MECS is a computer program for the automated, secure, rapid, and efficient transfer of data between a central source and users at multiple distant locations. "MECS" signifies "Multi-mission Encrypted Communication System."...
In product design CAD data quality plays a major role in achieving the target date in the limited time for product life cycle development. Ensuring the quality of CAD data is necessary for data to be used within the product development process chain. A clean set of CAD data saves...
The recent terrorist attacks in London have prompted EU ministers to give greater momentum to proposed legislation on data retention. Meeting on July 13 at an extraordinary meeting in response to the London bombs, EU Justice and Home Affairs Ministers set a deadline of October 1 ...
MINNEAPOLIS -- ADC (NASDAQ:ADCT; www.adc.com) announced today the availability of a complete portfolio of data center-grade infrastructure solutions. Designed to meet the rigorous guidelines of data center design set forth by standards such as TIA-942, ADC's data center product set is led by its flagship TrueNetR Structured Cabling System. ADC's...
NIST has completed the production of the latest Protein Data Bank PDB CD-ROM set. The CD-ROMs are produced quarterly; currently there are 1200 subscribers. This set contains the 16 972 protein or nucleic acid structures in the PDB as of Jan. 1, 2002. Eight CD-ROMs are required to contain the...
A Bubble chart is a variation of a Scatter chart in which the data points are replaced with bubbles. A Bubble chart can be used instead of a Scatter chart if your data has three data series. Each data series in a chart has a unique color or pattern and...
This paper explains how to create and run an update query. You use an update query when you need to update or change existing data in a set of records. As you proceed, remember that you cannot use an update query to add new records to a database, or to...
An audit is defined as “a formal examination of an individual or organization’s accounting records, financial situation, or compliance with some other set of standards”. If, for audit purposes, the mountains of digital data now existing or to be generated in the future can be trusted, sufficient safeguards must be...