STANDARDIZATION IN COINS FILES
Document Type:
Collection:
Document Number (FOIA) /ESDN (CREST):
CIA-RDP80B01139A000100090006-8
Release Decision:
RIPPUB
Original Classification:
S
Document Page Count:
4
Document Creation Date:
December 19, 2016
Document Release Date:
November 23, 2005
Sequence Number:
6
Case Number:
Publication Date:
January 17, 1969
Content Type:
MF
File:
Attachment | Size |
---|---|
![]() | 249.08 KB |
Body:
Approved For Rase 2006/02 t JDP80B011, t8_%W01 00090
17 January 1969
MEMORANDUM FOR: COINS Project Manager
SUBJECT Standardization in COINS Files
REFERENCES 1) My Memo to You, 8 Jan 1969,
Same Subject
2) COINS/192, Draft COINS Operating
Procedure, on Data Standards in
the COINS Experiment, 2 Jan 1969
1. In my memorandum to you dated 8 January
1969 (Reference 1), I enumerated several problems,
which would inhibit attempts to standardize data
entries in COINS files. Receipt of your most recent
memorandum (Reference 2) proposing a COINS Operating
Procedure for Data Standardization prompts me to
further elaborate on this subject.
2. I believe an attempt to achieve data standards
in the COINS Experiment is commendable. However, I
question whether on a basis of priorities it is the
most critical problem we should be attacking at the
moment. Right now I am most concerned with seeing
the communications/software aspects of the COINS net-
work operating in an efficient manner. I would not
recommend diluting this task by diverting our programming
support to standardization efforts. My next major con-
cern is with obtaining additional data on the query
languages and files maintained by other COINS participants'
so that I will be in a position to train CIA personnel
to query those systems prior to the start of the opera-
tional test. I fear that effort devoted to standardiza-
tion might delay my receiving this information.
3. I also question whether it is feasible to
expect to achieve any degree of standardization in
the near future, and if something cannot be achieved
in the short term, then it should be deferred until
uded Iroid autoApproved For Release 2006/02/06^ CA F 80B0Exc1
. 13 %Q90006-8
Approved For'' ease 2006/0 /eV AIRDP80B01 40"0100090006-8
after the experiment. Anyone with experience in
information storage and retrieval experiments can
attest that you have to freeze the experimental
design prior to start of the operational test
because further change will have an adverse, un-
measurable effect on results. I think COINS users
would become understandably confused and discouraged
during the experiment if they suddenly discovered
that certain data was no longer retrievable in a
manner to which they had become accustomed or worse
if they were unaware that data had been changed and
their searches were negative because they did not
specify the correct data item in the search rescriptiori.
The potential COINS users in CIA are not a wed defined
group that I could easily transmit changes to. For
this reason, I am opposed to effecting changes in data
standards during the operational test not only in CIA
files, but also in the files of other COINS participants.
4. I am not optimistic concerning the capability
of COINS participants to adopt common data standards
in the near future. The COINS Managers could probably
agree, after several discussion sessions, to a limited
set of standards on less controversial fields such as
how to enter dates. However, as you are aware, COINS
Managers do not control the COINS files (at least this
manager does not); therefore, negotiations are required
with the file builders and others within each agency.
This would be followed by renegotiation among agencies
and probably coordination w t t If after this
procedure, common s andards can be agreed upon, then
each agency must find and allocate the manpower
necessary to effect changes. This is a time consuming
effort and I can cite numerous instances where agreed
upon standards have never been applied to existing
files because of the large effort that would be
required to effect the changes. I personally have
participated in two interagency committees that were
established to develop respectively a common geo-political
area code and a standard subject code. Each of these
efforts took well over a year and the resultant agreed
upon codes were never officially adopted by some agencies,
even for new EDP files.
aix
Approved For Release 2006/02/06 :-Cl -RDP80B01139A000100090006-8
Approved Forl ase 2006/OSQJffW RDP80B011'9i 0100090006-8
L_ I
5. The above discussion is not presented to
imply that I am opposed to standardization. Standardi-
zation would obviously facilitate the use of COINS
files. However, the tone of your proposed operating
procedure implies that something is achievable now,
whereas it is my opinion that if we start working on
the problem now we may be able to obtain agreed upon
standards for a very limited set of data items by
1970. Further, I feel that the IHC should be direct
involved in an stan ar izati n f s c s 1 we
adopt standards or files, we are in effect
adopting standards for a significant number of inter-
agency EDP files, and these standards will continue
in effect regardless of the outcome of COINS. This
should be of concern to the IHC.
6. There is one area where I think we could
rather easily achieve standardization with significant
benefit to the user. This area concerns field names
or data elements as defined in your memorandum. This
seems to me the most logical place to begin because:
a. Users have become accustomed to dealing
with different code schemes and data
entry procedures among different files.
However, they will not be able to com-
prehend why the same field in two different
files is referred to with different
acronyms, e.g., the surname field is
referred to as NAME, SRNAM, and NAML in
three different COINS files.
b. These data element acronyms are merely
labels for which no standardization
precedence exists among agencies. We
would not be dealing with a sensitive
area involving previously established
prerogatives.
c. Changing these labels is a simple task
compared to changing data items.
7. I recommend that we undertake this task
immediately. Perhaps a member of your staff could
compile a list of data element acronyms in use in
COINS files with a recommended standard for those
Approved For Release 2006/02/06V1'X WT801301139A000100090006-8
Approved Forrease 20"06SEETF-RDP80B01'0100090006-8
fields in which multiple acronyms exist. If we
cannot arrive at agreement in this area, then I
would not be very hopeful concerning our ability to
agree on data standards in other areas.
8. I have no objection to the basic principles
enunciated in the referenced proposed Operating
Procedure. I question whether it is necessary to
formalize the procedure in writing, i.e., I am
opposed to documentation for the sake of documentation.
However, if you feel that it is desirable to publish
it, I have several recommended changes which I can,
transmit separately.
-GIA COINS Subsystem Manager
Approved For Release 2006/02f P80B01139A000100090006-8