GCN Home > 06/28/04 issue
Would a governmentwide XML schema registry cut duplication?
By Joab Jackson, GCN Staff
The Federal CIO Council wants to keep agencies from reinventing the wheel as they adopt Extensible Markup Language.

Owen Ambur, who co-chairs the CIO Councils XML working group, argues that a governmentwide XML schema registry, open to all agencies, would be a big help in describing data elements that already have XML names, instead of writing them anew.

It is far harder than it should be for folks to discover and reuse data elements, not to mention actual instances of data, said Ambur, a systems analyst for the Fish and Wildlife Service. They are often left with no practical choice but to reinvent the data elements they need.

Others, however, believe a federal schema registry would be redundant. An existing interagency collaboration space, Core.gov, already can do that job, said Marion Royal, a components expert in the General Services Administrations Office of Governmentwide Policy.

Core.gov, overseen by the Office of Management and Budget, was designed as a repository and collaboration space for sharing components. And the CIO Councils Emerging Technology Subcommittee takes a broad view of what constitutes a component: anything from a small Java script to an entire e-government initiative. XML schemas can be considered components.

Core.gov looks at the broad scope of not only component-based development but also service-oriented architectures, Royal said.

But Ambur said the federal clearinghouse for components of all sorts cannot work as a registry for schemas.

Once we have a bunch of XML elements and schemas [in Core.gov], will people be able to readily discover them? he asked.

XML provides an open format for sharing information among different systems. In order for systems to share, they must have a common set of data definitions. And as agencies use XML in ever-more complicated ways, they inevitably start writing schemasdictionary definitions or labelsto keep all the different types straight.

No one is enforcing a common terminology to eliminate the slight variations that stymie cross-agency data exchanges. And no one is watching out for duplication of work, Ambur said.

More news on related topics: Content / Record Management, Management, New Products / Technology, IT Management, Web Strategies