A metadata-driven engagement agenda for the Akoma Ntoso media type v1.0

Lewis McGibbney, Chris Mattman, Bimal Kumar

    Research output: Contribution to conferencePaper

    Abstract

    This paper describes a practical, specification first software implementation of the Akoma Ntoso (AKN) Media Type v1.0 as a parser within the Apache Tika content analysis toolkit. We further our intention of extending the OASIS AKN
    committee specification (with the intention of lowering the barrier to entry) as a commonly identified IANA media type from which users, developers and publishers can benefit. Within the scope of this work we describe (i) the community driven development of the open source Akomantoso-lib parser
    as a java class representation of the AKN XML schema, (ii) a software driven evolutionary argument as to why extended engagement, interoperability and use of software clients for the AKN legal document specification is an essential component within the advancement of legal informatics, and (iii) a detailed
    description of the AKN parser and extraction functionality within Apache Tika; a metadata and content analysis toolkit. Tika, an open source project permissively licensed under the Apache License v2.0, currently has the ability to detect, parse
    and extract metadata and data from over 1,400 IANA media types making it the digital babelfish of software content analysis toolkits available across the open source software spectrum. Our work to implement Tika detection, parse and
    extraction wrappers for AKN presents a significant lowering of the barrier to entry for stakeholders across the AKN spectrum. Additionally this work also provides AKN consumers with a reliable, heavily supported, community-driven, flexible software implementation for continued use of the AKN
    standard for the representation, manifestation and interpretation of legal documentation.
    Original languageEnglish
    Publication statusPublished - 3 Aug 2015

    Fingerprint

    Metadata
    Specifications
    Interoperability
    XML

    Keywords

    • web-services
    • data integration
    • legal informatics
    • Akoma Ntoso
    • Apache Tika
    • metadata

    Cite this

    McGibbney, L., Mattman, C., & Kumar, B. (2015). A metadata-driven engagement agenda for the Akoma Ntoso media type v1.0.
    McGibbney, Lewis ; Mattman, Chris ; Kumar, Bimal. / A metadata-driven engagement agenda for the Akoma Ntoso media type v1.0.
    @conference{59110b4ec1684f8d8f36b5418b7bdf28,
    title = "A metadata-driven engagement agenda for the Akoma Ntoso media type v1.0",
    abstract = "This paper describes a practical, specification first software implementation of the Akoma Ntoso (AKN) Media Type v1.0 as a parser within the Apache Tika content analysis toolkit. We further our intention of extending the OASIS AKNcommittee specification (with the intention of lowering the barrier to entry) as a commonly identified IANA media type from which users, developers and publishers can benefit. Within the scope of this work we describe (i) the community driven development of the open source Akomantoso-lib parseras a java class representation of the AKN XML schema, (ii) a software driven evolutionary argument as to why extended engagement, interoperability and use of software clients for the AKN legal document specification is an essential component within the advancement of legal informatics, and (iii) a detaileddescription of the AKN parser and extraction functionality within Apache Tika; a metadata and content analysis toolkit. Tika, an open source project permissively licensed under the Apache License v2.0, currently has the ability to detect, parseand extract metadata and data from over 1,400 IANA media types making it the digital babelfish of software content analysis toolkits available across the open source software spectrum. Our work to implement Tika detection, parse andextraction wrappers for AKN presents a significant lowering of the barrier to entry for stakeholders across the AKN spectrum. Additionally this work also provides AKN consumers with a reliable, heavily supported, community-driven, flexible software implementation for continued use of the AKNstandard for the representation, manifestation and interpretation of legal documentation.",
    keywords = "web-services, data integration, legal informatics, Akoma Ntoso, Apache Tika, metadata",
    author = "Lewis McGibbney and Chris Mattman and Bimal Kumar",
    note = "Link to First International Akoma Ntoso Conference (IANC 2015) webpage: http://www.akomantoso.org/akoma-ntoso-conference/",
    year = "2015",
    month = "8",
    day = "3",
    language = "English",

    }

    A metadata-driven engagement agenda for the Akoma Ntoso media type v1.0. / McGibbney, Lewis; Mattman, Chris; Kumar, Bimal.

    2015.

    Research output: Contribution to conferencePaper

    TY - CONF

    T1 - A metadata-driven engagement agenda for the Akoma Ntoso media type v1.0

    AU - McGibbney, Lewis

    AU - Mattman, Chris

    AU - Kumar, Bimal

    N1 - Link to First International Akoma Ntoso Conference (IANC 2015) webpage: http://www.akomantoso.org/akoma-ntoso-conference/

    PY - 2015/8/3

    Y1 - 2015/8/3

    N2 - This paper describes a practical, specification first software implementation of the Akoma Ntoso (AKN) Media Type v1.0 as a parser within the Apache Tika content analysis toolkit. We further our intention of extending the OASIS AKNcommittee specification (with the intention of lowering the barrier to entry) as a commonly identified IANA media type from which users, developers and publishers can benefit. Within the scope of this work we describe (i) the community driven development of the open source Akomantoso-lib parseras a java class representation of the AKN XML schema, (ii) a software driven evolutionary argument as to why extended engagement, interoperability and use of software clients for the AKN legal document specification is an essential component within the advancement of legal informatics, and (iii) a detaileddescription of the AKN parser and extraction functionality within Apache Tika; a metadata and content analysis toolkit. Tika, an open source project permissively licensed under the Apache License v2.0, currently has the ability to detect, parseand extract metadata and data from over 1,400 IANA media types making it the digital babelfish of software content analysis toolkits available across the open source software spectrum. Our work to implement Tika detection, parse andextraction wrappers for AKN presents a significant lowering of the barrier to entry for stakeholders across the AKN spectrum. Additionally this work also provides AKN consumers with a reliable, heavily supported, community-driven, flexible software implementation for continued use of the AKNstandard for the representation, manifestation and interpretation of legal documentation.

    AB - This paper describes a practical, specification first software implementation of the Akoma Ntoso (AKN) Media Type v1.0 as a parser within the Apache Tika content analysis toolkit. We further our intention of extending the OASIS AKNcommittee specification (with the intention of lowering the barrier to entry) as a commonly identified IANA media type from which users, developers and publishers can benefit. Within the scope of this work we describe (i) the community driven development of the open source Akomantoso-lib parseras a java class representation of the AKN XML schema, (ii) a software driven evolutionary argument as to why extended engagement, interoperability and use of software clients for the AKN legal document specification is an essential component within the advancement of legal informatics, and (iii) a detaileddescription of the AKN parser and extraction functionality within Apache Tika; a metadata and content analysis toolkit. Tika, an open source project permissively licensed under the Apache License v2.0, currently has the ability to detect, parseand extract metadata and data from over 1,400 IANA media types making it the digital babelfish of software content analysis toolkits available across the open source software spectrum. Our work to implement Tika detection, parse andextraction wrappers for AKN presents a significant lowering of the barrier to entry for stakeholders across the AKN spectrum. Additionally this work also provides AKN consumers with a reliable, heavily supported, community-driven, flexible software implementation for continued use of the AKNstandard for the representation, manifestation and interpretation of legal documentation.

    KW - web-services

    KW - data integration

    KW - legal informatics

    KW - Akoma Ntoso

    KW - Apache Tika

    KW - metadata

    M3 - Paper

    ER -