Utilities

The name of the Utilities menu item is currently something of an exaggeration, as only one utility is available: the File Listing Utility. Selecting it lets you generate a list of all files in a particular directory, which is useful after running DocumentExtraction, DocumentExtractionTextOnly or TableExtraction. The utility lists every file in the directory; when a file name follows the CIK-RYYYYMMDD-CYYYYMMDD~ pattern, the utility reports the full file name and also breaks it into its components.
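
To illustrate what breaking a file name into its components can look like, here is a minimal Python sketch. It is not the utility's own code. It assumes the name starts with digits for the CIK, then `-R` and an eight-digit date, then `-C` and an eight-digit date, and it treats the trailing `~` in the pattern as "whatever follows", which is returned unparsed. The function name `split_prefix`, the `REST` key and the sample file name are hypothetical.

```python
import re

# Assumed prefix layout: <CIK>-R<YYYYMMDD>-C<YYYYMMDD>, followed by anything.
_PREFIX = re.compile(
    r"^(?P<cik>\d+)-R(?P<rdate>\d{8})-C(?P<cdate>\d{8})(?P<rest>.*)$"
)


def split_prefix(file_name):
    """Return the CIK, RDATE and CDATE parts of a file name.

    Returns None when the name does not start with the assumed prefix.
    """
    match = _PREFIX.match(file_name)
    if match is None:
        return None
    return {
        "CIK": match.group("cik"),
        "RDATE": match.group("rdate"),
        "CDATE": match.group("cdate"),
        # Everything after the dates is left unparsed in this sketch.
        "REST": match.group("rest"),
    }


# Hypothetical file names, used only for illustration.
print(split_prefix("1234567-R20200115-C20191231-example.htm"))
print(split_prefix("notes.txt"))
```

A name that does not match the assumed prefix yields `None`, which mirrors the idea that such files are still listed but not split into components.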

Select the Browse button next to the Folder to List Files of box and navigate to the folder whose contents you need listed. Then select a Folder to Save Results; this can be the same folder you are listing. Select OK, then look in the results folder for a file named filelisting.csv. If a filelisting.csv file already exists in the destination directory, you are notified and offered the choice of overwriting it or selecting another destination.
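
Once filelisting.csv exists, it can be loaded like any other CSV file. The sketch below is one possible way to do that in Python. It assumes the file has a header row that includes a CIK column and that it is UTF-8 encoded; neither detail is confirmed here. The helper names `load_listing` and `rows_for_cik` are hypothetical.

```python
import csv
from pathlib import Path


def load_listing(results_folder):
    """Read filelisting.csv from the results folder into a list of dicts.

    Assumes the first row of the file holds the column names.
    """
    listing_path = Path(results_folder) / "filelisting.csv"
    with listing_path.open(newline="", encoding="utf-8") as handle:
        return list(csv.DictReader(handle))


def rows_for_cik(rows, cik):
    """Keep only the rows whose CIK column equals the given value."""
    wanted = str(cik)
    return [row for row in rows if row.get("CIK") == wanted]
```

For example, `rows_for_cik(load_listing(folder), 1234567)` would return the rows for that filer, where `folder` is the folder you chose for the saved results. Values are compared as text because `csv.DictReader` returns every field as a string.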

| Column | Description |
| --- | --- |
| CIK | The Central Index Key of the filer associated with the source document of this artifact. |
| RDATE | The EDGAR dissemination date of the version of the filing that this content was extracted from. |
| CDATE | The CONFORMED date as reported in the header file. For financial reports this is generally the balance sheet date. For 8-K filings this is generally the date of the first underlying reportable event. Historically this has been a good proxy for the annual meeting date for Proxy filings. If there is no CONFORMED date in the header, we use the RDATE. |
| ACCESSION_KEY | Usually the letter F (but some artifacts might have a different first letter) followed by the last two digits of the accession number of the original filing. |
| TABLE_NUM | The index number of the object in the container it was pulled from, whether the source is a table or a file from a directory. For tables, this is the nth table in the document, as delimited by table tags (`<table>`, `</table>`). For files, this is the nth file listed in the directory when using Python's glob function. |
| PLACEHOLDER | Some artifacts have another identifier. |
| FILE_TYPE | The extension of the file. |
| FILE_SIZE | The size of the file as measured by a Python function. |
| FILE_PATH | The full path to the object. |
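
The FILE_TYPE, FILE_SIZE and FILE_PATH values can be approximated with Python's standard library. The sketch below is not the utility's implementation. It assumes that the extension includes the leading dot, that size is measured in bytes with `os.path.getsize`, and that files are reported in sorted order; the function name `describe_files` is hypothetical.

```python
import glob
import os


def describe_files(folder):
    """Return one record per regular file found directly inside folder."""
    # glob.escape keeps characters such as [ or * in the folder name literal.
    pattern = os.path.join(glob.escape(folder), "*")
    rows = []
    # glob does not guarantee an order, so sort for repeatable output.
    for path in sorted(glob.glob(pattern)):
        if not os.path.isfile(path):
            continue  # skip subdirectories
        rows.append(
            {
                "FILE_TYPE": os.path.splitext(path)[1],  # e.g. ".htm"
                "FILE_SIZE": os.path.getsize(path),      # size in bytes
                "FILE_PATH": os.path.abspath(path),
            }
        )
    return rows
```

Note that the `*` pattern does not match file names that begin with a dot, so hidden files are left out of this sketch.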
