Cspro 75
Cspro 75
Cspro 75
Version 7.5.0
International Programs
Population Division
U.S. Census Bureau
4600 Silver Hill Road
Washington, DC 20233
[email protected]
Table of Contents
Table of Contents 1
CSPro User's Guide 20
The CSPro System 20
What is CSPro? 20
CSPro Capabilities 21
Release History 22
What's New in CSPro? 24
What's New in CSPro? 24
What's New in CSPro 7.0? 24
What's New in CSPro 7.1? 26
What's New in CSPro 7.2? 28
What's New in CSPro 7.3? 29
What's New in CSPro 7.4? 31
What's New in CSPro 7.5? 34
CSPro Applications 36
Data Entry Applications 36
Batch Edit Applications 36
Tabulation Applications 37
Data Dictionary 37
Forms Design 37
Tool List 38
CSPro General Concepts 41
CSPro Initial Screen Layout 41
Trees 41
Windows 42
CSPro General Functionality 43
Data Sources 43
Connection String 44
Encrypted Data 45
Unicode Primer 46
Synchronization Overview 48
Paradata 52
Multiple Language Applications 53
Mapping 53
How To ... 55
Create a CSPro Application 55
Open an Existing Application 55
Change the View 56
Designer Font Preferences 56
Change Windows 57
Add Files to an Application 57
Drop Files from an Application 57
Change the Print Page Setup 57
Print All or Part of a Document 58
Save an Application 58
Close an Application 58
Save an Application with a New Name 58
Specify Application File Names 59
Pa ge 1 of 684 Ta ble of Contents
Pack an Application 60
CSPro Settings 60
Get Help 60
Collaborating on CSPro Development 60
Data Dictionary Module 62
Introduction to Data Dictionary 62
Organization 63
Questionnaire and Dictionary Organization 63
Data File Type Structure 64
Dictionary Hierarchy 65
Dictionary Concepts 67
General 67
Labels 67
Names 67
Notes 67
Levels 68
Level Description 68
Level Properties 68
Records 70
Record Description 70
Record Properties 70
Record Type 71
Required Record 71
Maximum Number 71
Items 73
Item Description 73
Identification Items 73
Subitems 73
Item Properties 74
Starting Position 75
Length 75
Data Type 75
Occurrences 75
Decimal Places 76
Decimal Character 76
Zero Fill 76
Value Sets 78
Value Sets Description 78
Value Set Properties 78
Value Set Images 79
Values 80
Value Description 80
Value Properties 80
Relations 81
Relation Description 81
Relation Properties 81
Data Dictionary Application 82
Creating a Dictionary for a New File 82
Creating a Dictionary for an Existing File 82
CSPro runs under Windows Vista, 7, 8, and 10. It does not run under other operating systems such as Linux or Mac
OS. It is a public domain product, so it can be used and distributed at no cost.
CSPro can be used to process data from censuses and surveys, both small and large. Typical subject areas include:
CSPro uses data dictionaries to provide a common description of each data file used. CSPro encodes data in UTF-8
format. CSPro provides tools to view data and text files, to view tables created by CSPro, to convert IMPS and ISSA data
dictionaries to and from CSPro.
CSPro is not intended to provide database management capabilities. However, the data generated and/or manipulated by
a CSPro application can be imported into a database system. While CSPro provides some tabulation capabilities, it is
not intended to replace more sophisticated statistical analysis software such as R, SAS, SPSS, Stata, etc.
If you have never used CSPro before, you can refer to the Getting Started Guide, a tutorial that gives an overview of
CSPro's capabilities. If you are a previous user, you may want to look at a list of this version's new features.
CSPro Capabilities
CSPro Applications
General Concepts
General Functionality
How to ...
CSPro Capabilities
Process Census or Survey Data
Given an existing data file, a user can develop a CSPro application that will examine the file for inconsistencies,
structural defects, or other errors. CSPro permits the user to generate detailed reports on all errors found; the user may
also create subfiles from the original data, and may use multiple look-up files during the validation and/or report-
generation process.
Once a case has been completely entered, the operator can modify any part of the existing data and can add or remove
information, as well (subject to application constraints).
CSPro supports both dependent and independent verification (double keying) to ensure the accuracy of the data entry
operation. Using independent verification, operators can key data into separate data files and use CSPro utilities to
compare them. Using dependent verification, operators can key data a second time and have CSPro immediately
compare it to what was keyed the first time on a field-by-field basis.
Tabulate Data
The user can create an application to produce frequency distributions or cross-tabulations using two to four variables.
Results can be displayed either globally (for the totality of the data file) or according to one or more elements of the
geographic hierarchy. Tabulations may show only percentages, or percentages in conjunction with counts. Data may be
weighted or unweighted.
Interactive Editing
CSPro language elements can be used to construct a series of tests to be carried out on a case-by-case basis using the
CSEntry module. Whether adding a new case or modifying an existing case, CSPro instructions permits interactive
editing and correction of data elements. If the user desires, a report on editing activity may be generated and saved for
printing after the session is completed.
Release History
CSPro was first released in 2000. Over the years, many new features have been added to the software, and in addition to
the Windows desktop version, new versions have been released for Android. The following table shows the software's
release history:
CSPro 7.x
CSPro 7.5
CSPro 7.4
CSPro 7.3
CSPro 7.2
CSPro 7.1
CSPro 7.0
CSPro Designer
You can resize a roster to its ideal size by right-clicking on it and selecting AutoFit.
Data Sources
Introduction of the new CSPro DB and none data sources.
Using the synchronization functions, CSPro DB data files can be synchronized with Dropbox and FTP servers on
a case by case, rather than data file, basis.
The creation of listing and operator statistics files can be disabled.
Data Entry
ID items can be set to auto increment.
Addition of several data entry options for mobile devices:
Automatically advance on selection.
Show field labels above question text.
Display error message numbers.
Addition of several new PFF attributes:
CaseListingFilter: Filters the list of cases shown in the case listing.
ShowInApplicationListing: Specifies whether an application is shown in the list of applications.
Pa ge 24 of 684 Wha t's New in CSPro?
OnExit: Automatically launches a CSPro application after executing the current one.
Lock: New lock flags CaseListing (does not show the case listing screen) and Delete (prevents the user
from deleting cases).
A case note can be defined, which will display in the case listing.
Multiple operators can leave notes on the same field.
On Windows, the dialog box for the accept (and selcase, show, and showarray) function has been redesigned to
be more tablet friendly.
Cases can now be verified on a case by case basis, rather than having to verify cases in file order.
Portuguese language strings added to Android.
Batch
A new PFF attribute, InputOrder, allows you to override the default order in which cases are processed in a data
file.
Logic
Introduction of new functions and statements:
ask: A new way to skip fields by programming using enablement conditions.
compress and decompress: A pair of functions to compress and decompress .zip files.
config: A way to initialize string variables to values defined on a local machine.
diagnostics: A troubleshooting function for getting information about the CSEntry build.
forcase: A loop that simplifies the processing of each case in an external data file.
getcaselabel and setcaselabel: Functions to set and retrieve a human readable label associated with a
case.
isverified: A way to determine whether a case has been verified.
keylist: A function that returns a list of keys in a data file, including the input file in a data entry
application.
setoperatorid: A way to set the operator ID in logic.
syncconnect, syncconnect, syncdata, syncdisconnect, syncfile, and syncserver: A set of functions
to support the new synchronization functionality.
timestamp: A way to easily calculate time spans by working with UNIX times.
The skip statement can be called without an argument, which will skip to the next field in the application.
The systime function can take an optional argument to extract just the current time's hours, minutes, or seconds.
The ispartial function can return the partial save status of cases in external dictionaries.
Passing ID arguments to writecase is deprecated and the arguments are ignored.
The key function can be called on the main input dictionary of an application.
All settings can be removed by calling savesetting with the argument clear.
The select options of an errmsg can now be used to skip to a field instead of only reentering a field.
You can manipulate the location and order that external dictionary and batch input data sources are processed
using: set access, set first, and set last.
The setfile, open, and close functions can be used on the input dictionary in a data entry and batch
application, allowing you to dynamically modify the data source being processed.
The execsystem function on Android can send SMS text messages.
Tools
Pa ge 25 of 684 Wha t's New in CSPro?
A new tool, Data Viewer, allows for the viewing of data files in dictionary format (in lieu of using Text Viewer). The
tool is also used to download data from a synchronization server.
The Export Data tool can now output Stata exports in Unicode format. To open these exports, you need Stata
version 14 or later.
The Excel to CSPro tool is much improved, allowing you to create dictionaries from Excel data and to convert
data from multiple worksheets.
The Concatenate Data tool and the fileconcat function now combine case notes, partial save statuses, etc.,
and the tool can identify duplicate cases in files.
CSWeb
CSWeb introduced as a convenient way to synchronize CSPro data files on a central server.
Other
Introduction of a lightweight CSPro installer, with the CSPro examples now installed to the Documents folder.
Introduction of a new custom help documentation system.
Fixes to allow CSPro to fully work on Windows 10 systems.
Paradata
The introduction of a new system for collecting paradata during an application's run. The paradata events are
saved to a log file with the extension .cslog. You can control what kinds of events are stored by modifying an
application's paradata options.
A new logic function, paradata, that can be used to control or query the paradata log.
A new tool, Paradata Viewer, that displays reports about the collected paradata.
Another new tool, Paradata Concatenator, that combines multiple paradata logs into a single log.
Deployment
A new tool, Deploy Application, that deploys one or multiple applications to a server (or a local file) so that they
can be downloaded onto interviewer devices.
Deployed applications can be downloaded using CSEntry on mobile devices.
Reports
An interface for generating HTML-templated reports from case data as well as from results from queries into
paradata and other data sets.
New logic functions, setreportdata and report, for setting the inputs and creating the reports.
Messages
Expanding multiple language application support, messages defined in the message file can be specified in
multiple languages. These messages, along with string literals wrapped in the tr function, will be displayed in the
application's current language.
CSPro Designer
You can move a value set to the first position, to become the default one shown in a data entry application, by
right-clicking on a value set and selecting Make Primary Value Set.
If you right-click on a dictionary in the tree, an option, View Data, allows you to open in Data Viewer the data file
last associated with that dictionary (as stored in the PFF file).
Data Entry
A data source option that allows you to specify the number of minutes after which a case will automatically be
partially saved.
Question text fills can show the return values of user-defined functions.
A data entry option, Validate alpha fields, instructs CSEntry to validate values entered into alphanumeric fields
using an item's value set, as happens to numeric fields by default.
A mobile option, Display value set codes alongside labels, displays a value set's codes in addition to the
labels when displaying a field using a control other than a text box.
Logic
An errmsg-like function, warning, that can be used for "soft check" error messages. These messages will not be
displayed to the operator when advancing in the case (such as when resuming from a partial save).
A multiple language application function, tr, for correctly displaying string literals or numbered messages in the
application's current language.
A new function, countcases, counts the number of cases in an external file that meet a certain criterion.
New functions, getproperty and setproperty, for accessing and changing the properties of fields or the entire
application.
A new function, logtext, for writing user-defined messages to the paradata log file.
A new function, sqlquery, that executes a query on a SQLite database and returns a result set in a variety of
formats. Paradata logs are SQLite databases.
A new message formatter, "%v", that displays items in the format specified in the dictionary.
The timestamp function returns the number of milliseconds in addition to the seconds.
A new function, timestring, returns a custom formatted date and time string based on a given timestamp.
The sysdate and systime functions can return a formatted local date or time from a UNIX time value returned by
the timestamp function.
The uuid function can return the UUID (internal key) associated with a case.
The loadcase and delcase functions will give compiler warnings if the zero fill settings of the arguments do not
match the settings of the external dictionary's IDs.
The writecase and delcase functions now work in a forcase loop.
Multiple string literals are automatically concatenated into a single string when separated by whitespace (e.g.,
"A" "B" is evaluated as "AB").
The selcase, show, and showarray functions can display title headings.
The setfont function now sets the number pad's font differently from that of the other extended controls.
In a data entry application, when splitting items from a multiply occurring record into multiple groups, you no
longer have to specify an occurrence number when referring to items on the split groups (assuming that you want
the value of the current occurrence).
Tools
Pa ge 27 of 684 Wha t's New in CSPro?
From Data Viewer's Tools menu, you can launch the Export Data and Tabulate Frequencies tools to directly
process the cases of the open file.
Data Viewer can display each value's label instead of only the code.
Data Viewer now shows additional information about data files, including the synchronization history. The tool also
has new filtering options and can show deleted or duplicate cases and can display the contents of a file in sorted
key order.
Excel to CSPro conversions can now be run via PFF files to streamline deployment conversions.
The Excel to CSPro tool has two new options, one for conditionally running conversions based on the existence of
file changes, and the other to allow you to update existing files rather than to always create new files.
Synchronization
Faster Bluetooth transfer speeds.
Fix to allow faster transfer speeds when calling syncdata or syncfile in the GET direction across all
synchronization options (Bluetooth, CSWeb, Dropbox, and FTP).
General
A new concept, blocks, for grouping several fields into a related unit. Blocks can be used in various ways in logic
and can have question text that appears when entering any of the fields in the block. On mobile devices, all fields
on a block can be displayed on the same screen and the operator can enter values into these fields in any order.
Dictionary names can be longer than 32 characters.
CSPro Designer
Several reports about dictionary completeness can be generated using dictionary analysis.
A variety of tasks can be performed and reports generated from CAPI question text using question text macros.
Data Entry
The Windows desktop version of CSEntry has options to view cases with duplicate case IDs and to view deleted
cases. Cases can be undeleted.
A new option specifies that combo boxes should display only discrete (non-range) values.
Logic
A function, dirdelete, deletes an empty directory or group of empty directories.
A function, getbluetoothname, returns the name of the device as broadcast to other Bluetooth devices.
A function, getvaluelabel, returns the label from the value set corresponding to an item's current value.
A function, regexmatch, returns whether a string matches the regular expression.
Additional variables (array, list, and file) can be declared locally. The value of each variable is reset upon
entry to the PROC or function where it is declared.
The gps function, when called with the read argument, returns the most accurate reading, not the most recent
reading, after timing out.
Synchronization
The synchronization functionality from CSPro 6.x (.pnc files and the sync function) has been removed. Syncing
files and data must be done with the approaches introduced in CSPro 7.0.
Messages
Translations of some runtime messages added for Chinese, French, Portuguese, Russian, and Spanish. You can
help translate additional messages or add messages in another language.
Android
CSEntry is distributed in both 32-bit and 64-bit versions.
When making calls to the gps function, if the device's GPS receiver is not enabled, the operator will be prompted
to enable location services.
The properties and behaviors of CSEntry's menus can be modified via logic. The way that hidden applications are
shown is now controlled using this mechanism.
Automatically launching an application, when only one exists on the device, is no longer the default behavior.
The Start New Case option is now a floating green icon.
CSWeb
Add many users at once by uploading a .csv file that specifies user details.
Mapping
CSPro Designer
Replacement of the logic editor with a Scintilla-derived editor with better performance and functionality, including
the ability to change the font size, view line numbers, go to a specific line, display autocomplete suggestions, and
more. There are additional keyboard shortcuts to improve efficiency while writing logic, and the reference window
has been reworked to provide more useful information.
When right-clicking on the main dictionary of a batch application, an option allows you to view the output data in
Data Viewer.
Data Entry
A field's capture type is now considered when validating fields. Date fields must now contain a valid date and
check box fields can now only contain values that exist in the value set.
The Force Out-of-Range field property has been renamed to the Validation Method field property. This change
allows you to specify out-of-range fields without operator confirmations directly from the form designer (rather than
requiring logic).
The Validate alpha fields data entry option has been removed. If an alphanumeric field has a capture type other
than textbox, it will automatically be validated.
A new PFF attribute, Key, combines the functionality of StartMode along with the ability to automatically fill in IDs
for new cases.
Batch
The default listing width has been increased from 80 to 120 and the space allocated to displaying denominators
now supports 10-digit values.
Logic
Introduction of logic objects with functions that can be called on the object using dot notation.
New functions for arrays: clear and length.
New functions for files: close, open, read, and write.
The list object is now fully functional, with assignment and new functions: add, clear, insert, length, remove,
seek, and show.
Simplified creation of dynamic value sets with the new valueset object that has the functions: add, clear, length,
remove, and show.
Easier way to manipulate and run PFF files with the new pff object that has the functions: exec, getproperty,
load, save, and setproperty.
A new map object that allows the displaying of maps on Android that has the functions: addImageButton,
addMarker, addTextButton, clearButtons, clearMarkers, getLastClickLatitude, getLastClickLongitude,
getMarkerLatitude, getMarkerLongitude, hide, removeButton, removeMarker, setBaseMap,
setMarkerDescription, setMarkerImage, setMarkerLocation, setMarkerOnClick,
setMarkerOnClickInfoWindow, setMarkerOnDrag, setMarkerText, setOnClick, setTitle, show,
showCurrentLocation, zoomTo.
A new function, pathconcat, simplifies combining multiple strings into a single string representing a file path.
A new function, view, launches the system's default viewer to display a file or website.
A new message formatter, "%l", that displays an item's label from the value set corresponding to an item's
current value.
Android
Reading GPS coordinates now uses Google's Fused Location Provider, which makes it easier to capture
coordinates while indoors.
The way that hidden applications are shown has been restored to the behavior used up to CSPro 7.1.
The add/insert/delete occurrence options on the case tree has been restored to the behavior used up to CSPro
7.1 (displaying in both operator- and system-controlled modes), though these options can be disabled via logic.
Tools
The Map Viewer and Convert Shape to Map tools have been removed from the CSPro installation. Users still
needing these tools can find them on www.csprousers.org.
The Export Data tool now writes labels at the end of processing, so if an item's value set is changed during the
run, labels from that value set will be exported.
The Reformat Data and Sort Data tools allow you to output data to a different data source type than the input
data. The Concatenate Data tool no longer requires the specification of the data source type of the output file.
The Excel to CSPro PFF parameters have changed from InputData to Excel.
The Paradata Concatenator PFF parameters have changed from InputData to InputParadata and OutputData to
OutputParadata.
The ExportXMLMetadata tool's HTML export properly outputs Unicode question text.
General
A new special value, refused, allows for processing refusals differently from other numeric responses. Refusals are
defined in a dictionary value set, referenced in logic using refused, and are conditionally shown to operators. The
entry of refusals can be overridden using the OnRefused function.
A CSPro Settings dialog, accessible in the CSPro Designer as well as in CSEntry, allows users to clear saved
synchronization and data source credentials.
Licenses for open-source software that CSPro uses have been included and can be accessed using the Help ->
About menu in the CSPro Designer.
Data Sources
Change to the format of the CSPro DB data source to store data in a relational database format that can be
queried using SQL. CSPro DB files created in version 7.4 cannot be read in earlier versions of CSPro.
Introduction of the new Encrypted CSPro DB data source, allowing for file-based data encryption with the ability to
Pa ge 31 of 684 Wha t's New in CSPro?
cache passwords.
Ability to include, as part of a PFF or in a setfile function call, a connection string that details how to open a
data source.
Dictionary options include the ability to disallow use of a dictionary to modify data in Data Viewer or to export data
to another output format.
CSPro Designer
Enhancements to the logic editor, including parentheses matching and improvements to the find and replace
dialogs.
The synchronization options have been updated. Synchronization can be removed from an application by
unchecking the Enable Synchronization box. The options for downloading application files has been removed in
favor of the new Update Installed Applications.
Question text occurrence values can be harmonized with dictionary records using a new question text macro.
Data Entry
When starting a new case, fields can be prefilled from values specified in a PFF's Parameters section.
The compiler no longer issues an error when reentering a protected field.
The Combo Box extended control can now be used on discrete numeric fields and alphanumeric fields.
On Windows, the search bar on extended control popup windows is now hidden by default.
Inserting a case now only allows for the addition of a single case at a time (regardless of the PFF's AutoAdd
value).
Data Viewer
The tool has been rewritten with additional features including multiple language support and additional ways to
view case data.
A case's notes can be extracted as another CSPro data file, which can then be used for exporting or other data
operations.
Ability to download or synchronize data from the local Dropbox folder or a local FTP server folder which can be
faster than downloading data over the Internet.
Logic
A new function, ischecked, returns whether a code is checked as part of a check box field's selections.
A new function, protect, simplifies changing the protected property of a field.
A new function, filetime, returns the last modification date/time of a file or a directory.
A new function, hash, returns the hash value of a string.
A new statement, when, executes a statement based on the value of one or more other variables.
A new version of recode, sharing the syntax of when, with additional options for what the recoded value can be
assigned to.
A new function, startswith, that returns if a string begins with a specified prefix.
A new function, syncapp, for downloading new versions of an application from a deployment server.
A new function, syncmessage, sends and receives string messages from a Bluetooth server.
A new function, barcode.read, reads a barcode using an Android device's camera.
A new object, SystemApp, that simplifies interacting with external applications. It has the functions: clear, exec,
getResult, and setArgument.
Using valueset.add it is possible to add a single value or a range of values from an existing value set.
Pa ge 32 of 684 Wha t's New in CSPro?
New options for the syncconnect function to connect to a local Dropbox or FTP folder.
A range of occurrences can be supplied to insert for mass insertions or delete for mass deletions.
The tolower and toupper functions work properly with accented characters.
The view function, on Windows, now displays webpages in an embedded browser.
The field name of the last location of a partial save can now be retrieved in any procedure or function.
The loadcase and delcase functions can now load or delete cases based on a full key string argument.
The countcases and keylist functions can now be called, in certain circumstances, on the input dictionary.
The find and locate functions have a new relational operator option, uuid, that allows loading cases by a case's
UUID.
The set access function, or optional arguments to the dictionary argument of countcases, forcase, keylist, or
selcase, allows permanent or temporary modification of the dictionary access parameters, which control the
order and what kinds of cases are processed.
The set first and set last functions now use the dictionary access parameters when determining what is the
first or last case.
The sysparm function can be used to determine the presence of command line arguments on Windows.
Messages
Messages can be defined for multiple languages by including multiple language names in the Language directive.
Translations of some runtime messages added for Vietnamese. You can help translate additional messages or
add messages in another language.
Android
An operator (or respondent) can sign the Android screen using the execsystem function and have the signature
saved as an image file.
A new menu option, Update Installed Applications, to automatically check for and install new versions of an
application that has been updated on a deployment server.
Applications can be added to the device by scanning a QR code.
A new Settings screen, accessed via the menu on the Entry Applications screen, allows user to clear saved
credentials.
The option for selecting that hidden applications are shown has been moved to the Settings screen.
The default setting for whether or not the case tree shows on phones and tablets can be modified via logic.
Tools
The Concatenate Data, Index Data, and Reformat Data tools now allow the saving of PFFs with the specified
settings and the loading of the settings of previous runs.
The Reformat Data tool has been completely rewritten. If using a data source with an embedded dictionary, such
as a CSPro DB file, you do not need to specify the input dictionary. The dictionary differences are displayed with
more detailed information, including information on destructive changes that would occur during a reformat.
The Index Data tool has also been completely rewritten. It has a new interface that simplifies running the program
on multiple input files; it now works with non-text data files; and there is a more modern interface for viewing
cases while selecting which duplicate cases to keep.
The Sort Data tool allows case sorts on non-text data files that have duplicates.
When the Sort Data tool processes a sorting key item that does not exist, it treats it as the lowest possible value,
sorting it first when sorting in ascending order. In the past, such values were sorted as 0 (so they would be sorted
after negative values when sorting in ascending order).
The Deploy Application tool supports setting files to only be updated when an application is first installed and not
on subsequent updates.
Pa ge 33 of 684 Wha t's New in CSPro?
The Deploy Application tool can generate a QR code that can be used to install applications on mobile devices.
Case Processing
Items not defined in the dictionary are not written out when a case is saved. In the past, when using absolute
positioning, contents of a record that were not defined in the dictionary were still written out when the case was
saved.
Items that are not valid for the item type are not written as they initially appeared in the input data. For example, if
the value for a numeric item was "XX," previously, as long as it was not modified, it would be written as "XX." Now,
because it cannot be represented as a valid numeric value, it will be written as default.
CSPro processes items using the item type specified in the dictionary. Previously, some tools processed items
by using the text representation of the item. For example, the Compare Data tool would indicate that "05" and " 5"
were different values because of the differing zero fill setting, but now the tool treats those values as equal.
When values are saved, special values are converted only based on linkages defined in the item's primary value
set. Any special value specifications defined in non-primary or dynamic value sets are ignored.
Case construction is more forgiving of errors, allowing level 2+ records to appear before their parent level's records.
Additionally, level 2+ nodes are joined (by ID) even if some records from a different node separate the nodes. For
example, if there is a case with a first level ID of 1 and second level IDs of A and B, in the past, 1/1A/1B/1A would
be processed as three second-level nodes, but now they will be handled as two second-level nodes (1/1A+1A/1B).
The notes and status files, used when processing text data files, are now created only when necessary. (For
example, if there are no notes to save, the file is not created.) The information in those files can now be accessed
in non-entry applications using functions like getnote. Case labels are also now supported for text data files, with
the labels saved to the status file.
CSWeb
A Sync Report summarizes the total number of cases uploaded to CSWeb.
Create custom roles that specify dashboard and dictionary permissions.
Downloading data from a CSWeb server is now faster.
General
On Android devices it is possible to record and play audio in two modes: interactive mode, with the enumerator
able to control the recording; and background mode, with the recording controlled by logic functions using the new
audio object.
Listing files can now be written as comma-separated values (CSV) or CSPro data files. Traditional listing files in
text format are still supported.
Android
The csentry directory on Android has moved. For new installations the csentry directory will be located at:
<external storage>/Android/data/gov.census.cspro.csentry/files/csentry
If you are upgrading from an earlier version of CSEntry, the csentry directory will remain at:
<external storage>/csentry
This change is required to support the more stringent security requirements introduced in the latest versions of
Android.
Pa ge 34 of 684 Wha t's New in CSPro?
Logic
A new object, audio, that allows for recording or playing audio. It has the functions: clear, concat, length, load,
play, record, recordInteractive, save, and stop.
A new object, hashmap, that facilitates storage of numbers or strings in an associative array. It has the functions:
clear, contains, getKeys, length, and remove.
A new function, syncparadata, similar in behavior to syncdata, allows for the syncing of paradata between
devices over Bluetooth, or between a device and a CSWeb, Dropbox, or FTP server.
A new function, replace, returns a string with one of more instances of a substring replaced with new text.
A new function, encode, escapes special characters to facilitate writing to HTML or CSV files, or encodes
characters when writing out URIs or URI components.
A new namespace, path, has functions for interacting with file paths: path.concat, path.getDirectoryName,
path.getExtension, path.getFileName, and path.getFileNameWithoutExtension.
User-defined functions now support optional parameters as well as passing numeric and string values by
reference.
Conditional compilation of logic is now possible due to the addition of a logic preprocessor.
New functions for list objects: list.removeDuplicates removes duplicate values; list.removeIn removes
values specified in an in list; and list.sort sorts a list.
The list.show and valueset.show functions work with string lists and string value sets.
The valueset.randomize function is a new way to randomize value sets (which previously could be done using
randomizevs). These value set randomizations now obey any seed value provided using the seed function.
The valueset.sort function sorts the order of entries in the value set by either label or code.
The randomin function now accepts non-integer values in the input in list.
New numeric and string variables can be declared inline in the recode statement's destination variables section.
A new callback function, OnSystemMessage, provides a way to override the displaying of system error messages.
It is possible to check a value against all special values using the code special in an in list (which are also used
by recode and when statements).
Lists can also be used as inputs of an in list.
The loadsetting and savesetting functions accepts numeric values as part of the attribute-value pair.
The hash function accepts numeric values as inputs.
Tools
The Deploy Application tool has options to refresh all files or all data entry applications within a folder.
CSWeb
CSWeb can convert case data to MySQL/MariaDB relational tables using a command line process, allowing for
more dynamic reporting.
You use CSPro to develop the batch edit application. You use CSBatch to run the application. For small surveys and for
testing applications, you can run CSBatch directly from CSPro, on the same computer. For large surveys and censuses,
which require a production environment, you can transfer the application files to other computers and run CSBatch on
them.
Tabulation Applications
A Tabulation application contains a set of table specifications and a data dictionary describing a data file to be
tabulated. When you create your application, you can use an existing data dictionary or you may create one as you
create the application.
In a Tabulation application, you can:
• Cross-tabulate a virtually unlimited number of variables.
• Tabulate variables created "on the fly" under program control.
• Select the universe of tabulation.
• Tabulate values and weights.
• Tabulate simple counts and percents .
• Tabulate mean, median, mode, standard deviation, variance, n-tiles, proportions, min, max .
• Perform table cell manipulation after tabulation .
• Define detailed table formatting .
• Save tabulations in several file formats.
• Copy tables to spreadsheets or word-processing documents.
• Produce tables by geographic area.
Data Dictionary
A Data Dictionary describes the overall organization of a data file, in other words, it gives a description of how data are
stored in a file. CSPro requires that a data dictionary be created for each different file being used. A Data Dictionary file
has the extension .DCF.
In the Data Dictionary you can:
• Define simple or complex hierarchical file organization.
• Define hierarchical levels, identification items, records, items (fields or variables), value sets (categories of values), and
values.
• Create descriptive notes for documentation.
• Define multiple-occurring items.
• Produce reports of file organization.
See also: Creating a Dictionary for a New File, Creating a Dictionary for an Existing File
Forms Design
Pa ge 37 of 684 CSPro Applica ons
You can create an unlimited number of forms (screens) for data entry. These can be designed independently or as part of
the data entry application.
There is usually one forms file (.fmf) per application, but there may be multiple forms files. Each forms file contains one
data dictionary file (.dcf) that represents the primary data file that is being created or modified.
Tool List
To run a tool, open the Tools menu and select one of the tools listed below. There is a user's guide for most of the tools.
Data Viewer
View the contents of a CSPro data file, displayed in tables based on the dictionary contents. The tool allows you to view,
but not modify, all of the cases in the file. You can filter the cases based on a search key. In addition to showing the
data described by the dictionary, the tool also display the notes and other metadata saved for each case.
Text Viewer
View the contents of any text file up to a maximum of 32,000 characters wide and up to two gigabytes in size. You can
copy, save, or print all or part of the contents of the text file. You can also find text in the file, identify line and character
position in the file, and copy tabular reports to spreadsheet programs. The file cannot be modified within the Text Viewer
utility.
Table Viewer
Examine, but not change, the contents of any CSPro tables file. A table file is produced by running tabulation
applications or using the Tabulate Frequencies tool. You can copy, save, or print all or parts of the tables in RTF (for
word processing programs), or HTML (for Internet), or TAB delimited (for spreadsheet) formats.
Table Retrieval
Retrieve and display tables, maps, and other previously prepared documents from a large database of documents based
on geography, subject matter, and title. It is very useful as a data dissemination tool.
Deploy Application
Deploy one or multiple applications to a server so that they can be downloaded onto interviewer devices.
Pack Application
Pack all the files in a CSPro application into a ZIP file so the application can be moved to another computer or sent as
an email attachment.
Tabulate Frequencies
Produce frequency distributions of all or some of the variables in a data file. You simply select the variables (value sets)
you want to tabulate and provide the name of the data file. More than one data file can be tabulated.
Export Data
Export selected data records or parts of data records to tab- or comma- delimited files. These files can be imported into
spreadsheets or databases. The tool also allows you to export data to SAS, SPSS, Stata, or R formats.
Reformat Data
Reformat data from one file format to another using an input and output data dictionary. Fields with corresponding names
are copied from the input to output file. This is useful for reorganizing data records or lengthening data items.
Compare Data
Compare the contents of two data files and identify the differences. The data files must have the same structure, that is,
the same CSPro dictionary must describe both data files.
Concatenate Data
Concatenate (join end-to-end) two or more CSPro or text files. You do not need a dictionary for this tool when working
with text-based files.
Paradata Viewer
Display reports about the paradata collected during an application's run.
Paradata Concatenator
Combine multiple paradata logs into a single log.
Index Data
Generate indices for data files or identify duplicate cases in a data file.
Excel to CSPro
Convert data from Excel workbooks to CSPro data files. The tool can also create a CSPro dictionary for the data in an
Excel worksheet, performing an analysis to determine the best CSPro format for the data.
PFF Editor
Edit and view all options available when running a CSPro application file via a PFF file.
Production Runner
Set up a series of CSPro processing tasks and then run them all at once.
Deprecated Tools
These tools are no longer distributed with CSPro but are instead available on www.csprousers.org.
Convert Dictionary
Convert IMPS and ISSA data dictionaries to CSPro dictionaries, or convert CSPro dictionaries to IMPS or ISSA
dictionaries. You can also convert ISSA dictionaries to CSPro dictionary and data entry forms.
Form Viewer
View snapshots of the data collected during a data entry application.
Map Viewer
View, create, and manipulate thematic maps of data.
Create a new application: This allows you to create a new application when CSPro is launched. After you specify the
names of the applications files, the new application is opened.
Open an existing application: This allows you to open an existing application either by selecting a recently used
application from the list provided or to select, using other files, any CSPro application available on the computer or
connected servers.
If you cancel the dialog box, CSPro will remain open so that you can use CSPro tools or at a later time open an
application or create a new application.
Trees
After you create an application, the tree will display the application(s) that are currently open, all the files that belong to
that application and their relationships with one another. The files tree is always present. When you close the application
or file, it is removed from the files tree.
There are five kinds of trees in CSPro:
• Files tree shows all the applications that are open, and the files they contain.
To change the tree on the left side, click the tab of the tree you want to see.
The tabs at the bottom of the tree indicate which file tree is displayed. To change the tree, click the tab of the tree you
want to see. Use Ctrl+T to see the full file names (labels) of the files you have open. Double-click on the files tree to
switch the frame on the right side of the screen.
See also: Data Dictionary Tree, Data Entry Tree, Batch Edit Tree
Windows
The window on the right side of the screen allows you modify the contents of a dictionary or application. Each different
window has different functions associated with it. That is, you will see a different menu and toolbar with each different
window.
Part of the toolbar to the left of the help button shown below allows you to switch between different types of windows:
Dictionary, Forms Design, Batch Editing, and Tabulation.
To change the contents on the right side of the screen press the button of the type of window you want to view. If there is
more than one window of that type, the most recent one viewed will be displayed.
If you need to select a particular window, from the Window menu, select the file name you want to view.
CSPro DB is now the default type for newly created data files, and future CSPro features may only be added for the
CSPro DB type. When a file exists, CSPro attempts to determine what kind of file it is and then opens it with the
appropriate data source handler.
Data sources are described using connection strings, which are generally simply file names. There are four types of data
sources:
Text
The text data source is what was always used in CSPro prior to version 7.0. Cases are stored as text in UTF-8 format.
Supporting text data sources allows CSPro to maintain backwards compatibility with existing data files. Text data
sources can use any file extension with the exception of .csdb. If other programs directly use the CSPro data file (as
opposed to using exported data), you may still want to use a text data source as other programs will not be able to read
a CSPro DB file.
CSPro DB
The CSPro DB data source is the default type used for newly created data entry files. Files created using this type have
the extension .csdb and are not text files so they cannot be viewed in Text Viewer. All information about a case is stored
in this file, including the case contents, case labels, notes, partial save statuses, verification information, revision history,
and a copy of the dictionary. The CSPro DB data source simplifies data collection, particularly if conducting a CAPI
survey, as all information about cases is stored in a single place and the revision history allows CSPro, when syncing
data, to only sync data that has changed.
If you use the CSPro DB type, then the data you collect cannot be directly used with previous versions of CSPro or tools
that process text files. You can use the Data Viewer tool to convert data from the CSPro DB type to text and vice versa
in case you need to use a previous version of CSPro or a preexisting tool that only works with text files.
Starting with CSPro 7.4, the CSPro DB file format stores the data captured in a relational database file using SQLite.
See CSPro DB File Format for details.
Encrypted CSPro DB
The Encrypted CSPro DB data source is a CSPro DB file that is encrypted with a password and can only be opened with
that password. Files created using this type have the extension .csdbe. Without the password there is no way to recover
data from the file so it is important to implement a sufficient password management policy.
None
The none data source is not associated with a file and allows you to run programs without needing to associate a
dictionary with a file, usually with the assumption that the file association will be determined during the program and
assigned using the setfile function. Adding cases to a none data source will result in nothing happening and any
attempt to load or write cases to the file will appear successful but nothing will happen. This data source is useful for
menu programs or instances when you do not initially know what an external dictionary should be set to.
See also: Connection String, Data Source Options, CSPro DB File Format
Pa ge 43 of 684 CSPro Genera l Func ona lity
Connection String
A connection string is a text string that details how to open a data source. Generally a connection string is just a file
name. Connection strings are used in PFF files as well as in logic when data sources are specified, such as when using
setfile. A connection string is made up of several components:
The file_name is the file name of the data source. It can be specified as a relative or absolute path. If only the file name
is specified, then the connection string is complete. However, if any properties are specified, then the properties must be
separated from the file name by a pipe | character.
Multiple properties can be specified, with each grouping separated by the ampersand & character. The property name is
listed, followed by an equals = sign, and then the property value is given. Most property values can be defined in human-
readable text, but if the value contains special characters such as & or = characters, it must be percent-encoded.
The property "type" is used by CSPro to specify the type of a data file. Generally this is not necessary to define because
the data file type can be deduced from the file extension, but in one case it is necessary. This, for example, is how the
none data source is specified in a PFF:
InputData=|type=None
Other properties are documented in the topics describing the data sources. An example of one property is the
"password" property for encrypted data files. It can be specified in logic:
Properties List
Properties include:
Property Description
password The password that opens an encrypted data file.
cache Indicates to CSPro that cases should be cached in memory. This may be useful to advanced users who
want to optimize programs that do a lot of case lookups.
An Encrypted CSPro DB file is like a CSPro DB file and can be used in any CSPro application. The only difference is
that, upon opening the file, CSPro requires the specification of a password. There are two ways to specify a password:
Password entry: A dialog box will appear allowing the user to enter the password. The password must be at least four
characters. If the data file does not exist and is being created for the first time, the user must enter the password twice
to ensure that the password is entered correctly. You can reduce the number of times that a user must enter the
password by allowing the password to be cached for a specified duration on the machine. When opening an existing file,
if the password is not correct, the user will be prompted to enter the password again.
Connection string: The password can be specified in a connection string used in a PFF file or by setfile. If the
password is specified in the connection string, then the user will not be prompted to enter a password. If the data file
does not exist and is being created for the first time, the specified password will be used to encrypt the file. When
opening an existing file, if the password is not correct, the opening of the file will fail. For example, the following
connection string opens the file pilot-data.csdbe with the password jiw~d_fpF9.
To encrypt the entire drive where the CSPro application and data files reside, using encryption such as Windows
BitLocker or Android's full-disk encryption.
Not to store passwords in plaintext anywhere (such as in a CSPro logic file or in a PFF file, as is done in the
above image).
To synchronize your data using a secure protocol (such as CSWeb over https). If you encrypt your data file on a
tablet but then transfer the data over http using syncdata, that defeats much of the purpose of encrypting.
Technical Details
Encrypted CSPro DB files are SQLite files encrypted using the SQLite Encryption Extension (SEE) using "AES-256 in
OFB mode." The specified password is not used as the key input to SEE but is instead hashed to create a 256-byte key
that is used to encrypt the file. If allowed, this hash, not the password, is cached on the machine. A fixed salt is used
during the hashing process because there is no suitable place to store a dynamic salt. This means that the same
password will always result in the same encryption key.
What is Unicode?
Unicode is a widely adopted system for representing characters for all languages currently in use. Early computer
programs generally used only one byte to represent a character, which led to a limit in the number of characters that
could be displayed on screen and used in computations. This limit of 256 characters was used effectively by people who
only required English characters, or characters from most European languages, but it could not represent the languages
used by more than a billion people, including most notably, many Asian languages.
The Unicode standard now defines more than 100,000 characters, but many of these characters represent extinct
languages. For that reason, Windows only provides native support for a subset of Unicode characters. In a Unicode
program, a character is represented by two bytes, which allows for up to 65,536 characters.
Whereas in English it is very clear what makes up one character—one keystroke—in other languages it is not as
straightforward. For example, in Chinese, typing two characters, "天津," requires seven keystrokes ("Tianjin"). In Bangla,
the character requires two keystrokes ( + ), which combine to create one character. Both the Chinese and
Bangla examples require four bytes in memory and return a string length of 2.
What is UTF-8?
With each character stored as two bytes in memory, there are several ways to write characters to a disk. One way is to
write two bytes for every character, but this is costly for most users, the bulk of whom only use characters that can be
expressed properly with either the ASCII or ANSI encoding systems. The computer world has settled on using UTF-8, a
variable-length encoding scheme that uses between one and four bytes to represent a character. ASCII characters are
represented using one byte, other ANSI characters are represented using two bytes, and most Asian characters are
represented using three bytes. To identify a file as encoded in UTF-8, a three-byte BOM (byte order mark) is placed at
the beginning of a text file. For example, here is the UTF-8 representation of: "I am François from 法国."
With a UTF-8 encoding, a file's size is not equal to the number of characters in the file. An empty file is three bytes
because of the BOM. (The fileempty function can be used to determine whether a file is actually empty.) An ASCII file
converted to UTF-8 will generally be the size of the original file plus three bytes. An ANSI file that uses non-English
characters will be even larger. The name François takes eight bytes to represent in ANSI but uses nine bytes in UTF-8.
Some text editors, including Notepad, allow a user to change the encoding of a text file. These editors can be used to
convert text files from UTF-8 to ANSI. If using Notepad, open the file and then select Save As. At the bottom of the dialog
box is an option to change the encoding.
If you are distributing files to a broad audience and if your data file does not contain any non-ANSI characters, you might
consider converting it to ANSI for maximum compatibility. To illustrate an example of what might happen with a non-
Unicode program reading a UTF-8 file, imagine a CSPro data file that has two record types, P (population) and H
(housing). The ANSI file might look like this:
P<case id><data>
P<case id><data>
P<case id><data>
H<case id><data>
<BOM>P<case id><data>
P<case id><data>
P<case id><data>
H<case id><data>
An older version of CSPro would regard the first row of data as an invalid record because the BOM was in the place of the
record type. It would thus register the household as having two members instead of three. If any non-ASCII characters
existed in the data file, it would further shift the data and then items would not be read correctly.
If you release a UTF-8-encoded data file widely, you may want to include a documentation file with text similar to the
following:
Synchronization Overview
Synchronization Strategies
When using mobile devices for a survey or census it is important to be able to transfer data collected in the field back to
the head office. This allows for faster processing and analysis as well as better monitoring of the progress of the field
operation.
In CSPro, transferring data between devices in the field and the head office is referred to as synchronization. CSPro
supports data synchronization over the Internet between interviewers' devices in the field and a central server. Interviewers
use CSEntry to collect data on tablets, phones, or laptops and then use the Internet to synchronize the data on their
devices with a server at the head office or in the cloud. In CSPro, the data collection itself may be done entirely offline,
with no Internet connection. When an interviewer is able to connect to the Internet they may synchronize with the server
and transfer any data collected since the last synchronization. Synchronization may be done using Wi-Fi or a mobile
data connection (2G/3G/4G).
Direct synchronization between interviewers and central server over the Internet
In situations where an interviewer rarely or never has access to the Internet, CSPro supports peer-to-peer
synchronization between devices using Bluetooth. Bluetooth synchronization does not require an Internet connection. It
is a direct connection between two devices that are in close proximity. Using Bluetooth, an interviewer may synchronize
their device with a supervisor's device, transferring their data to the supervisor. Later, the supervisor travels to a location
where they are able to connect their device to the Internet in order to synchronize with the central server. In this scenario,
a supervisor might visit multiple interviewers in order to synchronize with them and later upload their data to the server at
the head office.
CSPro supports both one-way and two-way data synchronization. This means that is possible to send data from the
interviewer's device to the server as well as to download data from the server to the interviewer's device. You can
configure your synchronization to only send data to the server, to only receive data from the server, or to both send and
receive data.
It is possible to synchronize both CSPro data files and other non-data files such as application files, images, and text
files. It is possible to update data entry applications in the field by downloading the latest .pen and .pff files from the
server. This way, modifications to the application at the head office can easily be distributed to interviewers in the field. It
is, however, simpler to deploy and update application files on mobile devices using the Deploy Application tool. This tool
packages application files for deployment, uploads them to a server and allows interviewers in the field to install and
update them on a tablet by choosing Add Application from the menu on their mobile device.
Synchronization Server
For synchronization over the Internet, a central server is required. CSPro supports three types of servers: CSWeb,
Dropbox, and FTP.
CSWeb: CSWeb is a web server running the CSPro synchronization server software. It is best for very large
surveys and censuses. The server software is written in PHP and can be run on any web server that has PHP and
the MySQL database software installed. The server can be set up on a computer at the head office that is
connected to the Internet or it may be set up on a hosted website or virtual server in the cloud. Setting and
maintaining a server for CSWeb requires knowledge and experience with web server maintainence and cyber
security. Users that do not have these skills should consider using Dropbox or FTP instead. For more information
about CSWeb servers, see the CSWeb help documentation.
Dropbox: Dropbox is a free cloud based synchronization service. It is ideal for those doing small and medium
survey operations. Dropbox requires no server setup or maintenance and avoids the cost and difficulty of setting
up a CSWeb server.
FTP: CSPro synchronization can be used with a FTP (file transfer protocol) server. For those doing small and
medium survey operations who do not want to use Dropbox and are not able to configure a server to use CSWeb,
FTP can be used for synchronization.
Pa ge 49 of 684 CSPro Genera l Func ona lity
When synchronizing CSPro data files, synchronization is done at the case (questionnaire) level. CSPro keeps track of
which cases have been added or updated and only sends cases that are new or have been modified since the last
synchronization. This significantly reduces the amount of data transferred and therefore reduces bandwidth and the cost
of air time. It also reduces the chance that two interviewers overwrite each other's work by both syncing to the same data
file on the server. As long as two interviewers do not modify the same case at the same time, they may both
synchronize to the same server without overwriting each other's data.
Once interviewers in the field have used data synchronization to upload data to the server, you can use the Data Viewer
tool to download the data. When using CSWeb for data file synchronization, data is stored in a MySQL database on the
server. When using either Dropbox or FTP, data will be stored on the server in a set of files in the directory
CSPro/DataSync. In both cases, the data on the server cannot be opened directly by CSPro. In order to retrieve the data
that has been uploaded to the server, use the Data Viewer's download function. This will download all the data on the
server into a single CSPro data file that can be used by other CSPro applications and tools.
Simple Synchronizations
Basic synchronizations can be set up by specifying a few options documented here.
Advanced Synchronizations
For more advanced synchronization scenarios, CSPro has logic functions that can be used to implement data and file
synchronizations. Using logic, it is possible to synchronize data files, including those associated with external
dictionaries, and non-data files over either the Internet or Bluetooth.
syncconnect is used to connect to the server and initiate the synchronization session.
syncdisconnect is used after synchronizing to end the session.
syncdata is used to synchronize cases in CSPro data files with the server.
syncfile is used to upload/download non-data files to/from the server.
syncparadata is used to synchronize paradata to/from the server.
syncapp is used to update an application on the device from a new version on the server.
syncmessage is used to send to and receive string messages from a Bluetooth server.
While it is possible to use syncfile to synchronize CSPro data files, it is more efficient to use syncdata since it will
only send/receive cases that have been modified whereas syncfile always sends the entire file.
You will generally call syncconnect once to start the session followed by multiple calls to syncdata, syncapp, and
syncfile, followed by a call to syncdisconnect. In the example below, we connect to the server, download the latest
application files and upload cases from the data file associated with the data dictionary SURVEY_DICT.
The client and server devices must be in close physical proximity in order to connect to each other. The server device
must first execute the syncserver function which causes it to wait for connections from nearby devices. Then the client
device executes syncconnect which initiates the connection between the two devices. The client then executes one or
more calls to syncdata and/or other sync functions, followed by a call to syncdisconnect, which ends the session. This
requires that the two operators using the devices coordinate to start the synchronization routines on the two devices at
roughly the same time.
The following example shows the logic that would be written for the client and for the server:
Client Logic:
Server Logic:
Troubleshooting
Many errors that you encounter during synchronization are caused by lack of network connectivity. An easy way to
check if an error is a problem with your CSPro program rather than a general network error is to try to connect to the
sync server in the web browser on the device you are trying to synchronize. For example, if you are trying to synchronize
data on a tablet using Dropbox, try to connect to https://www.dropbox.com in the web browser on your tablet. If you are
unable to connect in the web browser then there is a problem with your connection to the Internet rather than a problem
with your CSPro application.
To get additional details on synchronization errors you can look at the sync.log file. This files lists all synchronization
oprtations along with any errors encountered. This includes errors in uploading/downloading applications in Application
Deployment and data download in Data Viewer. On Android devices you can find the sync.log file in the CSEntry
directory on your tablet or phone. On Windows you can find the file in the directory %AppData%\CSPro. On Windows
you can also locate the sync.log file from the Help menu in CSPro Designer under Troubleshooting. When asking for
technical support with synchronization issues, please attach the sync.log file from your device to your email or forum
post.
See also: Simple Synchronizations, SyncConnect Function, SyncData Function, SyncApp Function, SyncFile
Function, SyncParadata Function, SyncMessage Function, SyncDisconnect Function, SyncServer Function
Paradata
Introduction to Paradata
CSPro can automatically collect paradata for your application. Paradata is defined as "data about the process by which
the survey data [was] collected." In CSPro this includes information about fields and values entered, error messages
encountered, and the state of the data collection device.
The paradata log, with the extension .cslog, is not a typical data file but is instead a SQLite file. Advanced users with an
understanding of SQL can query the file from tools outside of CSPro. Alternatively, there is a tool, the Paradata Viewer,
which allows you to look at the contents of paradata logs. If you have several paradata logs coming from multiple
devices, you can combine these using another tool, the Paradata Concatenator. You can also query the paradata log
from within CSPro using the paradata or sqlquery functions. Reports can be generated from paradata by creating a
templated report. Paradata can be synchronized between devices or servers by using the syncparadata function.
CSPro collects information about many different kinds of events, which are described in more detail in the Paradata
Options page. You may wish to collect information about only a subset of possible events.
1. Open you application and then select Options -> Paradata. This will bring up the Paradata Options dialog.
2. Using that dialog, indicate that you want to collect paradata events and specify any additional options. By default,
when creating a new system-controlled data entry application, paradata will be collected, though with a limited set
of options.
3. The action in step #2 only indicates that CSPro may collect paradata. To actually turn on the collection, you must
specify a filename for the paradata log when running your application. CSPro may suggest a default filename,
which you can modify.
Pa ge 52 of 684 CSPro Genera l Func ona lity
Paradata can be collected for data entry or batch edit applications. The paradata log is flexible and can be used
simultaneously by multiple applications or users.
Uses of Paradata
If you conduct a pilot census or survey, you may find it useful to collect paradata on all devices. By analyzing this data,
you may be able to improve and streamline operations for the actual census or survey. You can use paradata to answer
certain questions about your application, such as:
Which questions in my survey take the longest to collect? Should I to change the wording of the question to make
it simpler?
What out-of-range values are interviewers entering? Should these be valid values?
What error messages are being triggered the most frequently?
Where were interviewers located at a given time?
How long did interviewers work?
Which tablets had the best battery life?
This is just a sampling of questions that can be answered based on the events that are stored in the paradata log. In
addition to storing previously unavailable data, the paradata log also contains all of the information that was traditionally
saved to the listing file, the operator statistics log (.log), and the imputation frequencies listing file (.freq.lst).
See also: Paradata Options, Paradata Function, SyncParadata Function, LogText Function
Question text: In a data entry application, the question text for each field can be defined in multiple languages.
Dictionary labels: Dictionary labels can be specified in multiple languages. When a field's screen text is linked to
the dictionary item, then the form text will also change when a data entry application's language changes.
Messages: Messages displayed or created using functions such as errmsg and maketext can be defined in
multiple languages.
Logic string literals: Using the tr function, string literals can be be specified in multiple languages.
In addition to the above functionality, there are several logic functions for working with multiple language applications,
including getlanguage, setlanguage, and OnChangeLanguage.
The default starting language can also be specified in the application's PFF file using the Language parameter.
Mapping
CSEntry supports displaying interactive maps during data entry on Android. There are two ways to integrate maps into a
data entry application:
Display the case listing on a map (enabled using the mapping options), where each case in the data file is
represented by a marker on the map.
Use the map object in your application logic to display a map with markers and buttons that can be customized
using map functions.
Maps can use Google Maps for base maps if the device has an Internet connection or you can supply an offline map file
to use if there is no connection.
Select the type of application you want to create and press OK. You can create one of the following applications:
You will then be prompted to enter the file name of the application. Enter the file name of the application and press
Create.
The File Associations dialog will be displayed. Specify the names of other files that make up the application, such as the
data dictionary file and any additional external lookup dictionaries.
You may open a data dictionary and make changes to it, even if it already belongs to an application. Be aware that if you
later open an application to which it belongs, CSPro will automatically make necessary adjustments in other files. For
example, if you delete or rename a dictionary item, then later open an application that contains the data dictionary, any
corresponding fields on forms will be deleted.
You may open a forms file and make changes to it, even if it already belongs to an application. However, you will not
Pa ge 55 of 684 How To ...
have access to the associated Logic file and you will not be able to run it.
Any changes you make to applications and files are not made permanent until you save the file or application that you
modified.
If you would like to view both the name and the label for each item in the tree, select the Append Labels to Names in
Tree option.
Screen
To toggle between trees on left (i.e., split screen) and full screen, open the View menu and select Full Screen, or press
Ctrl+J. A check mark appears next to the Full Screen menu item when the display is in full screen mode. This setting
affects all applications.
To view these old applications correctly, you can manually change the font used to render the text in the dictionary grid,
the dictionary and form tree, and, if the font is monospaced, the application's logic.
In this example, an Arial Armenian font has been selected, at an increased zoom rate. After this selection, the dictionary
editor displays the contents better:
Some users may find that the CSPro Designer renders the Unicode characters for their language at a very small size. It
is possible to change the zoom factor while leaving the font name blank, thus making the characters easier to read.
Change Windows
• Cascade
Use this command to arrange multiple opened windows in an overlapping fashion.
• Tile Top-to-Bottom
Use this command to arrange multiple opened windows one above the other in a non overlapping fashion.
• Tile Side-by-Side
Use this command to arrange multiple opened windows one beside the other in a non-overlapping fashion.
• 1,2,…
View displays a list of currently open files at the bottom of the Window menu. A check mark appears in front of the
name of the file in the active window. Activate a window by choosing the name of its file from this list.
To insert a file, from the File menu, select Add File. Select an existing file or enter the name of the file to be created.
To print part of a document select the text you want to print then click on the toolbar; or from the File menu, select
"Print"; or press Ctrl+P.
To preview the printing, click on the tool bar; or from the File menu, select "Print Preview."
Save an Application
Click on the toolbar; or, from the File menu, select Save; or press Ctrl+S.
The file associated with the current frame (right side of the screen) will be saved. If that file belongs to an application that
is open, the entire application will be saved. If the file belongs to more than one application, CSPro will ask you which
one you want to save. In that case, select the file or files you wish to close or save and click on OK.
To choose all of the files, click on the Select All button. To choose several files, hold down the Ctrl key and click on
files you wish to select.
Close an Application
From the File menu, select Close.
The file associated with the current frame (right side of the screen) will be closed. If that file belongs to an application that
is open, the entire application will be closed. If the file belongs to more than one application, CSPro will ask you which
one you want to close. In that case, select the file or files you wish to close or save and click on "OK."
To choose all of the files, click on the Select All button. To choose several files, hold down the Ctrl key and click on
files you wish to select.
Make sure the application is showing in the current frame (right side of the screen).
Pa ge 58 of 684 How To ...
From the File menu, select Save As.
Enter the file name for the new application in the file open dialog box.
Using the File Associations Dialog specify the names of other files in the application.
When the Save As is complete you will be editing the new dictionary.
Only the name of the input dictionary must be supplied. A default input dictionary file name is given, but it may be
changed. Other external dictionary file names are specified as needed.
If any of the files names specified already exist, those files will be attached to the application. If any of the file names
specified are new, those files will be created.
Save As Application
When an application is saved under a new name, a files dialog box like the one below is displayed.
Names for all the files requested must be supplied. Default file names are given, but they may be changed.
Pack an Application
There are instances when it is helpful to collect all the files in an application to:
Move them to another computer (for example to move data entry applications to all your data entry computers).
Give the application to a colleague to use.
Send the application when requesting help.
There is a CSPro tool to perform this function. This tool copies all the files in the application into a ZIP file. To use this
tool go to the Tools menu and select Pack Application.
CSPro Settings
The CSPro Settings dialog is accessed from the CSPro Designer or from CSEntry by selecting CSPro Settings from the
File menu. The dialog has the following options:
Saved Credentials: By selecting Clear Credentials, any cached credentials saved to secure storage will be removed.
These credentials include access tokens to CSWeb servers and Dropbox, passwords to FTP servers, and hashed
password keys used to open encrypted data files.
Get Help
To contact the CSPro development team with comments, questions, or to report problems, please contact:
International Programs
Population Division
U.S. Census Bureau
4600 Silver Hill Road
Washington, DC 20233
Phone: +1 301-763-1451
Support email: [email protected]
Official website: www.census.gov/data/software/cspro.html
CSPro Users forum: www.csprousers.org/forum
When you contact us, please mention that you are using CSPro 7.5.0.
Help Documentation
The help documentation that you are currently reading is created using a custom tool, Help Generator. The tool, along
with the documentation, is hosted on GitHub:
https://github.com/CSProDevelopment/helps
The repository has all the tools needed to develop the help documentation. The Help Generator tool is built using Visual
Studio. Microsoft distributes a free version, the community edition, that you can use to create the tool. If you do not have
Visual Studio, but want to collaborate on help documentation, email [email protected] and you will be sent the
Help Generator executable and a guide on modifying the help documentation.
CSPro Examples
The CSPro Examples folder is also hosted on GitHub:
https://github.com/CSProDevelopment/examples
If you want to modify any existing example, or want to add a new example that you think will be useful to the broader
CSPro community, you can collaborate on this project.
Each dictionary will allow you to give text labels for all levels, records, items, and value sets in the file. It also allows you
to describe the organization of each record in the file and the characteristics of each item in the record: names, position
in the data record, type of data, length, number of decimal places, valid values, and other documentation.
Before you convert the information from a questionnaire to computer-readable form, you usually create a data dictionary.
You can also create a data dictionary for an existing data file if you have a description of its contents showing the
location of each item.
CSPro requires that a Data Dictionary be created for each different file being used.
Organization
Dictionary Concepts
Levels
Records
Items
Value Sets
Values
Relations
Data Dictionary Application
How to ...
Questionnaire: Section
Any type of questionnaire will have an identification section that uniquely identifies the form, as well as one or more
sections on different topics. Some sections may occur once per questionnaire while other sections are repeated many
times. For example, in a typical housing and population census, a questionnaire would contain a section for the housing
questions, and a section for the population questions. The questions in the housing section will be answered once per
questionnaire [household], while the questions on the population section will be answered by every person in the
household. If the census is collecting information on vacant housing units then the questions on the population section
will not be answered. In a school survey, for example, the questionnaire would have an identification section and only one
section to collect basic information for each student. The questionnaires for the different students are not related.
Questionnaire: Identification
The identification section identifies the questionnaire, usually with numeric geographic codes. The combination of
identification codes (such as province, district, village, household) on a questionnaire uniquely identifies the form. These
are the codes you would need to locate a specific questionnaire.
Questionnaire: Questions
The basic element of the questionnaire is the question. Each section of the questionnaire contains a set of one or more
questions being asked for this census or survey.
Questionnaire: Responses
The valid options in response to a question are usually listed in the questionnaire. Some responses are quantitative,
such as "size of farm" or "age of person," and some are qualitative, such as "relationship to head of household" or "crop
grown." Responses can be numeric or alphanumeric. Most descriptive responses are equated to numeric codes that are
placed on the questionnaire. However, some descriptive responses remain as alphabetic text.
Red text refers to the record type. In our example, 1 is a household record, and 2 is a
population record.
Blue text refers to the (Id Items). Note that the numbers are unique for each
questionnaire: the 101001 household contains three people whereas the 101002
household contains two people.
Black text describes the individual data items for each specific record.
A questionnaire designed for an agricultural census might consist of the following records:
- one farm household record
- multiple (one or more) crop records
- multiple (one or more) farm worker records
A questionnaire for a reproductive health survey might consist of the following records:
- one record for data on the woman
- multiple (zero or more) children-ever-born records
- one contraceptive use record
- one immunization record
Dictionary Hierarchy
A data dictionary is structured in a hierarchical order. The top hierarchy is the case, followed by the level, then record.
Case
A case is the primary unit of data in the data file. A case usually corresponds to a questionnaire. However, some
complex applications might have a hierarchical set of questionnaires, or many levels. For example, the main
questionnaire may consist of a household roster and other household information, and there may be a separate
questionnaire for each woman in the household. The data entry application may then contain two levels — one for the
household and one for each woman in the household. The set of forms corresponding to the household make up level
one. The set of forms corresponding to each woman make up level two. Each case would consist of two type of
questionnaires: a single level one and a variable number of occurrences for level two.
Level
A level is a type of questionnaire. By default, all new dictionaries have one level. This is normally sufficient to describe,
for example, a population or agriculture census. However, if you have a hierarchically-structured set of questionnaires,
you will probably need to use additional levels. A level can have many records corresponding to different record types.
Record
The dictionary tree displays either the labels or names of dictionary elements. You can press Ctrl+T or from the View
menu, select Names in Trees at any time to toggle between labels and names.
Names
Names identify the dictionary and its elements when they are referenced in CSPro procedures. Names are required for
the dictionary and most of its elements. Names consist of upper case letters (A-Z), digits (0-9), and embedded
underscores (_). The first character of a name must be a letter; the last character cannot be an underscore.
Names can be any length but must not be CSPro reserved words. Names cannot be duplicated within a dictionary.
However, the same name can be used in different dictionaries, and in some cases, it maybe desirable to do so.
The dictionary tree displays either the labels or names of dictionary elements. You can press Ctrl+T or from the View
menu, select Names in Trees at any time to toggle between labels and names.
Notes
Notes document the dictionary and its elements. The designer may create notes for the dictionary as a whole and/or any
of its elements: levels, records, items, value sets, values. Notes may contain any printable character and spaces, and
can be up to 65,000 characters in length.
In this example, you would want each child to be associated with its mother, rather than the household record. If you
were to structure your dictionary in a single level, there would be no way to easily identify which mother and child(ren)
belonged together during data entry or during tabulation. To accomplish this, you would want to design your dictionary
with three levels, each level containing a single type of record, as follows:
Level 1
Household Record
Level 2
Woman of Reproductive Age Record
Child Record (one for each child)
In the Forms Designer you will be required to place each record's data on different forms. However, this facilitates the
desired data entry behavior. You will first be asked to enter information from Level 1, i.e., the household. After completing
the household form(s), you will then enter information for the first woman and her children (Level 2). When data entry is
finished for this woman record the keyer will advance to the child record, and enter information for each child (if any) for
this women.
If there are no further children (or no children at all for this female), finish the level by pressing F12 (EndLevel Occurrence)
and resume entering information for the second woman and her children. Continue in this manner until all women and
their children have been entered for the household—when finished, press Ctrl+F12 (EndLevel) from the Woman Form to
complete data entry for this case.
Keep in mind that, when using more than one level, there are implications with respect to the order of executing logic in a
data entry application or in a batch edit application.
See also: Dictionary Hierarchy, Level Properties, Add or Modify Levels
Level Properties
Level properties are visible when the dictionary has been selected in the tree tab. To reflect your intended usage for a
Pa ge 68 of 684 Levels
level we suggest you change the level properties pressing Ctrl+M, which will activate the appropriate entry in the right-
hand screen.
Property Meaning
Label A descriptive text label which identifies this level.
Name The name given to this level for use in the CSPro language procedures.
Pa ge 69 of 684 Levels
Records
Record Description
A record is a group of related data items. In the process of creating a record to define (a portion of) the questionnaire,
you will also be defining the physical layout of the data file. For example, suppose your (very simple) population record
looks like the following (only item name, starting position, and length properties are shown; starting positions show that
ID items occupy the first 9 positions in the record):
If an operator had keyed a questionnaire for a 35-year-old female (Sex = 2) head of household (Relationship = 1), you
would see a line in the data file, corresponding to the population record defined above:
12345678901234567890 (position)
--------------------
1235 (line in data file)
In deciding on a file structure, there is often the choice of defining a record type which occurs once within a questionnaire
but contains repeating sets of data, or to define a record type which occurs multiple times within a questionnaire, each
with a single occurrence of the data. The application designer should take into consideration the amount of information
that recurs and the probable number of occurrences.
A common example in a Housing and Population Census is information about deaths in the household during the 12
months prior to the census. If this information (usually sex and age at death of the deceased) is collected during
enumeration, the expectation is that 95% of households will have no more than one, or at most two, deaths during the
previous 12 months. With this volume of information, it would be practical to have one record type that occurs once within
the questionnaire and allows for repeating occurrences of the data, since it is unlikely that even the maximum number of
occurrences, multiplied by the number of positions occupied by each occurrence, will exceed the length of the already-
existing household and population records.
However, in the case of an agricultural survey, a section on crops may include questions about acreage planted, yields,
etc., whose cumulative length for each crop mentioned may be quite large in relation to other records in the file. In such a
case, it would be more practical to define a record type that occurs multiple times within the questionnaire. Within each
occurrence of the record would be found the information relating to one specific crop.
Record Properties
You can view a record's properties by selecting the questionnaire to which it belongs (via the dictionary tree tab). You
may change the default record properties by positioning the cursor in the right window and pressing Ctrl+M.
Property Meaning
Label A descriptive text label which identifies this record.
Name The name given to this record for use in the CSPro language procedures.
Type Value The record type value (code) that identifies this kind of record.
Required Must a questionnaire contain this kind of record? (Yes/No)
Max The maximum number of times this type of record can appear in any one questionnaire.
Pa ge 70 of 684 Records
See also: Dictionary Hierarchy, Record Description, Add or Modify Records
Record Type
The Record Type is an alphanumeric item that uniquely identifies a dictionary record, and therefore helps describe your
data file's organization.
If your dictionary contains more than one record, CSPro needs to be able to differentiate one record from another in the
data file. The Record Type provides the means for doing this.
For example, a typical Housing and Population census data file would most likely have a housing record (describing
details of the living unit) and a person record (to describe details on each individual in the household). You could assign a
Record Type of '1' to the Housing record and '2' to the Person record to distinguish between them.
If your dictionary contains only one record, you do not need to use a Record Type. Therefore, you can 'reclaim' the
location that was set aside for the Record Type as follows:
1. Select the (ID Items) set or the one-and-only record your dictionary contains from the dictionary tree.
2. In the view on the right, you'll notice the first line is (record type). Only three values are used, Starting Position,
Length, and Data Type. Of these three values, you can only modify the start position and length. Change the
length to 0. This will effectively "remove" the record type. (You can always reinstate it later by resetting the start
position and length to non-zero values).
Similarly, if you would like to modify the length of the Record Type, proceed as above.
The record type value is always alphanumeric. Upper- and lowercase letters are distinct Record Type values (i.e., 'A' is
not the same as 'a'). Blank is also a valid Record Type value.
Required Record
One of a record's properties is whether or not the record is required; the options are Yes or No. If a record is required
(Yes), it means that for a given questionnaire, there must be at least one occurrence of this record. If there is not at least
one occurrence of this record, the questionnaire will not be complete and the system will issue an error message to
inform the keyer. If the record is not required (No), the questionnaire may or may not contain an occurrence of this
record. The questionnaire can be considered complete without an occurrence of this record.
Suppose you are designing a dictionary for a census. You'll probably have at least two types of records: one for the
household, and one for each person in that household. You can have four scenarios:
If you allow vacant housing units (i.e., you collect information on unoccupied housing units), then the household
record is required and the person record is not required.
If you allow homeless people, then the household record is not required and the person record is required.
If you allow homeless persons and vacant housing units, then neither the housing record nor the population record
will be required record types, but because the two conditions will never occur simultaneously, you will never have
a questionnaire without one of the other type of record.
If you allow neither homeless persons nor vacant housing units, then both the housing record and the population
record will be required record types. This means that a valid questionnaire will always have one housing record
and at least one population record.
Maximum Number
Pa ge 71 of 684 Records
This record property specifies, for the given record, the maximum number of occurrences of that record allowed in one
questionnaire.
For example, suppose you are designing a dictionary for a census. You will probably have at least two types of records:
one for the household, and one for each person in that household. There should be only one occurrence of the household
record, but for the person record you will of course need more than one occurrence, as there will likely be more than one
person in a household. Thus, the maximum for the person record could be 25, if limiting yourself to a family unit, or
larger, if enumerating group facilities (military barracks, hospitals, mental institutions, etc.).
The maximum number of occurrences that may be specified for any record is 9,999. However, for greater program
efficiency we recommend that you never have this many occurrences and that you keep the maximum to the lowest
value that is appropriate for your particular application.
Pa ge 72 of 684 Records
Items
Item Description
An item describes the response to a question or helps identify the questionnaire. The item is the most basic element of
a questionnaire: age, income, and crop code are all examples of items. Related items should be placed in the same
record. And, just like records and levels, data items possess properties (such as a unique name, label, etc).
Identification Items
Identification items (i.e., ID items) are those data items that uniquely identify the questionnaire and define hierarchical
levels. They are usually geographic items, such as Province, District, or Enumeration Area, or other unique identification,
such as Survey ID Number. These data will appear on every record in a data file, as they are "common" to all of the
records. The maximum number of ID items is fifteen.
If you are using absolute mode, you will typically want to structure your dictionary so that the ID items begin in column
2; in this way they will precede a record's data items (column 1 is usually reserved for the record type identifier). If you
are using relative mode, you have no choice but to place the ID Items first; the system assigns a consecutive position in
the order in which the items are created.
For a pictorial representation, open a dictionary in CSPro and press Ctrl+L (L=layout). Press Ctrl+L again to toggle this
view off.
See also: Item Description, Sub-Items, Item Properties, Add or Modify Items
Subitems
This item property specifies whether the data is an Item or a Subitem.
Subitems allow items to be broken up into smaller pieces, or across broad categories. In this respect, they let you
redefine data items and refer to the same data field in several different ways. The start position of a subitem must be
within its parent item (the previous item).
One useful application of subitems involves date and time fields. A date item, for example, could be referred to as a
single 8-digit entity: DDMMYYYY. However, this does not allow you to easily manipulate or refer to a portion of the date
(such as the day, month, or year itself). Suppose you had the following definition for date (for demonstrative purposes,
not all item properties are being shown):
Item Label Item Type Starting_Position Len
Date of birth Item 20 8
To redefine this item into subitems, you only need to add the following subitems:
Item Label Item Type Starting_Position Len
Day of birth Subitem 20 2
Month of birth Subitem 22 2
Year of birth Subitem 24 4
Another reason for using subitems is to make data references available across larger categories. Censuses and surveys
Pa ge 73 of 684 Items
often have items of three or four digits in length representing categories such as industry, occupation, or ethnicity. For
occupation codes, the full value refers to a very detailed occupation, such as bus driver. The first digit alone refers to the
'major' division, such as 'public service'. The first two digits together refer to a more detailed 'major' division, such as
'public transportation'. It may be useful to test the ranges with the CSPro language at the item level. In tabulation
applications, tables can be made at the major (1- or 2-digit) or minor (3- or 4-digit) divisions. The following example could
represent part of an economic survey:
Item Label Item Type Starting_Position Len
Occupation Item 45 4
Occupation, Major Subitem 45 1
Occupation, Sub-major Subitem 45 2
Occupation, Minor Subitem 45 3
In IMPS 3.1, the predecessor to CSPro, it was very common to use subitems to redefine data items. This is now more
easily accomplished with value sets.
NOTE: Identification items cannot have subitems.
See also: Item Description, Identification Items, Item Properties, Add or Modify Items
Item Properties
You can view an item's properties by selecting the record to which it belongs (via the dictionary tree tab). When creating
an item, the following fields must be set:
Property Meaning
Label A descriptive text label that identifies the item. It is used as default field
text in data entry forms and in default titles in tabulation.
Name The name given to this item for use in CSPro language procedures.
Start Indicates the starting position of the item within the record
Len Indicates the length of the data item (i.e., the number of characters
necessary to represent the values for the item).
Data Type Indicates the type of data (numeric or alphanumeric) that will be found in
the item.
Item Type Indicates whether the item is or is not subordinate to, or part of, another
item. If the item is part of another item, it is considered a "subitem". If
not, it is identified as an "item". Identification items cannot have
subitems.
Occ The number of times this item will repeat within the record. The default
value is "1". Identification items cannot have multiple occurrences.
Dec The number of decimal places (if any) in the item. The default number of
decimals is "0". Identification items cannot have decimals.
Dec Char This specifies whether the item should be stored in the data file with an
explicit decimal character. This applies only to items or subitems which
have been defined with the "Dec" property grater than zero (i.e., Dec >=
1).
Zero Fill This item property states whether the numeric data item should contain
leading zeros or blanks.
Press the Esc key to quit modifying without making changes. Press Ctrl+Enter to finish making changes. Use undo if
Pa ge 74 of 684 Items
you completed the modification incorrectly. There is no limit on the number of items within a record.
See also: Item Description, Sub-Items, Identification Items, Add or Modify Items
Starting Position
This item property indicates the starting location of a data item. In conjunction with the Length property, it specifies the
location of the item in a record. In absolute positioning mode, you cannot give a starting position that will cause the item
to overlap with another item.
The start position of a subitem must be within its parent item (the previous item).
Length
This item property indicates the total length of the data item (i.e., the number of characters necessary to represent the
values for the item). In conjunction with the Start property, it specifies the location of the item in a record. In absolute
positioning mode, you cannot give a length that will cause the item to overlap with another item.
The maximum length of a numeric item is 15 digits. The maximum length of an alpha item is 255 characters.
Data Type
This item property specifies the type of data (numeric or alphanumeric) that will be found in the item. The default item
type is numeric.
Numeric items can contain numbers or blanks, and they may be negative or positive in value. Numeric values will
be right-justified and, if requested, zero-filled.
Alphanumeric items can contain any combination of characters, letters, numeric digits, blanks, or special
characters. These values will be left-justified and are blank-filled, whether or not zero-fill has been selected.
Declaring M or F for SEX is an example of an alphanumeric value.
Some responses are quantitative, such as size of farm, and some are qualitative, such as relationship to head of
household. Most descriptive responses, such as "head of household," are given numeric codes that are placed on the
questionnaire. However, some descriptive responses remain as alphabetic text.
Numeric responses can be discrete values or continuous values. An example of a discrete value is sex, 1 (male) or 2
(female). An example of a continuous value is yearly income, which can range from zero to a value limited only by the
number of digits permitted for the response. A discrete value may be used to represent a grouping of continuous values.
For example, when asking income, one may be asked to select from a choice of ranges of incomes rather than specify
the exact income. Thus, the possible responses to the income question could, for example, be a code between 1 and
10.
Occurrences
This item property defines the number of consecutive repetitions of the item in the data record. The dictionary will reserve
space equal to the product of the length of the item times the declared number of occurrences for the item.
For example: A census collects information on births and deaths, and each questionnaire can list the ages of up to a
dozen household members who died during the past year. By defining an item "Age at death" with a length of 2 digits
Pa ge 75 of 684 Items
and 12 occurrences, the dictionary will reserve a location 24 characters in length for this item.
Be aware that if fewer than 12 people died in the household, then the unused portion of this item will be blank. If you have
several items that use occurrences and they are often unused, you are increasing the size of your data file. Therefore,
you should always specify the number of occurrences with care.
If an item has multiple occurrences, then its subitems may not have multiple occurrences. Conversely, if a subitem has
multiple occurrences, then its parent item may not have multiple occurrences.
Decimal Places
This item property lets you specify how many digits of the numeric item represent the decimal portion of the item. CSPro
does not expect the decimal point to be in the data file; if your data file does contain the decimal point, you will need to
set the decimal character property. Therefore, the length of the item is not affected by the number of decimal places.
For example: Suppose you had two data files, each containing an item in the format "##.##". One file has an implied
decimal point, the other file physically contains the decimal point. Here are the two ways to define the item (using 12.75
as an example)
Length Dec Dec Char
4 2 No (decimal implied; number would appear as "1275")
5 2 Yes (decimal present; number would appear as "12.75")
Decimal Character
This item property applies to those numbers specified as decimal. If the number is a decimal value, this states whether
or not the decimal point is present in the data file. Your valid choices are:
• Yes the data file contains a decimal point for this item, or
• No the data file does not contain a decimal point for this item.
Note that if your item does not have a "numeric" data type, the Data Dictionary will not allow any value other than No.
You can set this option for all items by clicking on "DecChar Default 'Yes'" on the Option menu.
See also: Item Properties
Zero Fill
This item property states whether the numeric data item should contain leading zeros or blanks.
For example: During data entry a numeric item with a length of 3 is encountered. A value of "92" was keyed. How will
this value be stored in the data file?
If Zero Fill had been set to Yes, the value would appear as: "092"
If Zero Fill has been set to No, the value would appear as: " 92"
Pa ge 76 of 684 Items
You can set this option for all items by clicking on ZeroFill Default 'Yes' on the Options menu.
Pa ge 77 of 684 Items
Value Sets
Value Sets Description
Value sets let you specify one or more groups of values for a data item or subitem. When using tabulation applications,
you will want to choose value set labels to tabulate, as it will give you more descriptive results. The resulting tables will
contain row and column labels (or region labels) that correspond to the value labels (or numeric distributions, if no value
label is present). In a batch edit applications, the use of value sets can help you when using the vset option to the
impute function.
For example, suppose you have a survey that needs to classify people's ages three different ways: by discrete value, by
5-year cohorts, or by category, such as "Child," "Adult," etc. This is easily done by adding value sets for the AGE data
item:
The AGE item now has three defined value sets: AGE, AGE_5YRS, and AGE_CATEGORY. The first value set defines
the acceptable range for data entry, while the second and third value sets give a breakdown as you might want to see the
data tabulated.
The value set will always be added to the end of the item's value set listings. If you add to the wrong place, press the Esc
key to stop the add. Use undo if you added at the wrong place.
Property Meaning
Value Set Label A descriptive text label for a collection of categories of an item. This is used in tabulation
applications in table titles.
To associate an image with a value, click on the "..." button at the end of the value row and then select the image.
Images in five file formats are supported: .jpg, .gif, .bmp, .png, and .tif.
If you are deploying a data entry application to another computer or an mobile device, you must remember to distribute
the images along with the CSEntry data entry application. If you want CSPro to automatically bundle the images with the
data entry .pen file, you may want to put the images in a special folder and then denote that folder as a resource folder.
Zeroes should be avoided in assigning codes to identification items that identify geographic areas, because zeroes are
used in CSPro to describe summarized geographic levels. If zeroes are already in the data, they can be recoded to other
values using the CSPro logic.
You can assign a negative number to a value or the starting and/or ending value of a range. Negative numbers have a
leading minus (-) sign. Positive numbers have no sign. The minus sign will be displayed in the data file immediately to
the left of the value. If the item is "zero-fill," the minus sign will be displayed in the left-most position.
New values will always be added to the end of the existing value set listings. If you add to the wrong place, press the
Esc key to stop the add. Use undo if you added at the wrong place.
Value Properties
Property Meaning
Value Label The descriptive text for a single value or range of values. This label is used when showing value sets or by
the tabulation applications module when creating column headings and stubs.
From This is the single value, or starting value of a range associated with the value label. To add multiple
ranges to a value, enter one or more spaces as the value label on the next value(s), the values which
follow become part of the previous value. Multiple ranges are indicated by the lack of a "notes" box at
the beginning of the value line.
To This value is the upper limit of the range of values being defined. It must always be greater than the
"From" value on the same line. Where only a single value is associated with the "value label," the "to"
value may be blank.
Special A numeric data item can be assigned one of four special values in the data dictionary. These are:
missing, refused, notappl, and default.
Image An image that is displayed along with the value label when running a CAPI application.
Pa ge 80 of 684 Va lues
Relations
Relation Description
Relations provide a way of linking one multiple record or item to one or more multiple records or items. For example,
suppose a questionnaire contains two record types, child records and mother records. Each child record contains a data
item that gives the sequence number of the mother record of the child's mother. A relation can be defined which links
child records to mother records so that when data items from a child record are processes, the corresponding mother
record data items are available for processing. Relations work much like database joins.
Relations have one primary multiple record or item. Each instance of the primary element in a case is processed one at
a time. A relation has one or more secondary records or items. The corresponding secondary elements are linked to the
primary element during processing.
Type Description
Occurrence to Occurrence Corresponding occurrences of the primary record or item and secondary record or item
are linked, that is first occurrences are linked, second occurrences are linked and so on.
Item to Occurrence The value of an item on the primary record is a pointer to the occurrence of the
secondary record.
Occurrence to Item The value of an item on the secondary record is a pointer to the occurrence of the
primary record.
Item to Item The value of an item on the primary record is compared to the value of an item on the
secondary record. If the values are equal, the records are linked.
Relations can be used in for statements in batch programs and in the Export Data tool.
Relation Properties
Property Meaning
Relation Name The name of this item for use in for statements in the CSPro language.
Primary This is the name of multiply occurring record or item. Items in the secondary are linked to items in
the primary.
Primary Link This is either (occ) or the name of an item. If the primary is a record, then this is the name of an
item within the primary record. If the primary is an item, then this is a the name of a subitem within
the primary item.
Secondary This is the name of multiply occurring record or item. Items within the secondary are linked to item
in the primary. The secondary cannot be the same as the primary.
Secondary Link This is either (occ) or the name of an item. If the secondary is a record, then this is the name of an
item within the secondary record. If the secondary is an item, then this is a the name of a subitem
within the secondary item.
• Click on the toolbar, or from the File menu, select New; or press Ctrl+N
CSPro created a dictionary ("Poverty") with one level ("Poverty questionnaire"), and that level contains a set of ID Items ("
(Id Items)") and one record ("Poverty record").
The screen on the right displays detailed information for the highlighted object in the left-hand screen. For example, if in
the left-hand screen the focus [cursor or highlight] is on the first line ( ), the right-hand screen will display information
about the dictionary [file] as a whole. If the focus in the left-hand screen is moved to the second line ( ) questionnaire
or case level, the right-hand screen will display information about the questionnaire [case], which is the basic element of
the file. As the focus is moved down the dictionary tree, the right-hand screen changes to reflect the different items of
interest at each successive level.
The tabs at the bottom of the right-hand display are marked "Files" and "Dicts". Clicking on either of these tabs will bring
Pa ge 83 of 684 Da ta Dic ona ry Applica on
up the appropriate tree. In addition, each tree [File Tree and Dictionary Tree] can be toggled [using Ctrl + T] between
views. In the case of the File Tree, the views will show either the internal or external names of the files; in the case of the
Dictionary Tree, the views will show either the name or the label of each entity in the file.
The first thing we suggest you do is to change the level properties to reflect your intended usage for them. Next, change
the record properties [e.g., maximum number of records for each type, whether a given record type is required or
optional, etc.] and create any needed additional records for the level. Once you create a record, you can start creating
the ID and data item value sets and values.
If this structure is sufficient for your needs, you can begin adding identification items and data items to the set(s) and
each record you created. Remember that identification items defined in the ID Items set will appear on each record in
the current level, as well as each record in lower levels.
See also: Add or Modify Levels, Add or Modify Records
• Level:
This is the second-tier tree node and identifies the level or questionnaire or case
• Field:
Represent the individual items. If the item has more than one value set, then the field will show the symbol "+"
indicating that the item can be expanded. Value sets are always designated with the symbol . If the item contains
sub-items, these will be designated with the symbol .
Dictionary Types
Every dictionary associated with an application has a type value that indicates how it is being used. For the primary
dictionary (i.e., the one upon which your application was created), this will be your main dictionary. Other dictionaries
(ones that are inserted either directly or secondarily via a forms file) can have additional properties, as explained below.
To see your dictionary's type, go to the Files tab, right-click on the dictionary in question, and select Dictionary Type.
You will then see the following four choices (which may or may not be active, depending on their use):
Type Description
Main This is the principal dictionary upon which the application was built. You cannot give the dictionary
another status as it will always be the primary dictionary for the application.
External When you add a dictionary to an application, its type can either be "external" or "working." If it is an
external dictionary, it must have an associated data file. When external dictionary variables are
used in an application, their default values will be Not Applicable (Notappl).
Working When you add a dictionary to an application, its type can either be "external" or "working." If it is a
working dictionary, it does not need an associated data file. When working dictionary variables are
used in an application, their default values will be blank (if the variable type is alphanumeric) or zero
(if the variable type is numeric).
Special Output This option is provided for backward compatibility with ISSA Batch Edit Applications. Only non-
primary dictionaries used in Batch Edit Applications can have a "special output" type. Refer to the
ISSA Manual for further instruction.
To rename the item, select "Rename" and then choose the new item name from the list presented. To delete the item,
select "Delete."
You may open a data dictionary and make changes to it, even if it already belongs to an application. Be aware that if you
later open an application to which it belongs, CSPro will automatically make necessary adjustments in other files. For
example, if you delete or rename a dictionary item, then later open an application that contains the data dictionary, any
corresponding fields on forms will be deleted.
You may open a forms file and make changes to it, even if it already belongs to an application. However, you will not
have access to the associated Logic file and you will not be able to run it.
Any changes you make to applications and files are not made permanent until you save the file or application that you
modified.
After pasting the link the value set will turn pink as a way of indicating that the value set is shared across two or more
items. Modify or add values to the value set as you would for a regular value set.
Value Set Label: A descriptive text label for a collection of categories of an item. Used by the Tabulation module to
select tabulation categories. This cannot be left blank.
Value Set Name: The name of this value for use in the CSPro language procedures.
From: The smallest value in the value set.
To: The largest value in the value set.
Interval: The size of the interval in each value range generated. For example, if 5 year age groups are desired, then the
interval is 5. The interval must be greater than 0. If the interval is the smallest positive number that can be contained in
the data item, then single values are produced.
Value Label Template: A model for descriptive text of each value or value range that is generated. The characters %s
in the template are replaced by the from and to values for each value range. All other characters are output as entered.
If you have made a mistake and want to undo it, press on the toolbar; or from the Edit menu, select Undo; or press
Ctrl+Z. CSPro will try to restore your forms to the state previous to last change you made. To undo the next-to-last
change, press the Undo button again.
Sometimes you may undo several changes and realize you have gone too far back. Press on the toolbar; or from the
Edit menu, select Redo; or press Ctrl+Y. Redo is an "undo" of an undo.
You can use cut and copy to move/copy your selection elsewhere within the dictionary, or to use in another open
dictionary. You can delete multiple records, items, or values at the same time. Undo can be a useful feature when
dealing with block operations.
Find What
Enter all or part of the text string to search for. Text used in previous searches is available by clicking on the down arrow
and selecting from the dropdown list.
Next
Find the next occurrence of the text string, starting from the last one found. If it finds the item, it will be brought into
focus in the view; otherwise you will receive a notification that it could not be found.
Prev
Find the previous text string starting from the last one found.
Match Case
If this option is checked, the string will match only if the letters are the same case (upper or lower) as in the string you
entered as the search key. If this option is not checked, the search will ignore case.
Close
Close the Find dialog box.
To convert subitems back to items, delete the item. When asked if you wish to "Delete subitems too?" answer No.
Occurrence Labels
Occurrence labels can be attached to a repeating item or a repeating roster by right-clicking on the dictionary element
and selecting Occurrence Labels. Alternatively, that option can be found on the Edit menu. To edit an occurrence label,
simply click on the appropriate row and add the label.
When the Save As is complete you will be editing the new dictionary.
When adding items to your data entry forms, you can specify that the field labels should be directly linked to the
dictionary label. This default setting ensures that the field labels change when the user changes the language during data
entry.
For your application to fully take advantage of multiple language dictionaries, you must define the languages that you
would like to use in the CAPI language section. When running your data entry program, CSEntry will only list languages
defined in the CAPI section. In this way, changing a language will change both the CAPI language text as well as the
dictionary labels used.
Dictionary Macros
There are several specialized tasks that you can perform on a dictionary using the dictionary macros functionality. To
access the options, either right-click on a dictionary in the tree and select Dictionary Macros or, with the dictionary
editor active, select Dictionary Macros from the Edit menu. A dialog box will appear with the following options:
This can be useful if you want to edit the names, labels, and lengths in a different software tool, such as Microsoft Excel.
The information is copied in a tab-delimited format.
If you edit the data in an outside program and want to bring the changes back to CSPro, you can use the Paste All
functionality. CSPro will check that what is in the clipboard matches the contents of the dictionary, so you cannot add or
delete items outside of CSPro.
You may find it useful to setup your dictionary parameters in CSPro and then have a subject matter specialist or clerk
enter the labels in another software, particularly if working in multiple languages.
Value Sets
You can Copy All the names and labels to the clipboard for all value sets in your dictionary. As with dictionary names
and labels, you can use this feature to edit the value sets in a different software tool. If your dictionary has multiple
languages, the All Languages option allows you to copy all of the labels, instead of just the primary label, to the
clipboard. If you select Value Set Images, the file names of value set images will also be copied.
After modifying the value set labels in an outside program, you can use the Paste functionality to bring the changes back
to CSPro. You do not need to paste all value sets, so you can make changes to only the value sets that you want to
modify. You can add and remove value set labels in the outside program, but you cannot change value set names.
If you want to remove all the value sets from your program, which can be useful if testing a data entry application that
someone else designed (and for which you do not know the proper values), you can use the Delete All functionality. The
change is permanent, however, so you may want to make a copy of your dictionary prior to removing all value sets.
Records
In a dictionary with many records, it may be useful to modify all records to make all required (Set All to Required) or to
make none required (Set All to Not Required).
If you want some number of blank values in your output file, you can modify the Percent Not Applicable. If you want
some items to have values that are not in their value sets, than you can modify the Percent Invalid. When testing batch
edit applications, this can be useful as a way to detect if your edits properly handle blank and out-of-range values.
Suppose your file has 100 cases and you want to create a 10% sample. Creating a random sample means that all 100
cases have a 10% change of being selected for the sample. Based on randomness, that means that generally your
sample file will have 10 output cases, but it could have fewer or more. You might also get some clumping of cases. On
the other hand, a sequential sample means that the output file will consist of the 1st, 11th, 21st, ... cases. The output
file will always have 10 cases. By modifying the Start Position, you can control which case is output first. For example,
if that value was 3, then your output file would consist of the 3rd, 13th, 23rd, ... cases.
Compact Data: When working with text data sources as external dictionaries to a program, you can end up with
a lot of deleted records. Deleted records appear in the file with a tilde (~) at the beginning of the line. This option
will remove all deleted records from a file, reducing its size.
Sort Data in ID Order: This will run the Sort Data tool to automatically sort a data file in ID order. Particularly
after concatenating multiple data files, you may want to sort the data in some order, generally based on
geographic ID items.
Create Notes Dictionary: This will create a new dictionary, based on your current one, that can be used to read
the .csnot notes file generated by a data entry application. If you want to export your data entry notes, you can
use this macro and then open the newly created dictionary in the Export Data tool.
Dictionary Analysis
While editing or viewing a data dictionary, you can generate several reports that analyze the contents of the dictionary by
selecting an option from the Dictionary Analysis submenu off the View menu. The options include:
Items Without Value Sets: Determine which items have no value sets.
Numeric Items With Overlapping Value Sets: Determine which numeric value sets have discrete or range values that
overlap.
Numeric Items With Mismatched ZeroFill or DecChar Options: Determine which numeric items have a zero fill or
decimal character option that differs from the dictionary's default options. You generally want every item in the dictionary
to use the same options.
When checked, the first option, Allow the use of Data Viewer to modify data, means that a user can use Data
Viewer to delete and modify cases.
When checked, the second option, Allow exporting data to other formats, allows a user to use Data Viewer, the
Export Data tool, or the export statement to output data to another format.
Encryption Options
Encryption options control whether or not passwords can be stored for data sources using that dictionary. These options
only apply when using the dictionary to access encrypted data using the Encrypted CSPro DB data source. By default,
passwords used to open such files are not cached on the machine, meaning that the password must be specified every
time the file is opened (unless the password is specified in a connection string). If you want to allow the storage of
passwords, specify the number of minutes that the stored password can be used to open the data file.
The password itself will not be stored on the device; instead, a key that can be used to open the data file will be stored.
A clever hacker who accesses this key may be able to open the data but will have a hard time reverse engineering the
key to get the original password itself. The key is stored in secure storage on the device. You can use the CSPro
Settings dialog to clear any cached passwords.
Program Structure
Declaration Section
Procedural Sections
Logic
Language Elements
For a list of commands, see the CSPro Statements and Functions section.
Data Requirements
Data files appear in many different formats and structures. CSPro can work with several type data sources, but most
people work with text files. If you are using data files created by another software package, you must save the data in a
separate text file before you can use it with CSPro. Data files are limited to 2 gigabytes in overall size; the maximum
length of any record in the file is 32,000 characters. CSPro encodes data files using UTF-8. Read more about Unicode
text files at the Unicode Primer.
CSPro processes one case at a time. Each record must contain a unique questionnaire identification code in the same
position in each record. This number must be the same for all records of the same case. If the file is a "flat" file, meaning
that each questionnaire contains only one record, the questionnaire identification becomes irrelevant. In this case, any
data item can be used as the questionnaire identification, but it should be unique for each record. CSPro uses the case
identification values to determine where one case ends, and the next one begins. Records belonging to the same case
must be contiguous within the data file, but there is no requirement that the data file be sorted by case identifier.
CSPro can handle a data file with multiple record types—for example, housing and population—but a record type code
must identify the type of record. This code must be in the same position in each record. Within the same record type,
each data field must be in the same position.
CSPro can process one input data file at a time, but it can access one or more external files. These files must also be
described by a data dictionary.
In some survey data, especially where the total number of data items is great but only a few responses are expected, the
user may choose a format in which each data field is preceded by a "source code" relating it back to the original
document. By using this scheme, non-response fields (empty responses) need not be entered. With this type of format,
each data field is not in a pre-defined location on the record. Before a file like this can be processed by CSPro, it must
be reformatted so that the data fields are in fixed positions. Items in data files must be fixed format, that is, items must
have the same starting position and length in every record where they occur.
Example
PROC GLOBAL
// variables
numeric MinAgeDifferenceParent = 12,
MaxAgeDifferenceMother = 55;
string personName;
array validRelationships(5);
// user-defined function
function numeric IsMotherValidByAge(numeric motherAge, numeric childAge)
IsMotherValidByAge = ( ( motherAge - childAge ) in
MinAgeDifferenceParent:MaxAgeDifferenceMother );
end;
Procedural Section
This section contains executable and assignment statements that can be written before (preproc) or after (postproc) an
event. Events always fall under the PROC section, which is followed by the name of the forms file, level, form, roster, or
field. Statements are assumed to be in the postproc unless it is explicitly stated that they are in another procedure.
Data entry applications also have a forms file procedure. The form file preproc is executed before a data entry session
begins. There are three other procedure types that may be useful in data entry applications: onfocus, killfocus, and
onoccchange.
Example
See also: Proc Statement, PreProc Statement, OnFocus Statement, OnOccChange Statement, KillFocus Statement,
PostProc Statement, Order of Executing Data Entry Events, Order of Executing Batch Edit Events
Programming Standards
It is a good idea to create a set of programming standards when writing CSPro code. If you work in an organization, you
will want to discuss these standards with the other programmers in your organization. Code is more readable when
multiple people use the same coding guidelines.
This is an example of a set of programming standards adopted by an organization. Your standards may differ, but the
point is to at least have a set of standards.
1. Backup your code, and all components of the application, frequently. Create a system of version control (such as
by using Git).
2. Compile code frequently while working on it as this will help you identify syntax errors early.
3. Use the help system (by pressing F1) and the reference window while writing logic.
4. Use // for line comments and { } for multiline comments. Comment the code while writing it, not afterwards, as
it will probably never get done in that case. Remember that it is likely that you will not be the only person who
looks at your code.
5. Maintain indentations for code within conditional statements (if), loops (do/for), and functions.
6. Write CSPro statements in lowercase.
7. Write variable names from the dictionary in UPPERCASE.
8. Write globally declared variable names in mixed case, starting with a lowercase word (this is called camelCase).
Make all global variables at least three characters long.
9. Declare all constants in PROC GLOBAL, not in the application-level preproc.
10. Use locally declared variables for loop counters and when only one procedure or function uses the value of the
variable. Remember that the value of the local variable will be reset (to 0) each time the procedure is executed.
11. Use functions when an operation is generalizable or if the same operation will be called repeatedly from different
procedures.
12. Use standard prefixes for variable names. For example:
Another technique is to use error messages with the errmsg function. The generation of an error message tells you that
program control has passed to that point in your application. The error message can also include variable values to
indicate the status of those variables at that point.
It may sound obvious, but it is good programming technique to indent your code. This will help you find problems
caused by unterminated if or do statements, for example. If you consistently indent (or tab) the content within any
control structure, finding a lost endif or enddo will be much easier.
You can also use the trace function to determine problems in your application by outputting messages to a separate
window or file rather than have them appear onscreen as they would with error messages.
Prior to CSPro 7.2, there was a compilation mode called implicit that allowed for on-the-fly variable declarations. This
was removed because it could easily lead to mistakes based on programmers mistyping variable names. There were
some purposes where implicit declaration mode could be useful, which can now be achieved by using the ensure
statement.
Variables
In CSPro you can declare numeric or string variables. Variable names must contain only letters, numbers, or the
underscore (_) character, and must begin with a letter. Names are case insensitive, that is, uppercase and lowercase
letters are considered the same. (For example, myvar, MYVAR, and MyVar are all equivalent.) Variables of all types follow
the same rules for names.
Numeric Variables
In CSPro, numeric variables are stored internally in floating point format. They can accommodate numbers of extremely
small or large size, positive or negative. Numeric variables can be global in scope, meaning that you can assign or get
the value of a variable from any other event, or local in scope, meaning that you can only access the value from within a
single event. A numeric variable may be up to 15 digits in size. It is equivalent to a float or double variable.
String Variables
String variables in CSPro store alphanumeric data. You must declare a string variable using either the alpha or string
statements. As with numeric variables, string variables can be either global or local in scope.
See also: Array Object, Text Strings, Logic Objects and Dot Notation
Alias Statement
Format
alias aliased_name : original_name;
Description
The alias statement allows for the creation of new names to alias, or to provide an alternative reference to, names in a
program. This can be used to shorten, or lengthen, variable names, or to match names in a dictionary with preexisting
code using a certain naming convention.
In both examples, P14_AGE, P15_RELATIONSHIP, and P16_SEX are the names of items declared in a dictionary. Once the
alias statement has been specified (in PROC GLOBAL), the names P14 and P14_AGE can be used interchangeably. For
example, these two statements are identical:
Ensure Statement
Format
ensure numeric variable_name1『 , ... , variable_nameN』 ;
Description
The ensure statement allows for the conditional creation of numeric variables in the PROC GLOBAL section of logic. One
or more variable names are specified, and if the name of the variable exists (because of being declared previously in
logic, or defined in a dictionary), then it will not be created. If it does not exist, a temporary variable is created just as if it
were a declared numeric variables. The initial value of the variable is 0.
This statement is useful when working with applications that reuse blocks of logic with different dictionaries. If a certain
routine depends on a dictionary variable, you can use this statement to ensure that the logic compiles without issue. It is
important to make sure that your logic works regardless of whether the variable is defined in a dictionary or from the
ensure statement.
In versions of CSPro prior to 7.2, you could simulate the behavior of ensure by using set implicit mode.
Example
PROC GLOBAL
ensure numeric SUPERVISOR_CODE;
PROC DATA_TRANSMISSION
// assuming that 0 is never a valid value for SUPERVISOR_CODE, then
// you could have two synchronization routines in your code, one for
// surveys where there are supervisors (and SUPERVISOR_CODE is defined
// in a dictionary), and another for surveys where interviewers directly
// send data to headquarters
if SUPERVISOR_CODE = 0 then
SyncWithHeadquarters();
else
SyncWithSupervisor();
endif;
return_value = function_name(argument_list);
Functions may include an argument list, which can vary depending on the function call's requirements. This list may be
empty (that is, it contains no arguments between the opening and closing parentheses) or it may contain one or more
arguments. Each argument specifies a variable, array, list, or file handler that is used by the statements within the
function.
Numeric, string, and alphanumeric variables are local to the function. That is, if a variable is passed as an argument, its
value in the rest of the application will not be changed by actions within the function (this is called "pass by value"). On
the other hand, objects (such as arrays and file handlers) passed as arguments refer to the source variable and
interactions on the variable affect the source variable (this is called "pass by reference"). If you want to pass a numeric or
string variable by reference, you can use the ref keyword to signify that changes made in the function should affect the
source variable.
A user-defined function:
Array Object
In logic, an array is an object that contains a collection of elements of the same type, either numeric, alphanumeric, or
string. An array can be of many dimensions, though each of these dimensions is of a fixed size. Arrays can be used in
various processing operations, including defining a list of constant values (like month names) or working with hotdecks
and DeckArrays.
Whenever an array variable is used in the application, a value or numeric expression for each dimension must be given.
The initial array contents are zero (if numeric) and blank (if alphanumeric or string) until a value for each dimension is
assigned. If using a numeric saved array, the initial array contents are default.
If you want the behavior of an array but without fixed dimensions, you can use the hashmap object, which is an
associative array.
Functionality
An array is a CSPro logic object and the following functions can be called via dot notation:
Function Description
clear Resets all array values to the default value.
length Returns the size of an array dimension.
Audio Object
Pa ge 105 of 684 Decla ra on Sec on
In logic, an audio object represents an audio recording. Audio recordings may be loaded from audio files, saved to audio
files, played back or recorded. Audio objects are typically used to record portions of an interview either interactively or in
the background. With interactive recording an audio recorder is displayed to the user who then controls when the
recording is started and stopped. The user must complete the audio recording before moving to the next survey question.
With background recording, the audio recorder is started and stopped using CSPro logic and the user can continue with
the survey while the audio is recorded.
Audio recording is typically implemented by declaring a variable of type audio, calling either the record or
recordInteractive function, and then calling the save function to save the audio to a file.
Functionality
An audio object is a CSPro logic object and the following functions can be called via dot notation:
Function Description
clear Erases the audio recording currently stored in the audio object.
concat Appends an audio recording to the recording stored in the audio object.
length Returns the length, in seconds, of the recording in the audio object.
load Reads an audio file and places the contents in the audio object.
play Launches an audio player to play back the contents of the audio object.
record Starts recording audio in the background.
recordInteractive Starts interactive audio recording.
save Writes the audio recording to a file.
stop Stops the current background recording.
In addition to these object functions, audio objects can be used as arguments to the filename function.
Audio objects can be assigned to other audio objects, which will replace the recorded audio with the recording from the
assigned audio object.
audio_name = another_audio_name;
Example
File Object
Most files in CSPro are data files, whose structure is defined by a data dictionary are are used in logic by referring to
their dictionary names. However, it is also possible to write to text files without an associated data dictionary. Lines of
text can be written to a file or read from a file, and these files can also be used in export statements.
The physical name of the file can be specified in the Define File Associations dialog when the application is run, in a PFF
file, or by using the open or setfile functions.
Functionality
A file is a CSPro logic object and the following functions can be called via dot notation:
In addition to these object functions, files can be used as arguments to functions such as filecopy, filedelete,
fileempty, fileexist, filename, filerename, filesize, open, and pathname.
Example
file login_times_file;
login_times_file.open("Login Times.csv", append);
login_times_file.write("%s, %s, %d", getdeviceid(), getoperatorid(), timestamp());
login_times_file.close();
HashMap Object
In logic, a hashmap is similar to an array but has dimensions that can be either non-consecutive numbers or strings. A
hashmap is an associative array that can be used to store numbers or strings and can dynamically grow or shrink in
size.
Functionality
A hashmap is a CSPro logic object and the following functions can be called via dot notation:
Function Description
clear Removes all values from the hashmap.
contains Returns whether a specified key exists.
getKeys Fills a list with the hashmap's keys.
length Returns the number of keys.
remove Removes a key.
Hashmaps can be assigned to other hashmaps, which will replace the initial hashmap with the values of the assigned
hashmap:
hashmap_name = another_hashmap_name;
When assigning a hashmap to another hashmap, both must have the same value types, and the dimension types must
be compatible.
Individual elements of hashmaps can be retrieved or set by using an index to specify the dimension keys:
Example
List Object
In logic, a list is similar to an one-dimensional array but without a defined size. That is, a list is a collection of values,
either numeric or string, that can grow or shrink in size.
Functionality
A list is a CSPro logic object and the following functions can be called via dot notation:
Function Description
add Adds a single value, or a list of values, to the end of a list.
clear Removes all values from the list.
insert Inserts a single value, or a list of values, at a given position in the list.
length Returns the size of the list.
remove Removes the value at a given position from the list.
removeDuplicates Removes duplicate values from the list.
removeIn Removes values from the list that are specified in an in list.
seek Returns the index of a specified value.
show Displays the list's values (similarly to accept) and returns the index of the operator's
selection.
sort Sorts the list's values in ascending or descending order.
In addition to these object functions, lists can be filled when used as arguments to functions such as dirlist, keylist,
and hashmap.getKeys.
Lists can be assigned to other lists, which will replace the initial list with the values of the assigned list:
list_name = another_list_name;
list_name(index) = modify_value;
list_name(list_name.length() + 1) = add_value;
Example
list string respondent_query;
do numeric counter = 1 while counter <= count(NAME)
respondent_query.add(NAME(counter));
enddo;
numeric respondent_index = respondent_query.show("Who in the household is responding to
questions?");
Map Object
In logic, a map is an object that can be used to display and control an interactive map. The map may be panned and
zoomed with touch controls and may optionally show the user's current location. Using map functions you can add
markers at geographic positions on the map, add custom buttons, and set the geographical area displayed. You can
also specify user-defined functions in your logic to be called when the user taps on markers, buttons, and on the map
itself. Together, these functions allow for rich map-based interactions such as showing households on a map and
launching an interview when the user taps a household.
Note that maps are currently only supported on Android. The map functions will do nothing when run on Windows.
To display a map, call the show function. The map will be displayed allowing the user to interact with it until the user taps
the back button or the hide function is called from program logic.
You can call map functions to add markers, buttons, set the base map, and pan/zoom before showing the map. This will
be more efficient than adding them afterwards, especially when adding large numbers of markers. You can also call the
map functions after showing the map from within any of the user-defined callback functions for map, marker, and button
clicks and drags.
By default, the map contains a button to zoom to the users current location. You can add additional buttons to the map
by calling addImageButton or addTextButton.
By default, the map will display a base map from Google Maps, which requires an Internet connection. With no Internet
connection the base map will be empty. To display a map without an Internet connection you can copy an an offline map
file to your device and pass the file to the setBaseMap function.
Functionality
A map is a CSPro logic object and the following functions can be called via dot notation:
Function Description
show Display the map and allow the user to interact with it.
hide End displaying the map if it is currently showing.
addMarker Place a marker on the map at specified latitude and longitude.
removeMarker Delete a marker from the map.
clearMarkers Delete all markers from the map.
setMarkerImage Set the icon displayed on the map for a marker.
setMarkerText Set text displayed on the map for a marker.
setMarkerDescription Set text displayed in a popup window when a marker is tapped, and in the marker list.
setMarkerOnClick Set the user-defined function that is called when the user taps a marker.
setMarkerOnClickInfoWindow Set the user-defined function that is called when the user taps on a marker's popup
info.
Example
// Declare a map
map mymap;
// Add a marker to the map at latitude 38.84839, longitude -76.931098
mymap.addMarker(38.84839, -76.931098);
// Display the map
mymap.show();
Pff Object
In logic, a pff is an object that contains information about a CSPro Program Information File (.pff). PFF files are used to
run CSPro applications or tools in production mode. Using the pff object in logic, it is possible to dynamically modify the
properties used when running an application or tool.
Functionality
A pff is a CSPro logic object and the following functions can be called via dot notation:
Function Description
load Loads the contents a PFF file from the disk.
save Saves a PFF file to the disk.
getproperty Gets the value associated with a PFF property.
setproperty Sets the value associated with a PFF property.
exec Executes the application or tool associated with the PFF.
Example
SystemApp Object
In logic, a SystemApp is an object that contains information about the parameters to be used when calling a system
application. A system application is an application that is loaded on a device and is generally not part of the CSPro suite
of applications and tools. On Android, the SystemApp object also stores any information that resulted from the running of
the system application.
Functionality
A SystemApp is a CSPro logic object and the following functions can be called via dot notation:
Function Description
clear Clears any stored arguments.
exec Executes the system application.
getResult Gets a result returned from the system application.
setArgument Stores the argument for use when executing the system application.
Example
// play the training video in Windows Media Player
SystemApp windows_media_player;
windows_media_player.setArgument(pathconcat("../Videos/Census Training.mp4"));
windows_media_player.exec("wmplayer.exe");
See also: ExecSystem Function (Desktop), ExecSystem Function (Mobile), View Function
ValueSet Object
In logic, a valueset is an object that contains information about the possible values that are considered permissible for a
field. Each possible value contains a label, a code (the valid value), and can contain a link to an image shown when the
value set is displayed. The value set object, once passed to the setvalueset function, becomes an item's dynamic
value set.
Functionality
A value set is a CSPro logic object and the following functions can be called via dot notation:
Function Description
add Adds a single value, or all the values from an existing value set, to the value set.
clear Removes all values from the value set.
length Returns the size of the value set.
randomize Randomizes the order of entries in the value set.
remove Removes the value with the specified code from the value set.
Pa ge 113 of 684 Decla ra on Sec on
show Displays the value set's labels (similarly to accept) and returns the code of the operator's selection.
sort Sorts the order of entries in the value set by either label or code.
In addition to these object functions, value sets contain two built-in list objects: valueset_name.codes and
valueset_name.labels, which contain the codes and labels for the value set. These are read-only lists and can be used
in most ways that list objects can be used.
Several functions accept value sets as arguments, including: getimage, getlabel, invalueset, maxvalue, minvalue,
randomin, randomizevs, and setvalueset. Value sets can also be used with the in operator.
Value sets can be assigned to other value sets, which will replace the initial value set with the values of the assigned
value set:
valueset_name = another_valueset_name;
Example
PROC RESPONDENT
preproc
valueset respondent_valueset;
do numeric counter = 1 while counter <= count(PERSON_REC)
if AGE(counter) >= 15 and USUAL_MEMBER(counter) = 1 then
respondent_valueset.add(NAME(counter), counter);
endif;
enddo;
setvalueset(RESPONDENT, respondent_valueset);
Executable Statements
Executable statements begin with a command and end with a semicolon (;). They are made up of a combination of
commands, keywords, expressions, and functions. For example:
skip to Q103;
skip is a command, to is a keyword, and Q103 is the name of a data entry field.
Assignment Statements
Assignment statements set a variable equal to the value of an expression and do not contain commands. If the
expression is a string expression, then the variable must be alphanumeric. If the expression is numeric or conditional,
then the variable must be numeric. For example:
AGE = 10;
Q102 = previousAge;
Y = sqrt(X);
NAME = "John Doe";
SEX_RATIO = MALES / FEMALES;
Proc Statement
Format
PROC PROCEDURE_NAME
Description
The PROC statement declares the beginning of the procedures for a data entry or batch processing element. The
PROCEDURE_NAME must always be the name of an object in the forms or edit tree. If you are in the logic view and
select a processing element from the tree, the logic view will automatically generate the "PROC PROCEDURE_NAME"
heading.
If you plan to write logic for more than one procedure, the order of procedures must be as follows:
See also: PreProc Statement, OnFocus Statement, OnOccChange Statement, KillFocus Statement, PostProc
Statement, Order of Executing Data Entry Events, Order of Executing Batch Edit Events
PreProc Statement
Format
preproc
Description
The preproc statement declares that the statements following it are executed at the beginning of a run, case, level,
record, form, roster, or field.
In data entry applications, statements in a preproc procedure are executed when you move forward onto an object.
Moving forward occurs when the execution flow moves the cursor onto the object, or when the user goes forward to the
object by any means (mouse-click, tab key, arrow keys, etc.). If you move backward onto an object, preproc
statements are not executed. Moving backward occurs if a keyer reenters a value, goes backward with a mouse click, or
uses the Shift+tab or arrow keys to move backward. If you want to execute the statements when you move both forward
and backward onto a field, code them in the onfocus procedure.
In batch edit applications, a preproc is used to execute logic at the beginning of a run, case, level, or record. For an
item there is no difference between placing your logic in a preproc or postproc.
Remember, if you don't code a preproc or postproc in a PROC, all instructions are considered postproc statements by
default.
Example
PROC INTERVIEW_DATE
preproc
INTERVIEW_DATE = sysdate("YYYYMMDD");
See also: Proc Statement, OnFocus Statement, OnOccChange Statement, KillFocus Statement, PostProc Statement,
Order of Executing Data Entry Events, Order of Executing Batch Edit Events
Description
The onfocus statement declares that the statements following it are executed when a form, roster, or field becomes
active.
Statements in an onfocus procedure are executed when you move onto the object in which they are coded. When
moving forward, any preproc statements are executed before the onfocus statements. However, when moving
backward, the preproc statements are not executed and only the onfocus statements are executed.
Example
PROC CHRONIC_ILLNESS
onfocus
if curocc() = 1 then
setvalueset(CHRONIC_ILLNESS,CHRONIC_ILLNESS_FIRST_VS);
else
setvalueset(CHRONIC_ILLNESS,CHRONIC_ILLNESS_SECOND_VS);
endif;
See also: Proc Statement, PreProc Statement, OnOccChange Statement, KillFocus Statement, PostProc Statement,
Order of Executing Data Entry Events, Order of Executing Batch Edit Events
OnOccChange Statement
Format
onoccchange
Description
The onoccchange statement declares that the statements following it are executed at the time that a group's current
occurrence changes. The onoccchange statement can only be coded in a group: a roster or a multiply occurring form.
During data entry, whenever the user moves from one occurrence to another (forward or backward), the onoccchange of
the group will be executed.
The onoccchange statement is not very common, but it may be useful in some CAPI applications, particularly to set
question text fills.
Example
PROC PERSON_FORM
onoccchange
if curocc() > 0 then
personName = strip(FIRST_NAME(curocc())) + " " + strip(LAST_NAME(curocc()));
endif;
Pa ge 117 of 684 Procedura l Sec ons
See also: Proc Statement, PreProc Statement, OnFocus Statement, KillFocus Statement, PostProc Statement, Order
of Executing Data Entry Events, Order of Executing Batch Edit Events
KillFocus Statement
Format
killfocus
Description
The killfocus statement declares that the statements following it are executed when a form, roster, or field stops being
active.
Statements in an killfocus procedure are executed when you move off the object in which they are coded. When
moving forward, any killfocus statements are executed before any postproc statements. However, when moving
backward, the postproc statements are not executed and only the killfocus statements are executed.
The killfocus statement is not very common as most logic operations and checks only occur when moving forward in
the application, in which case they can be coded in a postproc.
Example
PROC CEB_MALE
killfocus
CEB_TOTAL = visualvalue(CEB_MALE) + visualvalue(CEB_FEMALE);
See also: Proc Statement, PreProc Statement, OnFocus Statement, OnOccChange Statement, PostProc Statement,
Order of Executing Data Entry Events, Order of Executing Batch Edit Events
PostProc Statement
Format
postproc
Description
The postproc statement declares that the statements following it are executed at the end of a run, case, level, record,
form, roster, or field.
In data entry applications, statements in a postproc procedure are executed when you complete an object; that is, flow
off of it. When you click off a field, manually skip from a field, or move backward, the postproc statements are not
executed. If you want to execute the statements in these situations, code them in the killfocus procedure.
In batch edit applications, a postproc is used to execute logic at the end of a run, case, level, or record. For an item
there is no difference between placing your logic in a preproc or postproc.
If you don't code a preproc or postproc in a PROC, all instructions are considered postproc statements by default.
Example
See also: Proc Statement, PreProc Statement, OnFocus Statement, OnOccChange Statement, KillFocus Statement,
Order of Executing Data Entry Events, Order of Executing Batch Edit Events
PROC P06_MARITAL_STATUS
PROC P06_MARITAL_STATUS
if P06_MARITAL_STATUS = 1 then
skip to P08_WORK_STATUS;
endif;
Press on the toolbar, or press Ctrl+F; or select Find from the Edit menu to open the Find dialog box. You can
simply search for a text string by typing it in the dialog box and pressing Enter. You can then press the F3 key to find
the next occurrence of the text string. If you use the Mark All button, CSPro will show you each line that contains the
text string by putting a yellow circle to the left of it.
Replace
Press Ctrl+H; or select Replace for the Edit menu to open the Replace dialog box. This is the standard Windows dialog
box that allows you to replace one text string with another text string, either one at a time or all at once.
Compile Logic
When CSPro compiles your logic, it checks the logic you have written to see if there are any errors or warnings. Typical
errors including spelling a command incorrectly, not using proper command syntax, and putting logic in the wrong place.
Error messages appear in the panel at the bottom of the screen and a red dot appears to the left of the line that contains
the error. Typical warning usually involve using commands in questionable ways. Warning messages also appear in the
panel at the bottom of the screen and a yellow dot appears to the left of the line that contains the warning.
You can choose to compile code for a specific item, or for the entire application. To compile code for a specific item,
simply select that item from the tree on the left side of the screen. The associated logic for that item will be displayed in
the Logic View. Press on the toolbar; or from the File menu, select Compile; or press Ctrl+K. The results of your
compile will be displayed in the Compiler Output area at the bottom of the screen. When you are ready to compile the
entire application, select the topmost entry of the data entry tree or batch edits tree. This will display all logic written for
the application in the Logic View. You can then press the compile button or press Ctrl+K.
CSPro always compiles your application when you run the data entry or batch edit application. If there are errors, you
cannot proceed until the errors are corrected.
Comments
Comments make applications easier to understand. They are used to explain the purpose of specific statements or to
temporarily disable statements to help find errors. Any text enclosed by braces { } is a comment. The text within the
brackets will be green. Comments can be placed anywhere in an application and are not checked for syntax errors.
Comments can be nested, that is, comments within comments are allowed. Any text following double slashes // until
the end of the line is also a comment.
The third line in the example below is a comment; a comment is also appended to the code in line five. It is highly
recommended that the user document CSPro applications through the liberal use of comments.
Example
PROC HHDAY
{ Do not allow June to have more than 30 days }
if HHMONTH = 6 and HHDAY > 30 then
errmsg(1,"June",30,HHDAY); // if error, then display message
reenter;
endif;
Logic Preprocessor
Format
#if condition
#elseif condition
#else
#endif
Description
Before CSPro logic is compiled, it passes through a preprocessor that allows you to conditionally compile your code
based on certain conditions. This is an advanced feature and is primarily useful when using external logic files where
The preprocessor uses the language's familiar if/elseif/else/endif keywords but with four differences:
An #if and #endif pair must exist, and optionally #elseif and #else can be included as part of a preprocessor block.
The preprocessor is activated before the rest of logic is compiled and thus has limited functionality. Simple math
expressions and conditional checks are possible, and there are two functions that can be called:
AppType(app_type)
This function returns true if your application is of the type specified by app_type. Supported types are Entry, Batch, and
Tabulation.
exists(symbol_name)
This function returns true if a symbol exists at compile-time. The function checks for symbols that exist as part of an
application (dictionary names, form names, etc.), and not symbols that are created during compilation (user-defined
functions, objects, etc.).
Example 1
function string GetEAName(numeric province, numeric district, numeric ea)
string ea_name = maketext("EA %04d-%02d-%03d", province, district, ea);
// if we have access to the geocodes dictionary, add the EA description to the name
#if exists(GEOCODES_DICT)
GEOCODES_PROVINCE = province;
GEOCODES_DISTRICT = district;
GEOCODES_EA = ea;
if loadcase(GEOCODES_DICT, GEOCODES_PROVINCE, GEOCODES_DISTRICT, GEOCODES_EA) then
ea_name = ea_name + " / " + strip(EA_DESCRIPTION));
endif;
#endif
exit ea_name;
end;
Example 2
PROC SEX
if AGE > 15 and NumberOfKids <> notappl then
$ = 2;
endif;
However, in developing a data entry or batch edit application, it will frequently be necessary to define variables that do not
exist in the data file(s) attached to the application. These variables may be used throughout the application, but only
exist during the execution of the application.
See also: Introduction to Data Dictionary, Add or Modify Items, This Item ($)
Pressing Ctrl + 4 (4 is where the dollar sign is on many keyboards) will insert the name into the logic buffer.
Example
PROC AGE
if MARITAL_STATUS > 1 then // ever married
if $ < 12 then // the same as AGE < 12
errmsg("Person too young (%d) to be married",$);
endif;
endif;
Subscripts
Items with multiple occurrences or in multiple records have one name (the item name), but can occur multiple times. In
order to indicate the specific occurrence of the item, you may need to use an index or subscript. The subscripts are
integers and are numbered from 1.
Imagine that the SEX is an item in the multiple record CHILD.
The expressions
SEX(1) refers to the sex of the first child.
refers to the current occurrence of CHILD. (curocc is a function that returns the current occurrence of a multiple record).
When referring to multiply-occurring items within the scope of their repetition, you do not need to use subscripts, as the
current occurrence will be assumed. For example, suppose you have a population record that has multiply occurrences,
and belonging to that record are the three variables SEX, AGE, and FERTILITY. If your code is contained within any of
these variables' procedures, you do not need to use subscripts. To illustrate:
Example 1:
{this will check the sex and fertility values for each person in the household}
PROC SEX
if $ = 1 then
if fertility <> notappl then
errmsg ("male found with fertility");
endif;
elseif $ = 2 then
if age < 10 and fertility <> notappl then
errmsg ("underage female found with fertility data");
endif;
else
errmsg ("invalid sex code (sex=%d)", $);
endif;
However, if you were to place the exact same logic elsewhere in your program, you would have to programmatically
mimic the looping mechanism, and use subscripts. For example, if the above code were placed in the QUEST
procedure, it would need to be adjusted as follows:
Example 2:
PROC QUEST
NumPeople=count (POP_RECS);
do varying i=1 while i <= NumPeople
if sex(i) = 1 then
if fertility(i) <> notappl then
errmsg ("male found with fertility");
endif;
elseif sex(i) = 2 then
if age(i) < 10 and fertility(i) <> notappl then
errmsg ("underage female found with fertility data");
endif;
else
errmsg ("invalid sex code (sex=%d)", sex(i));
endif;
enddo;
Numbers
Numbers may be any positive or negative integer or decimal value. Negative numbers have a leading minus (-) sign.
Positive numbers have no sign, but can have an optional leading plus (+) sign. Numbers can have up to 15 significant
digits. Numbers must not have thousands separators. Decimal points can be either period (.) or comma (,) depending on
the "Regional Options" setting of the computer.
Text Strings
A text string is any set of characters in the computer's character set enclosed between a pair of quotation marks (") or
apostrophes ('). Any spaces enclosed within the quotation marks or apostrophes are considered part of the text string.
Upper- and lower-case letters may be used. However, a text string 'a' is different from a text string 'A.' If you wish to have
apostrophes (') embedded within your string, you must use the quotation marks (") to enclose it, and vice-versa. For
example:
This would set MyString to "that" and the trailing "s great!" would be considered outside the string, and would therefore
provoke a compiler error. Thus, if you wanted to accomplish the above, you must write:
Similarly, if you wanted to embed quotation marks within your string, you must write the string as follows:
Strings that are surrounded by quotation marks will appear in magenta. Strings that are surrounded by apostrophes will
appear in black. We recommend using quotation marks ("), and if the text of the string does not turn magenta when you
think you have completed entering it, it will be quickly apparent that you have not terminated your string properly.
• String
They evaluate to strings. The following are string expressions:
answer="Yes";
concat(FIRST_NAME, " ", LAST_NAME);
edit("ZZZZ9", A + B);
• Logical
Logical expressions (conditions) evaluate to true (1) or false (0). The following are logical conditions:
KIDS > 5
SEX = 2 and AGE > 12
Substring Expressions
Format
string_variable[start_index:string_length]
Description
A substring expression lets you extract a part (substring) of a string. The start_index gives the starting character
position of the substring within the string, and string_length gives the number of characters to include in the substring,
including the starting character. If string_length is not given, then it is assumed to be to the end of the originating string.
A negative start_index leads to the basing of the substring from the end of the string.
Example 1
Suppose the variable STR has the value "ABCDEF":
Both start_index and string_length can be numeric expressions as well as constants. For example, to obtain the last 3
characters of STR you could use the expression:
STR[length(STR) - 2:3]
In this example, if STR is not at least two characters long, you may get unexpected results. You could also write the
above as:
STR[-3]
Example 2
Likewise, substring expressions can be performed on string arrays. Suppose the string array crop had the following
definition:
PROC GLOBAL
array string crop(5); // 5 crop names
PROC MY_PROGRAM
preproc
crop(1) = "maize";
crop(2) = "wheat";
crop(3) = "rice";
crop(4) = "potatoes";
crop(5) = "legumes";
crop(1)[2] "aize"
crop(1)[3:1] "i"
crop(2)[3] "eat"
crop(3)[2] "ice"
crop(4)[5] "toes"
crop(5)[1:3] "leg"
Boolean Values
There are two numeric values in the CSPro language that translate to boolean values as used in other programming
languages. The value true is equivalent to 1 and the value false is equivalent to 0. Note that you need to be cautious
when using true in conditional expressions, as a conditional expression is true if it is neither zero nor special, but
comparing something with true is true only if the compared value is 1. That is:
numeric value = 2;
if value then
errmsg("This will be displayed");
endif;
if value = true then
errmsg("This will not be displayed");
endif;
Pa ge 129 of 684 Expres s ions
See also: Special Values
Special Values
There are four special values in the CSPro language: missing, refused, notappl, and default. A particular value of a
data item can be mapped to one of these special values in the data dictionary. They have the following meaning and
uses:
Missing
The value missing indicates that a data item was supposed to have a response and no response was given. Other terms
for this are "not stated" and "non-response." To properly utilize this special value, you must create a value set for this
item in the dictionary, setting one of the value set entries to the special value "Missing." For example, you could set 8 (or
88, 888, etc.) or 9 (or 99, 999, etc.) to missing. Finally, although you must associate a number with the special value
missing, you can only use the = or <> comparison operators against the special value missing; i.e., you cannot refer to
the numeric value you assigned it to in your dictionary value set.
Refused
The value refused is similar to missing but has several special attributes regarding how it is handled during data
collection. The value indicates that a data item was supposed to have a response but the respondent refused to provide
an answer. To properly utilize this special value, you must create a value set for this item in the dictionary, setting one of
the value set entries to the special value "Refused."
Notappl
The value notappl indicates that a data item is blank. The item did not have a response because the question did not
apply to this respondent. Fields that are skipped during data entry are assigned the value notappl. You can map this
value to a number or blanks in the dictionary using the special value "NotAppl," but generally it is best to keep this value
defined as blanks.
Default
The value default indicates that a data item or variable has an undefined value. This can result from various
circumstances. For example, an invalid calculation in logic (such as a divide-by-zero error) will return the result default.
Reading the value of a variable from a data file when the data type specified in the dictionary for the variable is numeric
and the value in the data file contains non-numeric characters will cause the variable to become default. The same
result occurs if the dictionary does not specify a decimal character for the variable being read but the value in the data file
contains one. Additionally, if a numeric variable has numeric subitems, some of which are notappl (blank) then the value
of the variable will sometimes be default. For example if the numeric variable DATE has 3 subitems DAY=1,
MONTH=notappl, and YEAR=2020, then the value of DATE will be default. The value default can also be written to the
data file when the value of the variable being written overflows the length specified for that variable in the dictionary (for
example if the value of the variable is 999 but the dictionary specifies a length of 2 for the variable). You can map this
value to a number in the dictionary using the special value "Default," but this should generally be avoided.
Refused Value
The value refused is a special value that has been defined in the value set and is treated in several special ways by
CSPro and CSEntry.
function OnRefused()
string probe_text = maketext("Are you sure that you want to select Refused for %s? "
"Remember the approaches discussed in training about "
"trying to get answers for all questions.", getlabel());
when warning("%s", probe_text) select("Yes", continue, "No", continue) default(1);
1 -> exit true;
-> exit false;
endwhen;
end;
Arithmetic Operators
Operation Symbol
Addition +
Subtraction -
Multiplication *
Division /
Modulo (remainder) %
Exponentiation ^
The arithmetic operators work on numeric expressions, though the addition operator can also be used to perform string
concatenation.
Relational Operators
Operation Symbol
Equal to =
Not equal to <>
Less than <
Less than or equal to <=
Greater than >
Greater than or equal to >=
In range in
Has range (for repeating items) has
Logical Operators
With logical operators, either the symbol or the keyword can be used.
In Operator
Pa ge 132 of 684 Opera tors
Description
The in operator is used in logical expressions to test whether an item or variable is within a set of values or ranges. The
item or variable can be a number or a string. A range of values is separated by a colon, for example 1:5. Elements of a
list of values or ranges are separated by commas, for example 1,3:5,7. You can also use special to mean all special
values (as in the special function). The in operator can also be used to test whether a value is in several CSPro
objects:
A list can be used to determine if the value is located in the list's values (as in the list.seek function).
A value set can also be used to test whether a value is in the value set's codes (as in the invalueset function).
Example 1
if RELATIONSHIP in 1:5 then
// is the same as...
if RELATIONSHIP >= 1 and RELATIONSHIP <= 5 then
Example 2
if WORK in 1, 3, 5 then
// is the same as...
if WORK = 1 or WORK = 3 or WORK = 5 then
Example 3
if X in 1:4, missing, refused then
// is the same as...
if ( X >= 1 and X <= 4 ) or X = missing or X = refused then
Example 4
if NAME in "A":"MZZ" then
// is the same as...
if NAME >= "A" and NAME <= "MZZ" then
Example 5
if AGE in AGE_TEENAGE_VS, special then
// is the same as...
if invalueset(AGE, AGE_TEENAGE_VS) or special(AGE) then
Has Operator
Description
The has operator is used in logical expressions to test whether a repeating item is within a set of values or ranges. The
item can be numeric or alphanumeric. A range of values is separated by a colon, for example 1:5. Elements of a list of
values or ranges are separated by commas, for example 1, 3:5, 7.
This function is similar to the in operator except that it works on repeating items. It thus tests whether a group of items
contains certain values.
Example
Y
X true false
true true false
false false true
and:
Operator Precedence
The table below shows the order of precedence for operators. When operators of the same precedence are in an
expression, they are evaluated from left to right. The order of precedence can be changed using parentheses. Operators
in parentheses are evaluated first.
Order Operators
1 ^
2 * / %
3 + -
4 = < > <= >= <> has in
5 not !
6 and &
7 or |
Pa ge 134 of 684 Opera tors
8 <=>
and (X and Y)
Y
X true false
true true false
false false false
or (X or Y)
Y
X true false
true true true
false true false
Lookup Files
A lookup file (external file) is a text file that can be used in a data entry or batch application from which you retrieve data
to display on a form or to use in a calculation. It requires a CSPro data dictionary. Possibilities include:
• Geographic codes and names. Your application could show the name corresponding to the code the user keyed.
• Industry and occupation codes. Your application could ensure the user keys a valid code.
• Last year's data. Your application could look up a corresponding field from last year's data and calculate a percentage
change.
• Generalized menu choices. Your application could read a lookup file and show the contents on the screen as a menu,
then convert the user's choice to a code.
To use a lookup file (external file) in your application, do the following:
Heads-Down Keying
This approach is most commonly used for keying of census forms because of the large volumes of data involved. While
entering data, the operator generally does not look at the computer screen, but rather, looks down at the questionnaire
on the table or work surface. The objective of heads-down keying is to transcribe to the computer, as quickly and
accurately as possible, the data as they appear on the questionnaire. On-line checking is generally kept to a minimum
and consistency errors are resolved in a later phase, generally through computer edit programs. Operators do not need
to be familiar with the subject matter of the questionnaire. They make very few decisions to resolve data errors. The most
important skill is speed and accuracy. CSPro provides operator statistics to help measure operator speed and accuracy.
Heads-Up Keying
This approach is most commonly used for entering data from surveys, due to the smaller number and greater complexity
of the questionnaires (as compared with a census). While entering data, the operator often refers to the computer screen
as well as to the questionnaire. The objective of heads-up keying is to catch and correct as many errors as possible as
the data are being entered. As a result, there is generally more on-line checking programmed into the application.
Operators need to be very familiar with the subject matter of the questionnaire. They will make decisions to resolve data
errors, and must be properly trained to do so.
Skip Issues
To Skip or Not to Skip?
Skipping—causing the cursor to jump over one or more fields during the data entry operation—is an issue that provokes
discussion both pro and con. The decision of the application designer to use (or not use) skips will depend entirely on
the type of application and the data entry staff.
When the data capture operation is expected to be heads-down, as it would be for a census or similar high-volume
application, it will cause less confusion to the keyer if all skips are controlled by the operator rather than the application.
There will then be no "surprises" for the keyer when the application logic forces the cursor to one field when the keyer,
looking not at the screen but at the form, expects the cursor to be in a different field altogether. Thus, instead of
speeding up the entry operation by anticipating probable cursor movement, this tactic eventually slows down the
operation when there are inconsis tencies in the data or, simply, keying errors.
When the data capture operation is of smaller volume, or with a more complex questionnaire or forms, it may make
sense to use application-controlled skipping. The operator will be more likely to be aware of the cursor movement and
less likely to be surprised by unexpected interruptions in the normal sequence. Of course, a combination of operator-
controlled and application-controlled skipping may be used in any given application; the designer will have to weigh the
keying environment and the forms to be entered to make the appropriate decision.
Manual Skips
Automatic Skips
Automatic, or application-controlled, skips depend on information already keyed to direct the cursor movement. For
example, in a household survey, if a female respondent states that she has at least one child living with her, the cursor
can skip automatically to the form for capturing information about that child (and any others). Conversely, if the female
indicates that no children are present, the cursor can be directed to skip over the information about children. It is clear
that such skips depend entirely on the accuracy of the data keyed prior to the skip; if a "Yes" response is mistakenly
entered as a "No," the cursor will be misdirected and the operator will find that the screen(s) presented for keying do not
correspond to the information in the paper forms. This will cause loss of time as the operator seeks to uncover the error.
Automatic skips, when used, must be well-documented so that the keyer is aware of the possibilities of non-sequential
cursor movement. This approach corresponds to the system-controlled option in CSPro.
At data entry time, CSEntry shows a message every time the keyer enters a value that is out of range according to the
data dictionary. You may set the attribute to override the message and force the out-of-range value into the data file. If
this attribute is not selected, the keyer cannot proceed until a valid value is entered.
Adding Logic
In most surveys, consistency errors are corrected manually as opposed to automatically. The correction process, as
done traditionally, is often very lengthy, time-consuming and painful. In surveys of small volume and high complexity,
such as Household Surveys or Income and Expenditures Surveys, it is often desirable to apply the edit specifications
rules at data entry time and resolve any errors immediately while the questionnaire is still at hand. This approach is not
recommended for a census.
You can use the CSPro language to write consistency checks for virtually any part of your data entry application's level,
form, roster, or field. The logic is executed as the data is being keyed. Any error messages are reported back on the
screen, and the operator then has access to both the error messages and the questionnaire itself on the screen. The
same logic can be run against the data after they are entered in either batch or interactive mode.
Operator Controlled
This is the default type of data entry application. This type generally allows more flexibility for the keyer during data
entry. It is recommended for simple ad-hoc applications and for census applications. Operator-controlled applications
have the following features:
Some special data entry keys are active during data entry.
CSEntry will not keep track of the path.
"Not applicable" values will be allowed.
More appropriate to the heads-down methodology.
Operator can bypass logic in the application using special keys.
System Controlled
These applications generally place more restrictions on the data entry operator. This type is sometimes used for
complex survey applications. The behavior of these applications at data entry time is essentially the same as in ISSA.
System controlled applications have the following features:
Some special data entry keys are not active during data entry.
CSEntry will keep track of the path.
"Not applicable" values will not be allowed unless defined in a value set.
More appropriate to the heads-up and CAPI methodologies.
Logic in the application is strictly enforced; operator cannot bypass or override.
You set the application type in the Change Data Entry Options dialog box (Options -> Data Entry from the main menu
toolbar).
Path On
CSEntry will keep track of the order in which the data entry operator entered all fields. If the operator goes backward, the
cursor will go to the fields in the reverse order in which they were entered. For example, if the logic causes the cursor to
skip over a set of fields, the cursor will also skip over these fields when the operator goes backwards. Fields that were
skipped can never be entered, unless the operator goes backwards and chooses different values to avoid the skip. This
helps ensure the integrity of the data file.
• Rosters
A roster is a grid that shows multiple occurrences of a group at the same time. Many questionnaires have rosters
printed on them. A typical example would show each person as a row and each column as a variable, as shown
below. Rosters can also have a vertical orientation, in which case the rows and columns would be reversed.
In CSPro, you can show repeating groups as a roster on a single form or as individual fields on a form that repeats.
The darker gray area at the top of each column is called a column heading. In the example above, the column
headings contain the text "Line number", "Relationship", "Sex", and "Age". The text in the darker gray area to the left
of each row is called the "occurrence label." In the example above, the occurrence labels are "1", "2", "3". These are
the default values.
In rosters with vertical orientation, column headings and occurrence labels are reversed.
• Fields
Fields are areas of a data entry form that may be keyed or may show values. Fields may be placed directly on the
form or may be part of a roster on the form. Fields are always associated with dictionary items. Some properties of
fields, such as length and type (numeric or alphanumeric), are defined in the data dictionary. Other properties are
defined in the forms designer. In this example we have two fields:
Blocks
Overview
Blocks are a way to group several fields into a related unit. In summary:
Blocks contain some number of fields that come from the same group.
On mobile devices, a block's fields can be displayed on the same screen and the operator can enter values into
these fields in any order.
Blocks can have question text. When running a data entry application, this question text will appear above the
question text for each field in the block.
Logic can be defined for the block.
Blocks are primarily useful for two reasons. Firstly, they provide the mechanism for CSEntry to display multiple fields on
a screen. Secondly, they allow you to write logic checks in one place that apply to a number of fields.
The use of blocks is entirely optional. Examples of potential uses of blocks include date fields (day, month, year) or
consumption fields (quantity, unit). Alternatively, you can lump together related questions in one block for easy viewing
on one screen (e.g., fertility counts of children).
Creating Blocks
To create a new block, select multiple fields in the Forms Tree by holding down the Ctrl key while selecting fields. Once
the fields are selected, right-click and select Add Block. A dialog box allows you to specify the block's label, name, and
whether or not the fields of the block should appear together on the same screen on mobile devices.
Only fields can be added to blocks. Blocks can be created from fields on a form or from fields on a roster. To add a field
to an existing block, drag the field onto the block. To remove a field from a block, drag it outside of the block. A block
can contain no fields, but once it contains one field, it can only contain other items from the same group. For example, if
a block contains an item from the population record, then only other items from the population record can be part of the
group.
Using Blocks
You can define question text for a block. When running a data entry application on a mobile device, the block's question
text appears at the top of the screen, and then each field's individual question text appears underneath. This allows you
to have some general text ("What is your date of birth?") and then specific text ("What is your year of birth?"). On
Windows desktop, the block and field question text appears in the same window.
You can write logic that will run when the block is entered or exited. For example, if you have a block with two fields,
FIELD1 and FIELD2, then logic will be executed in the following order if moving forward in your program:
Instead of putting logic in the field procedures, you will generally place all logic related to the fields of the block in the
block's procedures.
You can refer to blocks in logic in the movement statements (advance, ask, move, reenter, skip) and you can use the
blocks in occurrence-related functions (count, curocc, maxocc, noccurs, and totocc).
Just as you will generally want to put logic in the block procedures, you will likely want to use the block name in
movement statements rather than a field name. One advantage of this is that you can modify or reorder the fields in the
block without having to modify movement logic. When using ask on a block, if the condition fails, then all of the fields in
the block will be skipped.
Example
Thinking about the example in the above image, date of birth (day, month, year), one disadvantage of the one-question-
per-screen approach on mobile devices is that it splits related fields onto multiple screens, and, depending on where
logic checks are executed, results in a reenter potentially not going to the most relevant field. With blocks, all related
fields appear on the screen at the same time, allowing the enumerator to enter the values in any order.
Only after the enumerator has filled all applicable values will they move past the block. Logic checks for date of birth
would thus appear in the block's procedure:
Data Dictionary
1. All dictionaries that make up your application should use the same zero fill and decimal character settings. You
can check this using Dictionary Analysis.
2. Every numeric item should have a value set defining the range of values that can be entered into the field. A
common mistake is to not declare a value set for a two-digit age field, but without a value set, ages like -5 can be
entered. Although protected fields do not need value sets (because the operator will not enter a value into the
field), it is still a good idea to add a value set so that this information about possible values is available to other
users of the dictionary. You can use Dictionary Analysis to identify items without a value set.
3. For items with multiple value sets, make sure that the primary value set (the first one showing in the dictionary
editor) is the value set with the complete list of codes. This value set is used to show labels in the case tree. If the
item's value set is defined by cascading options, consider adding a generic value set as the primary value set
rather than having one that potentially leads to incorrect labels based on the cascading conditions.
Form Design
1. Ensure that every field's capture type is set to match the way that you want an operator to enter data for the field.
As much as possible, prevent an operator from having to enter codes. For numeric fields, text boxes should
generally only be used for values like age or currencies; you can use combo boxes for instances when an
operator typically will type a value but can occasionally select a discrete code.
2. Check for the consistency of field properties. For example, if the upper case setting is used for some fields, it
should generally be used for all fields. Field properties can be standardized using the Field Properties (for Multiple
Fields) dialog.
3. All protected fields must be prefilled with a valid value; if not, CSEntry will terminate data collection. Make sure
that all protected fields are filled in via logic or by using PFF parameters.
4. If an operator is never going to collect data directly for a given field, consider leaving it off the form. Items in a
dictionary can be filled in using logic without needing to exist on a form. Alternatively, you can hide such fields
from an operator using the Hide in Case Tree field property.
Question Text
1. Make sure that every question has defined question text. Even for protected fields, it is a good idea, for
completeness, to define question text.
2. For multiply occurring questions, ensure that the minimum and maximum occurrences defined for a field's
question text cover the whole range of occurrences. You can use the Harmonize Occurrences functionality to
Logic
1. Ensure that there is logic to check for acceptable values for fields without value sets. For example, you may want
to ensure that operators enter proper alphanumeric responses for fields such as names and other/specifies.
2. For alphanumeric text box and checkbox fields, you may want to ensure that a certain number of characters is
entered or choices are selected.
3. Review all hard checks and soft checks and think about what data respondents will be able to provide. You do not
want operators to enter invalid data to bypass a hard check, so use soft checks for questions where some
flexibility is allowable.
4. When working with dynamic value sets or question text fills that are set via logic, make sure that any logic that is
executed for such tasks on a multiply occurring field is in an onfocus, not preproc, event. If the logic is in the
preproc, an error can occur as an operator moves backwards to a previous occurrence because the value set and
question text will not be updated (because preproc events only occur as an operator moves forwards).
5. Verify that fields that should have values are not skipped over. When using system-controlled mode, the values in
skipped fields are deleted when a case is finalized. A common mistake is to prefill a field and then skip over it,
which results in the data being lost when the case is finalized. In such instances, the field should be prefilled and
then protected, which will prevent the operator from modifying the value but ensuring that the data is maintained.
6. A best practice is to minimize the use of unnamed constants throughout the application. For example, instead of
coding 2020 as a year, declare numeric CensusYear = 2020; and then use CensusYear throughout the
application. This makes the code clearer and also makes it easier to change your application when
circumstances change.
• Click on the toolbar, or from the File menu, select New. The following dialog box will appear.
• Select Data Entry Application or CAPI Data Entry Application and press OK. Both of these options create data
entry applications, but selecting CAPI sets some options that will be more convenient for creating CAPI applications.
(These options include: turning on CAPI mode, running in system controlled mode, and not asking for the operator ID.)
• A file dialog box will appear. Enter the name of the application file. Make sure you are located in the folder where you
want to place the application files. Then press Create. The following dialog box will appear.
• A default name of the data dictionary describing the data enter file is given. You can use this name or change it. If you
give the name of a dictionary file that already exists, that data dictionary will be used by the application. If you give the
name of a dictionary that does not exist, a new data dictionary will be created.
To generate new data entry forms, either press Ctrl+G; or, from the Edit menu select Generate Forms; or from the
dictionary tab on the left side of your screen, drag the dictionary onto a form. As this action will destroy all existing
forms, a warning message will appear, asking you to confirm that you wish to proceed.
If you choose to proceed, the Drag Option dialog will appear. At this point you have the opportunity to decide text
placement with respect to the data entry boxes; whether you want to roster items (when possible); whether you want
subitems dropped instead of the item, etc.
You can also select whether the text is placed to the left or to the right of the data entry box. (This setting has no effect if
the item is rostered.)
If you select to use the item labels, you can link the field label to the item. That means that if you change the item's label
in the dictionary, then the label on the form will automatically adjust.
Roster Options
This affects dictionary records and items with more than one occurrence. To enter this type of data, you either need a
form that repeats (to allow for the multiple occurrences of the data), or you need a roster.
If you choose Horizontal, CSPro will make rosters in which the occurrences are the rows and the fields are the
columns. In CSEntry the cursor will move from left to right by default.
If you choose Vertical, CSPro will make rosters in which the occurrences are the columns and the fields are the rows. In
CSEntry the cursor will move from top to bottom.
If you choose Don't Roster, CSPro will make forms that repeat.
If you select Use occurrence lLabels in roster, the roster's rows, instead of being labeled 1, 2, 3, etc., will use the
occurrence labels defined for the item or record in the dictionary.
If left unchecked, the cursor will automatically advance to the next field as soon as the maximum number of characters
are entered for the field (that is, if the field length is two, then after entering two characters the cursor will advance to the
next field). An operator can always hit the Enter key to complete a field without having entered the full complement of
digits.
If this option is checked, the operator must always press the Enter key to advance to the next field.
If this box is left unchecked, the control type will default to a text box (keyed entry).
You can also switch between the forms and dictionary window by pressing either or in the Toolbar.
• Forms File:
This is the highest level node, i.e., the root node. It is the owner of all code, which is to say [1] level-, record-, and
item-related code, [2] user-defined functions, and the [3] global routine.
• Level:
This is the second-tier tree node, just below the root. It has a 1-to-1 correspondence with the same-named dictionary
level.
• Form:
This is the third-tier tree node, just below its level. It represents the form and all the items in that form.
• Roster:
It represents the roster and all the items included in the roster.
• Field:
This is the terminal or "leaf"-node; i.e., the lowest accessible level. It has a 1-to-1 correspondence with a dictionary item.
You are free to rename any of the above the unique names via the properties dialog box, but it is recommended that you
retain the original name, so that it is easier for you to see which dictionary entity is being referenced. The data entry tree
represents the order in which the data entry is keyed.
You can change the order of entry by reorganizing the forms or items within the forms by dragging them within the Data
Entry tree view. When the operator enters the data, the cursor will follow the order in the tree not the order in the form.
When selecting a new edit item, the contents of the logic view will change to display the logic for the selected entity.
Pressing Ctrl+T in the data entry tree will allow you to switch between the labels and the names of the items.
Press ; or press Ctrl+R; or select Run from the File menu to launch CSEntry. You will see a screen that looks like
this:
In addition to this row, there may be rows where you can specify the names of lookup files, external files, and the
paradata log.
After specifying the file names, CSEntry may ask for the operator's ID. CSEntry keeps track of the operator's keying
record and can display the operator statistics in CSEntry. You can turn off this option in the data entry options.
You will, of course, want to test the behavior of your data entry application before using it in a production environment, so
this is just a quick way to launch CSEntry. When your application is ready for production, you can launch CSEntry
independently of CSPro. For assistance with this, see Run Production Data Entry.
This assumes that CSPro was installed in the default directory. Your PFF file must have a .pff extension.
You can create a PFF file in one of two ways: either [1] create one with a text editor (such as Notepad or Wordpad), or
[2] have it generated automatically for you by launching your data entry application from within the CSPro Designer. The
file will have the same name as your application, but with a .pff extension instead of .ent. For example, if your data entry
application was named MySurvey.ent, the generated PFF would be named MySurvey.pff.
The following section shows the options available to you in a CSEntry PFF file. A PFF file is not case sensitive, so you
can use any combination of upper and lower case text.
[Run Information]
The [Run Information] block is required. The Version and AppType and must appear exactly as shown in the example
above. The Description is optional. If included, it will be used instead of the name of the file in the list of applications
(mobile only) and in the window title during data entry (all platforms).
[DataEntryInit]
The [DataEntryInit] block is optional. It gives you the opportunity to customize the following runtime characteristics:
OperatorID=specifies the operator ID for the purposes of logging operator statistics. If this line is not present but
your data entry application has been set to ask for this, CSEntry will prompt the operator for one at runtime.
StartMode=determines the mode in which CSEntry starts. The possible options are: Add, Modify, and Verify. If
this line is not present, one of two things will occur: if the data file does not exist, then the program will start in
Add mode; if the data file does exist, then CSEntry will wait for the operator to choose their desired mode. The
operator's choices may be constrained due to options indicated in the Lock feature.
You can also specify the case upon which to act by adding a semicolon and the case IDs of an existing case (for
example: StartMode=Modify;010502). This will open CSEntry in Modify mode and open the case indicated by
the ID. If the StartMode conflicts with the Lock parameter settings, then the StartMode parameter will take
precedence.
Key= specifies the case ID to automatically open. If a key is provided that exists in the data file, CSEntry will
open that case and then close once the case has been entered. If the case does not exist, a new case is created
and the IDs automatically filled from the values specified. If only a partial key is provided, then only the values
specified are prefilled. (For example, if the IDs are cluster and household number and the key only specifies the
cluster, then the cluster will be automatically filled but the interviewer will have to specify the household number.)
The StartMode attribute takes precedence over the Key attribute. You generally will use only one, if any, in your
design. This attribute is typically used in menu programs, often with the output of the key logic function.
[Files]
The [Files] block is required. A description of the files, not all of which have to be specified, is as follows:
[ExternalFiles]
If the [ExternalFiles] block is present, it means that a second (or more) dictionary was linked to the data entry
application. In the example above, LOOKUP_DICT is the dictionary name, and LookupFile.dat is the name of the data file
that contains the lookup codes.
Parameter=allows you to pass in a string to your program. The parameter can be any length, although the string
that retrieves the value in your program (via the sysparm function) must be large enough to accommodate it. Once
the parameter is retrieved, it can be parsed by your program for further usage.
Language=specifies the initial language of the program. The parameter must match the name of a language
specified in the question text, dictionary, or message file.
OnExit=specifies a PFF file to run after the data entry program closes. This can be useful, for example, if you
want to relaunch a menu program after collecting data.
When starting a new case, if the name of a parameter matches the name of a dictionary item, the value of that item will
be set to the value specified in the PFF. This is similar to how the persistent fields are processed (see below), but this
works for non-persistent fields.
[DataEntryIDs]
The [DataEntryIDs] block is for use with any persistent IDs that you have defined. CSEntry will assign the specified
values to the indicated persistent fields when a new data file is created. This feature allows automatic definition of
persistent fields, such as geographic IDs. If you provide values and run this on an existing data file, and the PFF file
values do not match the values in the data file, then the PFF values will be ignored. The syntax is as follows:
id_item_name=initial_value
This generated file cannot be changed in any way. It cannot be opened by the CSPro designer and it cannot be read in a
text editor. If you make changes to the data entry application, you must generate the binary application again.
The binary application will run identically in CSEntry to a normal (text) application. An operator will not see any difference
in behavior.
To generate a binary application, open the application in the normal way, then from the File menu select Publish Entry
Application (.pen).
Before CSPro 6.0, binary data entry applications had the extension .enc. The 'P' in the new extension stands for
"portable," which indicates that the file can be run not only on desktop CSEntry, but also on mobile devices. CSPro can
no longer read .enc files and if you would like to run an application with such a format, you will have to use an older
version of CSPro.
To change the form order, simply drag and drop on the form tree. For example, suppose you currently have Form A,
Form C, Form D and Form B in your level. The forms tree will now show the following:
1. Form A
2. Form C
3. Form D
4. Form B
If you drag the form icon for Form B and drop it on top of Form C, the forms tree will now show:
1. Form A
2. Form B
3. Form C
4. Form D
Similarly, to change the order of fields within a form, use drag and drop on the forms tree.
To change the order of fields in a roster you must drag and drop within the roster rather than on the tree. You can change
the default order in which the forms and fields will be keyed by using logic in the application.
Application Options
Ask for operator ID: If this box is checked, CSEntry will prompt the operator to enter an operator ID.
Confirm end-of-case: If this box is checked, CSEntry will prompt the operator to accept the case at the end of
each case entered.
Allow partial save: If this box is checked, CSEntry will allow the operator, when in add, modify, or verify mode,
to save a case which has not been completed. You can also have CSEntry automatically partially save cases.
Show case tree: There are four options for the case tree selection. There are options to never show a case tree,
to show it only on mobile devices (Android), to show it only on desktop devices, and to show it on all devices. If
shown, CSEntry will display a tree on the left showing each item in the case currently being added, modified, or
verified and its value. The tree can also be used to navigate between parts of the questionnaire.
CAPI mode: If this box is checked, CSEntry will display the Computer-Assisted Personal Interviewing (CAPI)
window. The top part is for question text (to be read during the interview). The bottom is for the normal form
content.
Always show refusals: If this box is checked, refused values defined in a value set will always be shown to the
operator. If unchecked, the operator can view refused values by selecting the Show Refusal Options menu option.
Center forms on screen: If this box is checked, CSEntry will center forms horizontally in the display window.
Decimal mark is a comma: If this box is checked, CSEntry will use a comma instead of a period when showing
numeric fields with a decimal component. The keyer must also type a comma instead of a period to enter the
values after the decimal mark.
Right-to-left entry: If this box is checked, CSEntry will orient rosters in a right-to-left manner, which is useful for
languages like Arabic. The first column of a roster will be the furthest right in the roster and the roster will scroll
from right to left..
Mobile Options
If you are running CSEntry on a mobile device, there are some options to consider:
Automatically advance on selection: When clicking on a response (when displayed as a radio button),
CSEntry will automatically advance to the next field, rather than requiring the interviewer to swipe or press the
next button.
Show field labels above question text: Each field's label will be displayed on the screen above the field's
Frequency: This is the interval between cases that CSEntry will use for verification. For example, if this value is
10, every 10th case will be verified.
Start: This is the number of the first case in the data file to verify. For example, if this value is 5, and the
frequency is 10, cases number 5, 15, 25, etc. will be verified. The case number is determined by the physical
order of the cases in the data file. The start value must be less than or equal to the frequency value.
Random Start: You may check this box instead of specifying a start value. CSEntry will then choose a random
number for the start value. The random number will be between 1 and the frequency value.
General Options
Automatic partial saves: You can specify the number of minutes after which a case will automatically be
partially saved.
It may be useful to uncheck both options if creating a menu or CAPI program, as these files are mostly useful in key-
from-paper operations.
Paradata Options
To view options related to paradata collection, select Paradata from the Options menu in a data entry or batch edit
application.
The following is a summary of each kind of event that CSPro can collect:
Data Source: Information about interactions with CSPro data files, including what files were used, what cases
were loaded and saved, etc.
Device State: Details about the data collection device, including indicators about whether GPS, Bluetooth, and
Wi-Fi were enabled; what mobile network the device was using; etc.
External Application: Information about external applications (or CSPro data entry applications) launched from
CSPro.
Field Entry: Details about each field entered, including how long the interviewer spent answering the question,
how the program reached that field, etc.
Field Movement: Information about each field movement, including how the program reached and exited each
field.
Field Validation: Details about CSPro's validation of each entered field, including information on which value set
was used and whether or not the field was successfully validated.
GPS: Information about GPS collection, including the duration to take fresh GPS readings.
Impute: Details about which items were imputed, including the initial and imputed values.
Language Change: Information about which language was initially used and records of language changes during
the course of data collection.
Message: Details about any error message triggered during data collection.
Note: Information about notes left by interviewers (or via logic) during the execution of the program.
Operator Selection: Details about selections made by the operator as a result of requests coming from
functions such as accept or the userbar.
Property: Information about what CSPro settings were used, or changed, during the program's run.
You can explore these events in more detail using the Paradata Viewer's Table Metadata feature.
For advanced users writing SQL queries on the paradata tables, note that the base "event" table links to each of these
event tables, and that some of these events link to other tables with names that end in "_info" or "_instance."
Field values: The Field Validation event can store each value entered into a field. This can be useful if you want
to look at the frequency of valid and invalid values, but if you do not want such data stored in the paradata log, you
should disable this. The Note event also uses this option to determine if note text should be stored in the paradata
log.
GPS coordinates: The GPS event can store the coordinates of the interviewers. If you do not want this data
stored in the paradata log, you should disable the collection of the latitude and longitude coordinates, though
other information (altitude, accuracy, etc.) will still be stored.
Cases loaded via logic loops: This is not an option related to sensitive information, but if your application loops
through external data files in logic (using forcase or loadcase), the paradata log will become very large quickly
due to a preponderance of Data Source events. You can disable this information if you do not plan to analyze it.
Pa ge 161 of 684 Cha nge Da ta Entry Cha ra cteris cs
Initial property values: If this option is enabled, the initial property settings will be stored as Property events in
the paradata log when your application begins. This can lead to a large amount of paradata that is potentially not
useful. If disabled, only information about changes to properties will be stored.
Background Events
Most events are triggered by some action (e.g., an error message appearing), but two events can occur in the
background, even if the interviewer is not doing anything. These background events only occur in data entry applications:
Device state: Information about the device as described above in the Device State event.
GPS location: Unless a GPS reading is in progress, the device's GPS unit will be turned on and CSEntry will
attempt to take a GPS reading. Once a GPS reading with accuracy 10 meters or less is taken, or once two
minutes have elapsed, the GPS unit will be turned off.
The value listed indicates the number of minutes between events occurring, or 0 if the event should be disabled. For
example, a value of 30 for GPS location means that CSEntry will attempt to store the GPS coordinates of the interviewer
every half hour.
Simple Synchronizations
Setting the Synchronization Options
For basic synchronization scenarios, you can configure your data entry application for synchronization by selecting
Synchronization from the Options menu:
To add support for data synchronization to your application, check the Enable synchronization box.
CSWeb: Synchronize with a CSWeb server. This requires a properly configured CSWeb server. The data
dictionary for the application must be uploaded to the CSWeb server. For more information about CSWeb servers,
see the CSWeb help documentation.
Dropbox: Synchronize using the online file sharing service Dropbox. This requires an account with Dropbox.
FTP: Synchronize using an FTP server. This option requires that you have an account configured on an FTP
server.
Synchronize main data file: This setting determines how the main data file is synchronized. There are three
options:
Upload changes to server: Only data that is modified on the device will be sent to the server. No data will
be downloaded from the server. This is the most common option for interviewers when all interviewers will
work on unique assignments.
Download changes from server: Only receive modified data from the server. Do not upload local
changes. This might be used for a supervisor who wants to see what changes interviewers have made but
does not want to make changes themselves.
Sync local and remote changes: Send local changes to the server and download changes from the
server. This option can be used when multiple interviewers need to work on the same assignments. Note
that if both interviewers modify a case at the same time, one will overwrite the changes made by the other.
Simple synchronizations do not support synchronization of external dictionaries or files other than the .pen and .pff files.
To implement these advanced synchronization scenarios, you can create your own synchronization routines using logic
functions.
The synchronization options are used to synchronize data with a server but are not meant for deploying and updating the
application itself. To download an application to a mobile device or update the application files of an existing application
on a mobile device use the Deploy Application tool to package and upload an application to a server. Interviewers can
then download the application to their mobile device using Add Application from the Entry Applications screen of
CSEntry. Interviewers can also download to an application using Update Installed Applications from the Entry
Applications screen of CSEntry.
Running Synchronizations
If the synchronization options have been configured, then the operator can launch the synchronization when running the
data entry application. Synchronization may not be run while entering case data.
On mobile devices, the synchronization may be launched from the case listing view by choosing Synchronize from the
menu or tapping the following icon:
On desktop devices, select the Synchronize option off of the File menu.
Downloading Data
In order to download the data that has been uploaded to the server, use the Data Viewer's download function. This will
download all the data on the server into a single CSPro data file that can be used by other CSPro applications and tools.
To enable the map based case listing screen, select Mapping from the Options menu:
In the Mapping Options dialog, check the Show case listing on map option and choose the variables in your data
dictionary where the coordinates (latitude and longitude) of the cases are stored. These must be numeric variables from
the main data dictionary with at least one decimal place.
For each case in the data file, CSEntry displays a marker on the map at the location specified by the values of the
selected latitude and longitude variables. Completed cases are represented by blue markers and partially completed
cases are represented by red markers. Tapping on a marker displays a popup window on the map containing the case
label or case ID for the case along with any case notes. Tapping on the popup window launches data entry for the
selected case. You can use the setcaselabel and putnote functions to customize the case label and case note
displayed in the marker popup window. Both the case label and case note support a limited subset of HTML tags for
formatting including <b> (bold), <i> (italic) and <font> (font, size and color).
Any cases that have blank values for the latitude or longitude variable will not be displayed on the map. This means, for
example, that a partially saved case where the GPS coordinates have not yet been captured will not be displayed on the
map.
Mapping is currently only supported on Android. No maps are displayed in the case listing when running your application
on Windows.
By default, the cases will be displayed with a Google Maps base map, which requires that the Android device be
connected to the Internet. In cases where no Internet connection is available, you can specify an offline map file to use in
the PFF file.
If you want to change the font for all text that is already on forms, you must press the Apply to all items button.
See also: Change Field Font, Highlighted Function, Color of Fields in Data Entry User's Guide
Press the Reset button to reset the font in all the field boxes on all the forms to the system default.
The computer must have a sound card, with a speaker connected and turned on.
The volume on the sound system must be turned on and sufficiently loud to be heard.
There must be a sound file associated with the Default Sound(Beep) under Control Panel/Sound.
To change the sound, go to Control Panel/Sound and change Default Sound(Beep) to a different sound file.
This is the design stage of the data entry process. This tool allows you to create new forms, add or modify text, enter
lines and/or boxes, add color to the forms or text. If you have a printed questionnaire you will probably want to use it as a
guide when deciding text and field placement, as well as the order of entry for the items.
After you have developed forms to your satisfaction, use CSEntry to input the data.
• Drag a multiple record ( ) from the data dictionary to a blank form. This will generate a roster containing all the
items in the record.
• Drag one item ( ) from a multiple record in the data dictionary to a blank form. This will generate a roster containing
only that item. You can then add more items to the roster one at a time.
Pa ge 168 of 684 Add Things to a Form
• Drag an item from a multiple record, or the record itself, to a form that contains only items from another single record
or ID items.
• Drag a multiple item or sub-item ( ) to a form. If you have a multiple item that has sub-items, and you want to create
a roster of the sub-items, make sure you have the "Use sub-items if present" box checked in the Drag Options
dialog box.
See also: Add Things to a Roster, Change Roster Properties
Right click on the light gray space in the desired column of the roster and select "Add Text". You will see the Grid
Text Properties window. Note that you can choose the text placement and select whether the text will go only in the
cell in which you clicked, or if it will go at the same position in every cell in the column. You can change this attribute
later if you want.
• Add Boxes
Right click on the gray space in the roster and select "Add Boxes". Note that you can choose whether the boxes will
go only in the cell in which you clicked, or if they will go at the same position in every cell in the column. Drawing
boxes in a roster is essentially the same as drawing boxes on a form.
When you select multiple items with the mouse, you'll notice during the selection process a box that drags with you to
show what you're including. To draw a box on a form, it seemed logical to have that same mechanism at work, so we've
introduced the Select Items/Boxes button. Click on to toggle between the two states. When you first click on this
button it will appear depressed, and a floating toolbar will appear with the following buttons:
Click To
Toggle states between selecting items and drawing boxes without having to close down the toolbar.
Draw a box with an etched edge.
Draw a box with a raised edge.
Draw a box with a thin edge.
Draw a box with a thick edge.
When you have finished drawing boxes and no longer need the Box-Draw toolbar, close it down by either toggling the
button, or close the Box-Draw toolbar.
To quickly select several fields in their entirety, just grab their data entry boxes. This will cause automatic selection of
any accompanying text, as well. To quickly select just the text portion of several fields, be sure that the selection field
visible on the screen does not touch any of the data entry boxes.
To select several items, hold down the left mouse button while dragging a selection box around the desired items.
Or you can also hold down the Ctrl key while individually clicking on each item (box or text) to be selected with
your left mouse button.
To select both a data entry box and its associated description text, hold down the Shift key while clicking on
either the entry box or its associated text.
To select several data entry boxes and their associated descriptions, hold down both the Shift and Ctrl keys
while individually clicking on either the edit box or its text for each different field.
Move Things
When you drag a dictionary item onto a form, it will be placed on the form at the point where you released the mouse
button. The dictionary label will be used as identifying text for the field, and it will be placed on the form according to the
Drag Options in effect, which may mean the item becomes rostered. Once the field is on a form, you can fine-tune its
placement.
• Move a Field
To move a field, select the box and drag it to the desired location. Each field has a text item associated with it. You
can see the text by holding down the Shift key and clicking on the field. This will select both the field and its text. You
can now move both of them together by dragging and dropping. You can move a field's associated text separately by
simply dragging and dropping only the selected text.
• Move a Roster
To move a roster, select it by clicking on the gray space in any cell or in the small box in the top left corner of the
roster. Drag it to the desired location. You can also resize a roster.
• Move Text
To move any text, simply select and drag it to the desired location.
• Move a Block of Items
First select a block of items. Then, move the mouse over one of the tracker regions selected. When you see the
mouse cursor change from to , you are ready to move the block. Press down with the left mouse button and
drag it to its new location.
A tracker (or tracker region) refers to the item(s) that has(have) been selected with the mouse. Visually, you will see a
heavy dashed line drawn around the item(s).
Left
This will take the left-most item (whether the text of a field, the data entry box of a field, a roster, etc.) and use it as
the basis for aligning all other selected elements. This alignment method works best for fields that have been placed
on the form in a top-to-bottom manner, with text either to the left or right of the respective data entry box.
Center
This will take the [horizontal] mid-point between the left-most and right-most items (whether the text of a field, the
data entry box of a field, a roster, etc.) and use it as the basis for centering all the selected elements. This alignment
method works best to center text items that have been placed in a top-to-bottom manner, or to center the text of a
field over the data entry box.
Right
This will take the right-most item (whether the text of a field, the data entry box of a field, a roster, etc.) and use it as
the basis for aligning all other selected elements. This alignment method works best for fields that have been placed
on the form in a top-to-bottom manner, with text either to the left or right of the respective data entry box.
Top
This will take the top-most item (whether the text of a field, the data entry box of a field, a roster, etc.) and use it as
the basis for aligning all other selected elements. This alignment method works best for fields that are spread out
across the form in a left-to-right manner, with text either above or below the respective data entry box.
Mid
This will take the [vertical] mid-point between the top-most and bottom-most items (whether the text of a field, the
data entry box of a field, a roster, etc.) and use it as the basis for aligning all the selected elements on the mid-point.
This alignment method works best to center text items that have been placed in a left-to-right manner, or to center
the text of a field next to the data entry box.
Bottom
This will take the bottom-most item (whether the text of a field, the data entry box of a field, a roster, etc.) and use it
as a basis for aligning all other selected elements. This alignment method works best for fields that are spread out
across the form in a left-to-right manner, with text either above or below the respective data entry box.
Evenly Horizontal
This will evenly space three or more items (whether the text of a field, the data entry box of a field, a roster, etc.)
horizontally. The left-most and right-most items will not move. This alignment works best to evenly space data entry
boxes across the screen.
Evenly Vertical
This will evenly space three or more items (whether the text of a field, the data entry box of a field, a roster, etc.)
vertically. The top-most and bottom-most items will not move. This alignment works best to evenly space data entry
boxes one above the other.
Please note that aligning items could have unintended results. For example, if your fields are spread across the form
Splitting a Column
Click on the column heading to select it, then right-click and choose Split. If a column already has more than one field in
it (from a previous join), you can Split the column so that there is one column for each field.
To Delete a Form
Click on the form on the forms tree and press Delete, then choose Delete Form from the main menu at the top.
If the application does not match, the system will indicate that discrepancies exist and will ask you to permit the
updating of the application. If you prefer not to accept the update or if you wish to first investigate the cause of the
discrepancy, you may answer No and the system will not open or update the application. You can then verify that you
have the correct dictionary and review any changes that might have been made since the last time the application was
opened.
• Label
This is descriptive text that helps you identify the current forms file. It may contain any characters (including blanks)
and be up to 120 characters long.
• Name
This is the name of the forms file which you would use when writing programming logic. It must consist of letters,
digits and the underscore ('_') character. It must not begin or end with underscore.
You can see either labels or names on the forms tree. Press Ctrl+T to switch back and forth between them.
• Label
This is descriptive text that helps you identify the current level. It may contain any characters (including blanks) and be
up to 120 characters long.
• Name
This is the name of the level which you would use when writing programming logic. It must consist of letters, digits and
the underscore ('_') character. It must not begin or end with underscore.
You can see either labels or names on the forms tree. Press Ctrl+T to switch back and forth between them.
Label
This is descriptive text that helps you identify the current form. It may contain any characters (including blanks) and be
up to 120 characters long.
Name
This is the name of the form which you would use when writing programming logic. It must consist of letters, digits, and
the underscore ('_') character. It must not begin or end with underscore.
You can see either labels or names on the forms tree. Press Ctrl+T to switch back and forth between them.
Color
The button shows the color of the form. To change the form color, click on this button, select a new color and click OK.
You can change the form color back to what it was originally (usually gray) by clicking on the Reset Default Color
button. You can make all forms the same color by clicking on the Apply to All button.
Name
This is the name of the block, which you use when writing programming logic. It must consist of letters, digits, and the
underscore ('_') character. The name must begin with a letter and cannot begin or end with an underscore.
Field Name
Screen Text
This is the text that is associated with the data entry box on the form. It is also the text that is displayed on the mobile
case tree for the field. If Linked to dictionary item is checked, then the screen text is synced to the label of the
associated dictionary item. That is, if the dictionary item's label changes, then the screen text will change automatically.
Skip to
This is the name of the field that will be skipped to if the operator presses the plus (+) key on the numeric keypad. If the
"skip to" field it is blank and the plus key is pressed, CSPro skips to the next field in sequence. "Skip to" is available
only in operator-controlled data entry mode.
Capture Type
This attribute allows you to specify an extended control (a popup window that shows entries in the value set) for the field,
or to change the appearance of an alpha field.
Validation Method
This attribute allows you to specify how CSEntry validates the value entered in the field. The default option Use value set
and capture type means that CSEntry will ensure that a value is part of the field's value set (if applicable), and also, for
date and check box capture types, that the value is valid (beyond simply the value set checks). The option Allow out of
range with confirmation means that, if the value is not in the value set or invalid based on the capture type, the
operator will be presented with a confirmation message and the operator can accept the invalid value. The option Allow
out of range without confirmation allows the entry of any value into the field. More information about validation can be
found on the extended controls page.
Keyboard Input
This attribute allows you to choose a keyboard ID that specifies what keyboard input to use for the field, which may be
useful in multiple language environments.
Persistent
Check this box to make a field persistent. Persistent fields are ID fields that take the value from the previous case in the
data file as their default. Persistent fields are typically used for geographic IDs that change very rarely from one case to
another. These fields are shown as light gray boxes on the form. In CSEntry, the operator must press a special key (F7)
to change the value of a persistent field. You can make any ID field (except for mirror fields) persistent.
Auto Increment
Check this box to make a field auto increment. Auto increment fields are ID fields that receive an automatic value based
on a calculation using the other cases in a data file. For the first case in the data file, the ID field is set to 1. For
subsequent cases, the field takes the value of the highest value in the data file plus 1. Auto increment fields are similar
to sequential fields except that they work on ID fields across multiple cases, whereas sequential fields work across a
repeating group in a single case.
Sequential
Check this box to make a field sequential. Sequential fields automatically increment at data entry time. They are
commonly used as occurrence-number fields in multiple groups. A sequential field takes the value 1 on the first
occurrence. For subsequent occurrences, CSEntry will use the value of the previous occurrence and add 1. If the field is
not also marked as protected, the operator may change the sequence at any time by simply keying a new value, and
from that point, CSEntry will use this new value to continue the sequential incrementation. You can make any field
(except for mirror fields) sequential. You can define your own kinds of sequential behavior for fields by writing pre-
processing logic. In this case, do not use the sequential field attribute.
Mirror
Mirror fields show the value of a previously entered field on the screen. The cursor never goes to a mirror field during data
entry. Mirror fields are useful to display values from one screen on another screen. Any singly occurring item can be a
mirror field. A common use of mirror fields is to show the geographic IDs on all screens. The first form might contain the
geographic ID fields which the operator keys in, and subsequent forms might contain the geographic ID mirror fields,
which will show the operator the ID values even when the ID form is not on screen. The first time you drag a dictionary
item onto a form you create the normal entry field. On each subsequent occasion that you drag the same dictionary item
onto a form, you create a mirror field.
Upper Case
Check this box to make a field upper case. Alphanumeric fields can be upper case. This means that every alphabetic
character that is keyed will be forced to upper case.
Verify
Check this box if you want to verify the field when the operator is in verification mode. If left unchecked, verification is
skipped. If checked, verification will occur as follows: after each field is keyed, the value entered is compared with the
value currently in the data file. If there is a difference, an error message is displayed, and the field must be reentered.
The Data Capture Type options allow you to change the type of extended control used for the field. It will list the union
of valid possibilities for the selected fields, but if a change is requested it will apply the change only to fields that can
support the specified type. For example, selecting Check Box will only apply that option to alphanumeric fields that can
support checkboxes. The capture type of fields that do not support the selected type will remain unchanged. If you would
like to set the capture type to the default based on the field's item and value set properties, select <default for field>.
If you selected multiple fields on a form to bring up this dialog but would like to apply the changes to all the fields in your
application, select the Apply to all fields option.
You can specify a specific keyboard input method for a field if you would like to avoid making keyers change input
methods manually. The list is populated by currently activate input methods. If an input method is not listed, activate it
first, typically by modifying the Regional and Language Options in the Control Panel.
The keyboard input method will only apply to the selected field. After the keyer moves away from the field, the input
In the above scenario, the keyboard input method will only be modified for Field3, which will be entered in Armenian.
However, suppose that the keyer manually changes the keyboard from the default keyboard to Arabic while on the
Field2. In that case, Field3 will be entered in Armenian, and then the input method will be changed back to Arabic for
Field4. In other words, the Default Keyboard input method resets the keyboard back to whatever input method was last
active on another field with a keyboard ID of 0.
When CSEntry loads the data entry program, it also loads any keyboard input methods that are not currently active on
the machine. The input methods will be unloaded when CSEntry terminates. However, some input methods require
additional files that do not come in a standard Windows installation. CSEntry can only use input methods that contain all
the required files. If the files do not exist, the Default Keyboard will be used for those fields. Before deploying your
application you will want to ensure that the required input methods are installed on each keyer's machine.
See also: ChangeKeyboard Function, Change Field Properties for Multiple Fields
Roster Label
This is descriptive text that helps you identify the current roster. It may contain any characters (including blanks) and be
up to 120 characters long.
Roster Name
This is the name of the roster which you would use when writing programming logic. It must consist of letters, digits and
the underscore (_) character. It must not begin or end with underscore.
Pa ge 181 of 684 Cha nge Form Proper es
Occurrence Control Field Name
If set to a singly-occuring field on the form, entry in the roster will stop after the keyer has entered the number of rows
determined by the occurrence control field name's value.
Orientation
This defines whether the cursor will move from left to right or from top to bottom during data entry.
Free Movement
This defines the order in which to enter a record with multiple occurrences: all the items in a record n times or each item
n times before continuing with the next item
You can see either labels or names on the forms tree. Press Ctrl+T to switch back and forth between them.
• Column Heading
This is the text that shows in the heading. CSEntry may automatically wrap text to make two or more lines, if the
column width is small. You can force your own multi-line text by using Ctrl+Enter at the end of each line. For example
if you type "Age of", then Ctrl+Enter, then "Mother", you will have two lines of text no matter how wide the column is.
• Horizontal alignment
This allows you to force the text to be left-aligned, right-aligned, or centered within the column heading area.
• Vertical alignment
This allows you to force the text to be aligned at the top, middle, or bottom of the column heading area.
• Font
To change the font of the column heading text, choose the "Use custom font for text" radio button then click on the
"Choose font" button.
• Text Color
The button shows the color of the selected text. To change the text color, click on this button, select a new color and
• Row Heading
This is the text that shows next to the row. CSEntry may automatically wrap text to make two or more lines, if the
width of the occurrence label area is small. You can force your own multi-line text by using Ctrl+Enter at the end of
each line. For example if you type "College or", then Ctrl+Enter, then "University", you will have two lines of text no
matter how wide the occurrence label area is.
See "Change Column Heading Properties" for a description of the rest of the properties.
Font
This shows what font is in effect for the selected text. To change the font, select the Use custom font radio button then
click on the Choose font button.
Text Color
The button shows the color of the selected text. To change the text color, click on this button, select a new color and
click OK. You can change the text color back to what it was originally (usually black) by clicking on the Reset Default
Color button. You can make all text on all forms the same color by clicking on the Apply to All button.
Write Notes
You can add notes to a field, which can be viewed by the operator and/or edited. These notes can be used to display
instructions to the keyer or to elaborate on a particular value. See the putnote, getnote, and editnote functions.
Generate Reports
You can write customized reports to a file. For example, you could create a report to summarize the demographic
characteristics in the household.
Structure Edits
Structure edits can be used to check coverage and to establish that each case has the correct number and type of
records. The tests that make up a structure edit ensure that the questionnaires are complete before beginning the
consistency edit. In a typical Housing and Population census, the structure edits will check that:
For all collective and conventional households within an enumeration area, all required records are present and are
in the proper order.
Each enumeration area (EA) batch has the correct geographic codes.
Where required, each household has one and only one housing record.
Each occupied housing unit has at least one person record, and vacant housing units (where permitted) have no
person records.
Within a household (conventional or collective), there must be one and only one person record for each member of
the household (i.e., no duplicate or missing records) such that the total number of person records is equal to the
recorded count of persons in the household.
Within each conventional household, there is one and only one qualified head of household; within each collective
household, there is no head of household.
Consistency Edits
Consistency edits look at the individual data items within a questionnaire or case, and examine the validity of each item
and the consistency of each item with respect to other related items. For example, the degree of educational attainment
of an individual should have some predictable relationship with the person's age. This means that it is not expected that a
9-year-old child will have progressed much beyond the fourth or fifth year of elementary school, depending on local
standards. If, in the unedited census data, a 9-year-old indicates educational attainment above that level, it is probable
that either the age or the educational level have been incorrectly recorded. Whether the age or the educational level is
modified to produce data consistency is a decision left to the subject-matter specialists, who should know whether age
data or education data are more correctly reported. It is also important to take into account the method of data capture;
some, like scanning, are prone to systemic error (that is, where a "0" is consistently read as a "9," for example). If such
error is not recognized and taken into consideration when assessing the reliability of reported information, it can lead to
even greater error in the final data.
At a minimum, consistency checks should seek to resolve all errors which might eventually lead to doubts about the
quality of the data. That is, the subject-matter specialists must consider the types of tabulations to be produced and the
uses to which the data may be put in the future. For example, if the plan includes tabulations of educational achievement
by age and sex, the edit specifications must include a means of detecting and correcting the data for individuals whose
declared educational achievement is not in line with their declared age (as in the example of the preceding paragraph). If
a published table shows even one 9-year-old having completed secondary school, the quality of the data will be called
into question, simply because (in that particular culture) it is not the norm that a child so young could be so advanced
educationally. It makes no difference if, in fact, the child is a prodigy and did indeed complete secondary school; when it
is a case of one or two "outliers" and the weight of probability is against them, the values must be changed and brought
in line with reasonable expectations. It is not recommended that such changes be made only in the logic which
generates the tables; if the data (even a subset) are to be given out to researchers or other users outside the statistical
office, they must be clean and consistent throughout before distribution.
So what does this code do? If a person's age is in the range from 0 to 110 ('0' is for infants born less than 12 months
before the Census reference date), then the value is accepted as valid and program flow exits the procedure. If not, then
the value is either outside this range or missing, in which case the subsequent errmsg statement will be executed,
showing the reported age for Person N.
More involved edits may be needed for other variables. For example, fertility information is only asked of a female of a
certain age. So if fertility information is present, you may wish to confirm the values of other variables. A possible test
could be as follows:
PROC FERTILITY
if FERTILITY in 0:20 then { there are 0-20 children }
if sex = 1 then {if sex = male}
errmsg ("male has fertility info present"); {message displayed}
reenter; {operator must enter valid value}
else {sex is not male}
if sex = 2 then {if sex = female, check woman's age }
if age < 15 then { 15 = minimum age for fertility }
errmsg ("woman is too young (%d) to have children", age);
endif;
endif;
endif;
else { Fertility is blank }
if sex = 2 then { a problem if the woman is "of age" }
if age >= 15 then
errmsg ("woman aged %d should have fertility info", age);
endif;
endif;
endif;
As you see, the complexity of the logic used to find (and soon, correct!) errors will depend on the specifications
provided. In the case where specifications are minimal, it is important that the programmer consider all
situations/paths in developing the logic.
The screen is divided into three main work areas: the Tree View, the Logic View and the Message View.
• Tree View
The window on the left half of the screen displays the data entry tree with the first form ( ) selected.
• Logic View
This is the window block in the upper portion of the right half of the screen. It is the programmer's "clean slate, to
which may be added logic for any part of the data file: any item, any section, any record, even the file as a whole. It is
up to the programmer to determine the correct placement and sequence of execution for each logical element. The
initial screen (PROC IDSO_FORM) represents the Identification form or initial form. If you move to the top of the tree (
), then the logic view will display:
{Application 'MYDATAENTRY' logic file generated by CSPro }
PROC MYDICT_FF
This is the beginning of your program. This two lines of code will always be in your application file. You write the
declaration statements under PROC GLOBAL, then the procedures for the event.
If code has been written for a given edit level, form, roster, or field, a check mark will appear superimposed on the icon
for that entity. This is how, at a quick glance, you can see where you have placed programming logic. Once one line of
code has been written anywhere in the program, a check mark will appear on the Forms File icon.
• Message View
This is the window block in the lower portion of the right half of the screen. It is devoted to messages (user-created and
system-generated). As with the Tree View, tabs are available to the programmer; clicking on one of them will make the
contents of that view active. The Compiler Output tab displays errors found during compilation of your program; if the
code compiled successfully, it will state "Compile Successful." The Message tab is used to type in error messages
that will be used in the execution of the program.
If you wish to modify the size of any of these three work areas, just place the mouse over one of the separating bars,
grab it, and drag to resize.
For example, suppose you select AGE from the tree and there is no associated programming logic, you will see in the
logic view:
PROC AGE
Since there is no logic, PROC AGE is generated "on-the-fly" and will not be saved in the logic file. If there was associated
logic, you might see something like this:
PROC AGE
if not ( AGE in 0:99 ) then
errmsg("Invalid age found");
endif;
CSPro is based on the concept of a "case" containing one or more types of "collections of information." The "case" will
usually correspond to the questionnaire used in the survey or census, and the "collections of information" [or groups of
data items] will usually correspond to one or more record types that make up the case/questionnaire. These elements
constitute a hierarchy, and in applying logic, CSPro follows this hierarchy; that is, it begins with any logic that pertains to
the file itself, and works "down the tree" through the various levels. CSPro executes application events one case at a
time. During data entry, preproc, onfocus, onoccchange, killfocus and postproc statements are executed in the
order in which they are encountered.
The following diagram illustrates the default order of events for a two-level data entry application that has no skip or
advance statements that might otherwise alter the program's natural flow. Level 1 has two forms (1 and 2) and level 2 has
one form (3). This description applies to both system- and operator-controlled applications.
Note that the natural flow through the fields can also be altered by the use of the cursor or mouse. For example:
Scenario 1
Given: Three fields (A, B, and C). From Field A, click on Field C. Assume all fields have preproc, onfocus,
killfocus, and postproc events.
Result: Field A's killfocus would execute, but its postproc would not. Nothing would execute for Field B. Finally,
Field C's preproc and onfocus would execute.
Scenario 2
Essentially, if the programmer uses logic, or if the data entry operator moves backwards or forwards with the mouse or
arrow keys, the natural flow of the program will be altered. If exiting a form, field, or roster prematurely, the killfocus
event will execute but the postproc event will not. Similarly, if entering a form, field, or roster by backing up into it, the
onfocus event will execute but the preproc event for it will not.
See also: CSPro Program Structure, Order of Executing Batch Edit Events
Compile an Application
When CSPro compiles your application, it checks the logic you have written to see if there are any errors or warnings.
Errors typically include spelling a command incorrectly, not using proper command syntax, and putting logic in the
wrong place. Error messages appear in the panel at the bottom of the right-hand screen and a red dot appears to the left
of the line that contains the error. Warnings usually involve using commands in questionable ways. Warning messages
also appear in the panel at the bottom of the right-hand screen and a yellow dot appears to the left of the line that
contains the warning.
You can choose to compile code for a specific item, or for the entire application. To compile code for a specific item,
simply select that item from the edit tree. The associated logic for that item will be displayed in the Logic View. Press
on the toolbar; or from the File menu, select Compile; or press Ctrl+K. The results of your compile will be displayed
in the Compiler Output area at the bottom of the right-hand screen. When you are ready to compile the entire
application, select the root node from the data entry or batch edit tree. This will display all logic written for the application
in the Logic View. You can then press the Compile button or press Ctrl+K.
CSPro always compiles your application when you run the application. If there are errors, you cannot proceed until the
errors are corrected. During code development, you should compile only the logic for an individual data entry or batch edit
entity. This saves time, because the system does not have to recompile the entire logic module. Furthermore, your entire
file may not be ready for compilation (i.e., there are unfinished parts awaiting someone's input), so it will be more efficient
to compile only untested code.
If your logic compiles with no problem the system will display "Compile Successful" in the Message View. Now you are
ready to run the data entry or batch edit application.
User Settings
User settings are accessed from logic with the loadsetting function and are modified using the savesetting function.
Unless otherwise specified, all CSPro applications on a device share the same settings, which make user settings a
convenient way to share information across different applications. For example, a menu program might save some
information:
PROC INTERVIEWER_NAME
savesetting("InterviewerName", INTERVIEWER_NAME);
And then another program might automatically fill in the interviewer name based on the saved setting:
PROC INTERVIEWER
preproc
INTERVIEWER = loadsetting("InterviewerName");
noinput;
Configuration Settings
Configuration settings are stored locally on your device and are loaded at compile time. The value of a configuration
setting is accessible by declaring a string variable using the config type. Unlike user settings, configuration settings
cannot be modified in logic. They can only be modified using the dialog box.
Because binary data entry applications (.pen files) are only compiled with the .pen file is created, the configuration
settings stored on the computer used to create the file are saved in the .pen file.
Configuration settings are a convenient way to store some information outside of your logic file. If you are using a revision
PROC GLOBAL
config ftp_server_url;
PROC EXAMPLE
syncconnect(FTP, ftp_server_url);
For this to work, you need to use the dialog box to add an attribute ftp_server_url with its value being the FTP server
URL.
See also: Config Statement, LoadSetting Function, SaveSetting Function, CSEntry Settings Modification
Run as Batch
You might also run the data entry application after you finish entering data for a file. If you select "Run as Batch" under
the File menu, the system will display:
• Skip Structure
If this option is checked, the application will check for entries in items that should be blank when they are skipped
under certain conditions, and will list those items.
• Check Ranges
If this option is checked, the application will list the items with out-of-range values.
After you press "OK" the system will display the following dialog box:
CAPI offers a flexible approach to collecting data, and can result in improved data quality and more efficient interviewing
and processing. Creating good instruments, however, requires training and experience on the same level as creating
good batch edit programs.
CATI (computer-assisted telephone interviewing) is similar to CAPI, but interviews are conducted over the phone instead
of in person.
CAPI Features
Extended Controls
CAPI Strategies
Multimedia Features
How to ...
CAPI Features
A CAPI application is similar to a regular data entry application, but involves making use of some additional features.
CSPro offers the following features suited to implementing CAPI surveys:
Feature Description
Question text Customized text for each question can be displayed at the top of the entry form.
Help text A help shortcut (F2) can bring up customized help information for each field.
Multiple languages The CAPI entry application can be developed with question and help text in several
languages; the interviewer can switch among languages as needed.
Response boxes During entry, a response list of valid values for each question can be optionally displayed;
these can make the interview go more smoothly.
Notes Interviewers can enter field notes when needed.
Support for partial save CAPI applications can be developed so that they support partial save. This lets the
interviewer save an incomplete case, then return later to complete it.
Customized branching The entry application can determine on the fly which forms or questions to present based on
the user's previous responses. This is accomplished through program logic.
Immediate editing Responses can be edited, and errors or possible mistakes signaled to the interviewer. This
can help improve data quality.
Extended Controls
Occasionally it may be useful to display to an enumerator or keyer the possible values for a given field in a data entry
The CSPro Designer shows the fields that use extended controls by coloring the border of the field in blue. The blue
border does not exist when the data entry application is run in CSEntry. The field border will be a lighter blue color when
the Number Pad capture type is selected.
Text Box
The Text Box attribute is the default option for a field and indicates that no extended control will be displayed. On
Windows, tickmarks divide each character.
For multiline controls, the characters \n will be placed in the field text for every line break. CSEntry will automatically
interpret \n characters as line breaks, but if you use your data with another software package, you may need to adjust
for this behavior. In the above screenshot, the text would appear in the data file as:
100 Main Street, Suite 1234\nAnytown, NY 00000-0000
Both of these Text Box options can also be specified for alpha fields in rosters. To resize the field within a roster you
may need to first adjust the roster's row heights or column widths.
Radio Button
The Radio Button capture type can be selected if an item is either numeric or alphanumeric and if the value set has only
discrete values, meaning that there are no "to values" defined.
Check Box
Check Boxes are used to capture multiple response variables. Items using check boxes must be alphanumeric. Each
value in the value set corresponds to one of the responses and will be displayed as a separate check box. When the
boxes are checked, the corresponding values will be put into the alphanumeric field, from left to right. The length of the
field must be a multiple of the maximum length of a response. For example, if each response code is one character and
you have five responses, then a field of length five would allow for the selection of all responses. A field of length two
would allow for only two responses (e.g., "pick the top two...").
Date
The Date capture type is valid for numeric and alphanumeric items of length 4, 6, or 8. The item does not need to have a
value set. The choice of date format affects how the date is stored in the field.
Number Pad
Some users find it cumbersome to use the on-screen keyboard while entering data on a tablet using Windows. This
control displays a number pad for entering numeric values with a finger or mouse.
Validation Method
When a capture type is used as part of the validation method, acceptance of a value is as follows:
Multimedia Features
When collecting data, especially when conducting a CAPI census or survey, you can take advantage of several
multimedia features available in CSPro. The following features are available when running CSEntry on Android devices:
Feature Description
Audio Record audio in the background or in an interactive mode controlled by an enumerator.
Barcode Scan a barcode or QR code.
Photo Take a photo and save it as a .jpg or .png file.
Signature Capture a signature and save it as a .jpg or .png file.
Consider breaking your application into a series of sections, each of which deals with a topic (earnings, fertility, etc.).
One or more forms are then used for each section's questions.
Avoid scrolling in CAPI applications, since this prevents the interviewer from seeing the entire form (or roster). Also, the
interview environment—laptop, maybe no mouse, possibly poor lighting—often makes scrolling cumbersome.
Also, it is good practice to gather the names of all household members at the start of the interview. Then the interview
can cycle through each member. Doing this minimizes the risk of accidentally omitting household members.
Try to limit the size of your forms so that the interviewer won't have to scroll them. When developing your application, be
sure to test your application using the screen resolution that will be on the laptops used in the field.
Fields
In CAPI data entry, consideration needs to be given to the properties of fields used in the application. The following table
lists certain field properties and how they might be used:
Use enter key This is usually selected, because otherwise the application will automatically advance to the next
field, which might be confusing for the interviewer.
Mirror fields These are very useful in CAPI applications, as they show the contents of fields in other forms. This
can assist the interviewer.
Protected fields CAPI applications use protected fields to show calculated or derived values. These provide useful
feedback to the interviewer.
Force out-of-range This is almost always turned off, since the interviewer should only be allowed to enter valid
responses. However, you should consider including "not applicable", "don't know," and "refused"
response categories.
CAPI applications frequently make greater use of alphanumeric responses than do traditional key-from-paper systems.
This is because of the interactive nature of CAPI, and it also makes the interview more interactive (by capturing names,
for example).
Questions
CSPro shows customized question text in the top part of the entry window. To help the interviewer distinguish between
text for the interviewer and the respondent, consider using different color fonts. For example, things the interviewer should
say to the respondent (for example, "How many children do you have?") might be shown in black. Instructions to the
interviewer (for example, "Ask of each household member") might be shown in blue. CSPro includes two user defined
fonts, CAPI Font #1 and CAPI Font #2, which can facilitate this.
Question text can be customized using fills, or text substitutions. These fills reference the contents of variables,
dictionary items, or user-defined functions. They are identified by the name of the object surrounded by percent
characters (%). During the interview, the value of the variable or data item is substituted into the question text. For
example, "How old was %FIRST_NAME% on January 1?" might be transformed into "How old was Edward on January
1?" Fills help customize the question text and frame it for specific respondents.
Values
A CAPI application can optionally show a popup box containing an item's valid values. This makes it easier for the
interviewer to read and select response categories. The popup box appears to the side of the item, so a response could
also be typed in. The list of responses that appears comes from the item's first value set.
If you decide to use this feature, be sure to create descriptive value sets for your questions, since these will be displayed
as during the interview.
To get started, it is a good idea to define sections containing sets of questions related to one topic. The number of
sections will vary from survey to survey. You might need a section to begin the interview and a section to end the
interview. You also might need some help sections with FAQ or roster of persons in household.
Next you need to develop an order of the sections, that is the order of the interview. Some sections like FAQ or person
roster may be independent of the interview sequence, and may best be placed after the last subject matter section or
after the end of the interview section.
Now with a section you need to define the order of questions. Place one item or several items which are related to one
another on separate forms. The forms will be in the sequence the questions are asked.
CSPro supports multiple languages, and lets you add languages and define question text for each. By default, just one
language (marked as English) is available. You can define additional languages and then enter question text for each as
needed.
During the interview, the interviewer can switch among the application's languages on-the-fly. Each question's text will be
displayed at the top of the screen, in the chosen language. So, if an interviewer arrives at a household and finds that the
respondents prefer to conduct the interview in another language, this can be easily done.
When you develop a multi-language application, it is probably easiest to finalize and test all the question texts in one
language. Then, once this is done, language specialists can translate them into the other languages your application will
support. This is easy to do, since the question text editor can display question text for two languages at the same time.
The translator can copy and paste text or bitmaps, if needed. Finalizing all the question text in one language first also
helps avoid version control problems that might arise if things were translated then later modified.
CSPro fully supports partial save, which causes a case to be saved to disk even if it is not yet complete. The case can
The user function OnStop can be used to trap the stop button ( ) or attempts to exit the interview (for example, by
pressing Alt+F4). The developer might want to include a form that manages the exit process and saves the partial case
to disk.
You could also use OnStop() to intervene and prevent an interview from stopping, or to jump to a callback form before
exiting. Your application also could store the field being entered, so that it could jump to that section when the interview
is later continued.
If it is necessary to return later to finish a CAPI interview, you might want to include a form that asks when the
respondent would prefer to do the follow-up visit. In a production CAPI system, a separate case management system
could take advantage of this information and assist the interviewer in tracking and scheduling appointments.
Prefilling Values
In a data entry application, it is possible to prefill fields using several approaches:
1. Make an ID field persistent, which uses the value entered for a previous case to prefill the value for a new case.
The initial values of persistent fields can be specified in a PFF.
2. Make an ID field auto increment, which takes the value entered for a previous case and increments it by one to
prefill the value for a new case. If no previous case has been entered, the value starts at 1.
3. Use the Key attribute of a PFF to specify the initial value of the case's ID items.
4. Use the Parameters section of a PFF to specify the initial values for non-repeating, non-persistent items.
5. Make a repeating field sequential, which automatically increments a field on a roster, incrementing it by one on
each added occurrence.
6. Specify a value in the preproc of the field; for example:
PROC INTERVIEW_END_TIME
preproc
INTERVIEW_END_TIME = timestamp();
Notes:
After enabling CAPI mode, the forms screen will be divided horizontally. The top part is for question text (to be
read during the interview), the bottom is for the normal form contents.
Define Languages
A CAPI application can contain question text in multiple languages. The interviewer can switch among available
languages as needed.
1. From the CAPI Options menu, select Change CAPI Languages. The CAPI Languages dialog box will be
Notes
Language names follow the same rules as names; that is, they must be unique and cannot contain spaces. Try
using abbreviations like EN (English), ES (Spanish), FR (French), or PT (Portuguese).
Language labels can contain any text to describe the language.
During data entry, the interviewer can easily switch among languages.
Organize Forms
Divide your CAPI questionnaire into sections, one for each topic area. Add any special sections, such a begin and end
section or global help sections, like an FAQ. These sections organizing units for defining forms and movement within the
application.
Break up the questions within a section into forms containing one or several responses. Remember, you only have about
half of the screen at the bottom to use for a form, because the question text may take up the top half of the screen.
If the form contains a roster, you should make sure that only the roster scrolls, not the form. Scrolling forms can be
confusing to interviewers.
1. From the View menu, press CAPI Questions (or press Ctrl+Q). The question editor will be displayed.
2. Select an item from the tree on the left. Any text already defined for this item will be displayed in the editing
window on the right.
3. Enter text into the editing window. You can format the text using the toolbar, or use the formatting choices shown
in the CAPI Options menu.
4. When you have finished entering the question's text, you can select a different item or switch to the form view or
logic view.
The question text editor supports copy and paste operations, including pasting bitmap images.
When used with numeric values, the fill uses the value code. For example, %SEX% might be 1 or 2. If you want to use
the value label, you can use the getvaluelabel function:
householdMembers = "Bouba<br>Frank<br>Chen";
Bouba
Frank
Chen
Or more realistically:
householdMembers = "";
do numeric ctr = 1 while ctr <= totocc(PERSON_REC)
householdMembers = householdMembers + strip(NAME(ctr)) + "<br>";
enddo;
1. Select Change CAPI Font 1 or Change CAPI Font 2 from the CAPI Options menu.
2. A font dialog will appear, where you can change the font type and size. You can also change the font formatting
(underline, boldface, italic) and color.
Change Formatting
Question text formatting options include:
use CAPI font 1 use the font specifications from CAPI font 1
use CAPI font 2 use the font specifications from CAPI font 2
bold make text bold
italic italicize text
underline underline text
color change text color
left Align text left
center Align text center
right Align text right
bullets convert text into a bulleted list
insert bitmap insert a bitmap file (.BMP extension) into the
question text
change CAPI font 1 change the font characteristics for CAPI font 1
change CAPI font 2 change the font characteristics for CAPI font 2
Add Pictures
Pictures and other graphics can be added to your question text. The picture or graphic must in bitmap (*.bmp) file. There
are two ways to do this:
You can also add bitmaps by right-clicking inside the question window, and choosing the Insert Bitmap option.
Audit Undefined Text: This creates a list of fields and blocks that do not have any question text. For most applications,
you will want to define question text for all fields and blocks that an enumerator will visit. This option shows you which
entities may require question text.
Remove Unused Text: If you define question text for a field or block but then remove that entity from a form, its
question text may remain in the .qsf file. This option will remove any unused—not connected to a form entity—question
text from that file.
Harmonize Occurrences: Increases the maximum occurrence number of a field or block's question text to match that
of the controlling record in the dictionary. This is useful if you increase the number of record occurrences; rather than
having to manually increase each question's occurrences values, you can use this option.
Initialize as Dictionary Label: For any field that does not have question text defined, the question text will be set to
the field's item dictionary label.
Paste from Clipboard: Using one of the formats specified, a field or block's question text will be set to the text
specified on the clipboard.
Resource Folders
There may be times, when deploying an application, especially to a mobile device, that you will want to include some
auxiliary files that the application uses. Such files might include lookup files or value set images. To simplify the
deployment of such an application, these files can be packaged in with the data entry .pen file. The files will be
compressed in the .pen file and then decompressed when the user opens the application. CSEntry will recreate the
folder structure and files contained in the resource folders. Files that already exist on the device will be overwritten only if
the timestamp of the .pen file is newer than the timestamp of the file on the device.
To specify a resource folder, select File -> Add Files and then select a resource folder. Your application may have more
than one resource folder. All files in the resource folder, and its subfolders, will be included in the .pen file, so it is
important to think about how to structure your data entry application so that only relevant files are bundled together in the
.pen file. You might, for example, have an Images folder where you put all the value set images that your application
uses. When decompressing data on mobile devices, files will only be saved in the csentry folder or in any of its
subfolders.
If you no longer want to mark a folder as a resource folder, select File -> Drop Files and select the folder that you no
longer want to use.
If your CAPI application supports multiple languages, consider starting by entering text for all the questions in the first
language. Then use can translate each text into the other languages by putting the first language in the top question
window, and translating in the bottom question window. You can also copy and paste between question text windows.
Note: Conditions can include dictionary items or variables, and can even call functions. Conditions may not include
boolean operators such as "and" and "or".
1. Select Resize Question Windows from the CAPI Options menu; or press Ctrl+W.
2. Note that all of the application's forms' question texts are now sized so that they can be seen without scrolling.
Structure Movement
Decide what movements within the case, an interviewer will be allowed to make. For example, will the interviewer only be
allowed to move backwards or forward through the interview questions or will they be allowed to jump around. If they can
jump around, what jumping movements will they be allowed to make. Are there sections that must be completed first?
Are there optional sections that don't have to be filled in? Are there sections that can be completed in any order?
If an interview can be continued at a later time, how will the interview be restarted? By just advancing to the last question
completed? Or by having the interviewer ask a few questions to reorient the interviewer and respondent.
1. From the View menu, press CAPI Questions (or press Ctrl+Q). The question editor will be displayed.
2. Select an item from the tree on the left. Any text already defined for this item will be displayed in the editing
window on the right.
3. Switch from editing questions to editing helps by pressing the button, or by choosing Help Text from the
CAPI Options menu.
4. Enter help text into the editing window. You can format the text using the toolbar, or use the formatting choices
shown in the CAPI Options menu.
When you have finished entering the question's text, you can select a different item or switch to the form view or logic
view.
To turn on the display of values for selection during the interview, use the following statement:
The interviewer can turn the responses on or off during the interview by pressing Ctrl+C or going to the options menu and
selecting Show Responses (This Field).
There are three methods that could be used to collect this data in a CAPI application. Two involve defining in the
dictionary a data item with multiple occurrences. For example if there are 10 possible responses, then the item, usually
a single character, would occur 10 items. On the data entry form, the item would be displayed as a roster with 10
occurrences. Response labels can be given on the form by changing the occurrence labels from numbers from 1 to 10 to
the text of the responses, so the enumerator will know, and can read the possible responses.
One way to collect the responses is to create a roster with free movement, or spreadsheet like behavior, so that
enumerator can move the cursor between occurrences to mark the ones to which the respondent gives positive answers.
Another way to collect the data is to protect the roster and have a separate data item to collect the number of the
response. When the number is entered, the corresponding occurrence of the item is marked. If the number is entered
again the corresponding occurrence is unmarked.
A final way to collect the data is to set the item as an alpha item and use a checkbox to collect data for the item.
To make this happen, a global navigation key needs to be established and handled by the onkey function. The accept
function can be used to collect the enumerators choice of what section to move to and define the first field in the section
to move to. Movement, however, must be made from a location, a field, before any of the sections are encountered in
order maintain the path structure. This field is protected so that no data need to be entered into it.
Below is code for the onkey function to support using F9 to move to another section in the application. The NAVIGATE
data item provides the location of movement.
PROC NAVIGATE
onfocus
$ = "Z"; // need to have a value in the field
postproc
if JumpFlag = 1 then
JumpFlag = 0;
move to JumpField;
endif;
1. Put a form with one alpha item at the end of all forms.
2. Make question text for the item be the help text.
3. Code in the OnKey function a key that will jump to the help form.
4. Code the alpha item on the help form that will be used to jump back.
In the code below, Ctrl+H is used to jump to the general help. The data item X_HELP is on the form with the help text.
PROC GLOBAL
string lastField;
function numeric OnKey(numeric keystroke)
string thisField = getsymbol(); // current field location
if keystroke = 2072 and thisField <> "X_HELP" then // Ctrl+H
lastField = thisField;
move to X_HELP;
endif;
OnKey = keystroke;
end;
PROC X_HELP
reenter lastField;
Test Application
CAPI applications require even more testing than data entry applications from paper forms because of the enumerator's
need to be able to move easily and quickly around the questionnaire.
Go through the case in intended sequence to test skip patterns, range checks, and edit checks.
If breaking off and restarting the interview are allowed, try doing that from different locations. Can you get back to
where you left off easily?
Test moving around within the application, moving backward and forward.
Test moving between sections, if that is allowed.
Test field helps and general helps.
Sharing a single file between multiple simultaneous users is only possible when using non-text files such as CSPro DB
files. Data files in text format cannot be shared.
By default, each keyer will be able to view, modify, and verify cases in the shared data file entered by other keyers. This
can be restricted by setting the CaseListingFilter in the PFF filter. See Run Production Data Entry for more information.
When using synchronization, each keyer enters data to a separate data file on their computer. After entering data, they
use CSEntry's synchronize functionality to transfer the data that they keyed over the network to a server. The combined
data from all keyers can then be downloaded from the server.
To enable synchronization in your data entry application, use the Simple Synchronizations documentation. After entering
data, keyers select Synchronize from the File menu in CSEntry to send data to the server. To download data from the
server, use the Download function of the Data Viewer tool. This will automatically combine the data from all the keyers
into a single data file that can be used for further processing.
Data synchronization is only possible when using the CSPro DB and Encrypted CSPro DB file formats. Data files in text
format cannot be synchronized.
In the Android environment you will only be able to run binary data entry applications (.pen files). You will not be able to
execute .ent files or any other kind of CSPro application (concatenation, batch processing, etc.). This .pen file, along
with the .pff runtime script, constitute the two files needed to run a data entry application on an Android device.
All applications on the Android device must be stored in the csentry folder on your device. Connect your Android device
to your computer and, via USB transfer, copy your .pen and .pff files to the Android device. Generally the device's
external storage is its SD card. The first time you run the CSEntry application, the csentry folder will be automatically
created for you. The application searches subdirectories located in the csentry folder, so it is possible to add multiple
applications while maintaining a clean directory structure.
Starting with CSPro version 7.5, the csentry directory is located at <external
storage>/Android/data/gov.census.cspro.csentry/files/csentry. In earlier versions it was located at <external
storage>/csentry. If you previously had an older version of CSPro on your Android device, the csentry will not be moved
when you upgrade.
You can also transfer applications to your device by using the Deploy Application tool.
Data Security
As in the Windows environment, the Android application does not encrypt data files by default. Fortunately, it is easy to
encrypt an Android device so that all files stored on it are encrypted. On your Android device:
1. Go to Settings.
2. Under the Personal group of options, select Security.
3. Select the Encrypt Phone option and follow the subsequent instructions.
Encrypting the phone will require that you supply a PIN or password to protect the data on the phone.
Android Navigation
Pa ge 215 of 684 Android Da ta Entry
Overview
The CSEntry for Android application is designed to be straightforward to use, with features and options that interviewers
can easily control.
Screen Elements
This illustration showcases the features of the data entry screen.
1. Clicking on the menu icon displays the case tree.
2. This is the name of the survey.
Pa ge 216 of 684 Android Da ta Entry
3. Clicking on the pencil and paper icon allows field-specific notes to be
taken.
4. Clicking on the search icon and typing in a query allows you to filter
the response listing.
5. Clicking on the more options icon displays a menu with additional
options.
6. The field label is a short description of the field that you are currently
entering.
7. This is the question text for the field that you are currently entering.
This will be more descriptive than the field label.
8. These are the response labels. CSEntry can display text boxes, radio
buttons, check boxes, combo boxes, or date selectors.
9. These are the left and right navigation buttons, which correspond to
moving backwards or forwards in the data entry application.
Alternatively, swipe left or right to move backwards or forwards.
Clicking on the Android back button will close out of adding or
modifying a case.
Field Notes
After clicking on the pencil and paper icon, you will have the opportunity to
type a field-specific note. Click the icon again to close the note.
"CSEntry.Menu.Help"
Pa ge 220 of 684 Android Da ta Entry
Whether to show the "Help" menu option. Values: "Yes" (show) or "No" (hide)
"CSEntry.Menu.AddApplication"
Whether to show the "Add Application" menu option. Values: "Yes" (show) or "No" (hide)
"CSEntry.Menu.Settings"
Whether to show the "Settings" menu option. Values: "Yes" (show) or "No" (hide)
{ Settings Options }
"CSEntry.Menu.ShowHiddenApplications"
Whether to show the "Show Hidden Applications" option in the settings. Values: "Yes" (show) or "No" (hide)
"CSEntry.Menu.ReviewAllNotes"
Whether to show the "Review All Notes" menu option. Values: "Yes" (show) or "No" (hide)
"CSEntry.Menu.ShowCaseTree"
Whether to show the "Show Case Tree" menu option on tablets. Values: "Yes" (show) or "No" (hide)
"CSEntry.Menu.ShowRefusals"
Whether to show the "Show Refusal Options" menu option. Values: "Yes" (show) or "No" (hide)
"CSEntry.Setting.ShowNavigationControls"
Whether to display the navigation controls. If this value is set, then the "Show/Hide Navigation Controls" menu option is
hidden. Values: "Yes" (show) or "No" (hide)
"CSEntry.Menu.DeleteOccurrence"
Whether to show the "Delete Occurrence" menu option when clicking on a group. Values: "Yes" (show) or "No"
(hide)
"Setting.ShowCaseTreeInOverlay"
Whether to always show the case tree (as opposed to it being displayed only on user demand). Values:
"BasedOnScreenSize" (generally yes for tablets, no for phones), "Yes" (always), or "No" (hidden until user
demand)
Examples
// prevent the user from adding new applications
savesetting("CSEntry.Menu.AddApplication", "No");
// override the help menu so that it opens the survey documentation
savesetting("CSEntry.Setting.HelpUrl", pathname(Application) + "Enumerator Manual.pdf");
The Batch Edit Designer module allows you to create and modify batch edit applications. A batch edit application is
used to clean (via editing and imputation) your data files.
For examples and methodology on how to develop your edit routines, refer to the United Nations Handbook on Population
and Housing Census Edits.
• Click on the toolbar, or from the File menu, select New. The following dialog box will appear.
• A default name of the data dictionary describing the data file is given. You can use this name or change it. If you give
the name of a dictionary file that already exists, that data dictionary will be used by the application. If you give the
name of a dictionary that does not exist, a new data dictionary will be created.
Pa ge 224 of 684 Crea te a Ba tch Edit Applica on
If you are using an existing CSPro data dictionary, you may begin creating batch edit procedures. If you are creating a
new CSPro data dictionary, you will need to enter information into the dictionary about records, items, and values before
you can create edits.
Batch edit applications consist of the following files:
These two lines of code will always be in your application file. You can delete them, but they will always be
regenerated and placed in your file on open, save, or exit. This is the beginning of your program. You write the
declaration statements under PROC GLOBAL, then the procedures for the event.
• Message View
This is the window block in the lower portion of the right half of the screen. It is devoted to messages (user-created and
system-generated). As with the Tree View, tabs are available to the programmer; clicking on one of them will make the
contents of that view active. The Compiler Output tab displays errors found during compilation of your program; if the
code compiled successfully, it will state "Compile Successful." The Message tab is used to type in error messages
that will be used in the execution of the program.
If you wish to modify the size of any of these three work areas, just place the mouse over one of the separating bars,
grab it, and drag to resize.
See also: Moving Around a Logic Application
• BatchEdit File:
This is the highest level node, i.e., the root node. It is the owner of all code, which is to say [1] level-, record-, and
item-related code, [2] user-defined functions, and the [3] global routine.
• BatchEdit Level:
This is the second-tier tree node, just below the root. It has a 1-to-1 correspondence with the same-named dictionary
level.
• BatchEdit Record:
This is the third-tier tree node, just below its level. It has a 1-to-1 correspondence with the same-named dictionary record.
• BatchEdit Item:
This is the terminal or "leaf"-node; i.e., the lowest accessible level. It has a 1-to-1 correspondence with a dictionary item.
You are free to rename any of the above the unique names via the properties dialog box, but it is recommended that you
retain the original name, so that it is easier for you to see which dictionary entity is being referenced. The batch edit tree
represents the order in which the logic associated with each edit item is executed.
If code has been written for a given edit level, record, or item, a check mark will appear superimposed on the icon for that
entity. This is how, at a quick glance, you can see where you have placed programming logic. Once one line of code has
been written anywhere in the program, a check mark will appear on the root node.
You can never delete edit levels, records, or items (i.e., the entries shown on the edit tab). However, you can change the
order of the logic execution by dragging the items within the Batch Edit tree view. When selecting a new edit item, the
contents of the logic view will change to display the logic for the selected entity.
Pressing Ctrl+T in the batch edit tree will allow you to switch between the labels and the names of the items.
Input Data File: This asks for the name of the data file against which you wish to run your batch application. This
data file will not be modified in any way; it will only be opened, read, and closed.
Output File: The output file is where the results of correcting your data will be written. If you are not making any
corrections in your program, then the generated file will be an exact copy of the original data file. If you are making
corrections to your data file, then this will be the corrected data file. If you do not need a corrected data file, and
are simplying running the program to generate a report, you can select a none data source as the output.
Listing File: This asks for the name of the file to which you want to write the results of the run. Results from
errmsg functions will be written here. This file will always be generated, regardless of whether or not your program
includes errmsg commands. Listing files generally have the extension .lst.
The following rows are optional, based on the logic in your application:
Impute Freq File: If your program contains any impute statements, the results of this command will be written
to this file. The default file extension is .freq.lst, but you can use whatever you'd like. This field is optional;
therefore, if your program contains impute commands, but you forget to specify a frequency file, no file will be
generated. Similarly, if you indicate a frequency file but your program does not contain any impute commands, no
file will be generated.
Write File: If you have any write function calls in your program, the results will be saved to this file. This field is
optional. If no write file is specified, the write function's output is saved to the listing file.
In addition to these rows, there may be rows where you can specify the names of lookup files, external files, and the
paradata log.
The following diagram illustrates the order of operations for a two-level batch edit application. Level 1 has two records (1
and 2) and level 2 has one record (3).
See also: Batch Edit Order, CSPro Program Structure, Order of Executing Data Entry Events
To make your own custom order of the editing items within records, you need to do two things: from the Options menu,
select "Custom Order"; then drag and drop items in the batch edit tree into the order you wish the edits to be performed.
If you rearrange items within a record in the dictionary, the custom order will not change. If you add new items to a
record, the new items will be placed at the end of the record for purposes of editing. If you unselect "Custom Order," the
edit order will return to the order of items in the dictionary.
Using a Batch Edit application to identify errors is a relatively easy task, though care must be taken to do so correctly.
Improperly identifying errors can waste precious personnel resources, so a precise set of rules should be developed with
input from subject-matter specialists.
Just as you have two methods available to you when searching for errors, you have two methods available to you for
correcting errors: manually or automatically.
Manual Correction
Manual correction of a census could take years, and the possibility of human error is great. When large volumes of
data are collected in censuses and surveys, it is not always practical to refer to the original document in order to
correct an error. Often the data recorded on the original questionnaires are wrong or inconsistent.
Automatic Correction
With computer editing, both time and the possibility of human error is reduced significantly (just how much depends
on how well your logic is written!). The high degree of accuracy and uniformity in computer editing cannot be obtained
in manual editing. In computer editing, range checks and within-record consistency checks can easily be made,
between-record edits can be done, and unknown information can be allocated (imputed) automatically. If an allocation
method is used, you should strive to retain as much of the original information as possible. With a computer editing
and imputation system like CSBatch, erroneous data can be corrected immediately and reports can be generated of
all errors found and all changes made.
The programmer should plan and design computer edits to inspect the data and have the computer correct them
according to specifications supplied by a subject-matter specialist. It would most likely be an extension of the original
program written to find the errors—when you reach the point where there is an error, instead of (or in addition to)
printing out an error message, you should now correct it with an appropriate value.
Actual methods of making corrections vary depending upon the item. In most instances, data items can be assigned
valid codes with reasonable assurance that they are correct by using responses for other data items within the record, or
in other records in the questionnaire. When recorded responses are missing, impossible, inconsistent, or unreasonable
and cannot be determined from other responses in the same questionnaire, the hot deck technique can be used to
assign entries.
The fewest number of changes should be made to the originally recorded data. You are only trying to make a
record or questionnaire acceptable, not make it conform to what you think should be acceptable.
If you must change a data value, do so only once. If you change a person's age, then later find this age doesn't
work for another edit, then you didn't write the original edit correctly. Go back and review the first edit.
If the hot deck method of imputation is used, it is important that the edit specifications indicate where, during the
processing, hot decks are to be updated, that is, at which points in the logic the data items can be considered valid.
Imputation
Imputation refers to the process of providing values for missing, erroneous, or inconsistent responses. For example, if a
person's sex code is invalid (i.e., out of range or otherwise unacceptable) or missing; then an appropriate response
should be substituted.
You may decide that for missing data, you'd rather just keep it "missing" and publish your tables with an extra column
(or row) for unknown values. This is a very cumbersome method, however, as the number of missing values will vary for
each data item, and so the number of missing entries will vary from table to table, making the data difficult to analyze.
Inconsistent responses occur when a response yields an impossible situation with respect to another response. For
example, if a 5-year-old female reports having children, either her age is wrong, or her fertility data are wrong (i.e., that
section should be blank). This type of error must be corrected, as your users will place very little faith in the quality of
your data if this type of condition becomes evident in the tabulations. Many users also do not look kindly on "missing" or
unreported data. Of course, nothing can correct for bad data, and if you find that a significant amount of your data are
bad (poorly designed questionnaire, inadequate field procedures, inattentive coders and keyers, etc), you may want to
reconsider whether the data should be released at all.
Procedures have been developed to provide the missing information, thereby avoiding discrepancies and the need to
determine percentages twice (with and without unknowns). For a detailed discussion on using imputation and the
methods available to you, please refer to the United Nations Handbook on Population and Housing Census Edits.
Static Imputation
Static imputation means providing a value from a predetermined set of values. Suppose a person's sex is missing or
invalid (out of range). If we decide to change the response using static imputation, there are two basic methods to use:
hard coded or from a "cold deck."
Hard Coded
Using our example above, we would programmatically set SEX to the value we think it should be. For example:
What we've done above is a very primitive form of imputation. Essentially, when we encounter a bad value for sex, 50% of
the time the variable SEX will be assigned the value "male," and 50% of the time the value "female." Note that no
accommodation was made for other responses; for example, if fertility data were present, you might not wish to make
this person "male." Or if this were an enumeration of a prison where the entire population is male, you would probably not
want to be adding females to this group! So while this method can be used, you need to take into account other
responses. We attempt to do this in our next method of static imputation, where we use a cold deck.
Cold Deck
With the cold deck approach, known information about individuals with similar characteristics (sex, age, relationship to
household head, economic status, etc.) is used to determine the "most appropriate" response to be used when some
piece (or pieces) of related information is invalid.
For example, suppose a person's age is missing or invalid. We might have a table as follows, where the row indices
represent the person's sex (1 = male, 2 = female), and the column indices refers to the person's relationship codes (1-5)
(this table assumes that the relationship and sex codes have already been corrected):
Head (1) Spouse (2) Child (3) Other Relative (4) Non-Relative (5)
Male (1) 35 50 10 41 65
Female (2) 32 48 10 37 68
In the event that a female child was found to have a missing age, she would be given the age of 10. If a female head of
the household had a missing age, then her age would be given as 32. This method is acceptable if you do not need to
use it often; that is, if very little of your data is missing or invalid. Also, if your population is fairly homogeneous (for
example if you were correcting for religion and 90% of the population is Muslim), then this will not result in an unrealistic
portrayal of your country.
However, if you find yourself referring to this table often, or you have a very diverse population where a few static values
do not give an accurate portrayal, then your data will end up skewed. For these reasons (and others), dynamic
imputation is the preferred method.
For example, assume, for a given person, that the age, relationship, and sex codes appear correct and that consistency
checks validate these items. You can use the values from this person to update your "cold deck," thus making it a "hot
deck." Below is the table given in the cold deck example:
Head (1) Spouse (2) Child (3) Other Relative (4) Non-Relative (5)
Male (1) 35 50 10 41 65
Female (2) 32 48 10 37 68
If the person in question is a male 6-year-old child, the table can be updated with new information, giving the following:
You would proceed in this way for every person in the household who had correct age, sex, and relationship values.
Then, when you encounter a person with an invalid or missing age, you can extract from the table, using the sex and
relationship of the person, a value for age. This value is more likely to be appropriate for the person than would be a
purely random value. (The preceding example is clearly a simplification of the hot deck technique, which requires great
care in constructing and updating the tables used for allocation.)
DeckArrays
Using "DeckArrays" in CSPro is a simplified way of working with hotdecks. It is important to understand the hotdeck
process well before using DeckArrays because the functions getdeck and putdeck automatically handle recodes and
thus hide much of the behavior of hotdecks from the programmer. DeckArrays eliminate the need for the programmer to
recode variables to fit within the boundaries of an array as the recodes are processed automatically based on the values
in a value set.
If using DeckArrays it is very easy to change the parameters of a hotdeck. For instance, if a subject matter specialist
decides that a variable should be hotdecked based on five-year intervals of age but later changes the specification to be
ten-year intervals, programming such a change is very quick if using DeckArrays, whereas with standard hotdecks
reprogramming the recodes may be time-consuming.
Declaring a DeckArray
Declare a DeckArray in PROC GLOBAL as you would a normal array but instead of using numeric dimensions, specify
the name of a value set. The size of the dimension will match the size of the value set. If you make changes to the value
set, the size of the array will automatically reflect the changes.
array Age_HD_from_Sex(SEX_VS); // same as array Age_HD_From_Sex(2);
array Age_HD_from_Sex_Relationship(SEX_VS,RELATIONSHIP_VS);
To create a DeckArray, at least one of the dimensions must be a value set, but it is permissible that the other
dimensions are declared with a numeric size. Because the program can only recode values for the dimensions based on
value sets, it is necessary to specify the indices of any numeric dimensions when calling getdeck or putdeck.
array Age_HD_from_EA_Sex(200,SEX_VS); // same as array Age_HD_From_EA_Sex(200,2);
Accounting for "Else" Values
Occasionally a programmer wants to create a hotdeck with a "leftover" row for any values not contained in the value set.
Sometimes these are invalid values, other times the value set can be created so that this leftover, or "else," row consists
of values valid for the census or survey. For instance, if a user wants to hotdeck a variable based on Christian sects, the
programmer would create a value set with all the Christian sects, and then would create a DeckArray with a leftover row
for all people who do not belong to the Christian sects identified. This is indicated by adding a (+) after the value set
name.
array CEB_HD_Religion(RELIGION_CHRISTIAN_SECTS_VS(+));
Example 1
Simplified edit specification: If the head of household's age is missing or invalid, the value should be imputed with a
hotdeck based on the head's sex and the spouse's age. If there is no spouse or no valid age for the spouse, hotdeck
based on the head's sex and the household size, which will be assumed to have a maximum value of 10+.
PROC GLOBAL
array ageFromSpouseHotdeck(SEX_VS,AGE_VS);
numeric maxHHSize = 10;
array ageFromHHSizeHotdeck(10,SEX_VS); // household size is from 1 - 10+
endif;
Example 2
VS_SEX has two values: 1 (Male) and 2 (Female). VS_EDUC has three levels: 1 (No schooling), 2 (Primary schooling), 3
(Secondary schooling).
// Command: // The hotdeck looks like
// this after the command:
After keying or scanning your data, there will be errors in the data file. This is unavoidable, and will be a combination of
human and computer error. It will therefore be necessary to correct the data by writing a series of edit routines
(procedures) to systematically and consistently clean your data files.
The Batch Edit Designer module allows you to create and modify batch edit applications. A batch editing application
contains logic which you can apply against one set of files to produce another set of files and reports. Batch editing
applications can be used to gather information about a data file.
To create these edit routines, you will use CSPro to develop the batch editing application based on the dictionary that
describes your data files. If you received this data file from someone else and do not have a dictionary that describes it,
you will need to create a dictionary before you are ready to develop programming logic for it. You use CSBatch to run the
application. For small surveys and for testing applications, you can run CSBatch directly from CSPro, on the same
computer. For large surveys and censuses, which require a production environment, you can transfer the application files
to other computers and run CSBatch on them.
You can have the following runtime features in your batch editing application:
Automatic Modification
When a census or other large survey is being processed, it would be unduly cumbersome to make most data corrections
by visually examining the errors. CSPro provides the facility for not only finding incorrect or inconsistent data, but also
making modification to the data. Needless to say, modifications are not always corrections. Moreover, any modifications
to data that have been collected must be carefully thought out and monitored through CSPro's edit statistics reports.
Generate Reports
You can write customize reports to a file.
Match Files
CSPro allow for matching two files and gathering information from both. The feature is useful, for example, when a file
must be created which has a combination of data from two other files.
Why bother retyping it each time? You can simply define it once and reference it over and over. To do so, select the
Message tab in the Message View. You will see one line that has been generated for you; it reads: {Message code
file generated by CSPro}. Beneath this simply add your error message (we'll give it number 100001):
Then, whenever you want to use this message in your code, simply use the message number:
errmsg(100001,AGE);
Besides simplifying your work, after you run your program, a convenient summary statistic will be generated for each
user-defined error message, indicating how often it was used. A sample listing is shown below:
See the errmsg function for a detailed explanation of all the options available to you.
As you can see, this may be difficult for the non-programmer to decipher. But by using the write command, you can
more clearly display this. One way would be to put the following write statements in the preproc of the first level (in this
way it would only get written out once per questionnaire):
PROC QUEST
preproc
write ("***************");
write (" Province: %3d", PROVINCE);
write (" District: %3d", DISTRICT);
After the execution of the program, the .wrt [report] file would show the following (of course, actual values will vary
depending on the questionnaire IDs):
***************
Province: 1
District: 7
Village : 30
EA : 4
***************
Additional write statements can be included in the batch edit program to generate a customized report.
Once the array has been declared and initialized, you need to add logic to check each occurrence of the variable you
wish to impute using the hot deck. If the value of the variable is valid, update the hot deck by assigning this value to the
appropriate cell in the array. If the value is invalid, set the value of the variable based on the corresponding cell in the hot
deck. For example:
PROC AGE
if AGE = notappl then
// if the value for age is not valid...
// assign the value from the hot deck based on sex and relationship
impute(AGE,AgeSRDeck(SEX,RELATIONSHIP));
else
// otherwise...
// update the value of the hot deck
AgeSRDeck(SEX,RELATIONSHIP) = AGE;
endif;
When an age is missing during the data file's processing, we will use a value from the array AgeSRDeck; if the age is
present, we will "refresh" the age for the person using the current sex and relationship codes as indices into the array.
For a more detailed explanation of what hot decks are, refer to the United Nations Handbook on Population and Housing
Census Edits.
The following is an example of code that will initialize age values based on sex and relationship:
See also: Dynamic Imputation (Hot Deck), Use Hot Decks, Initialize Hot Decks Using Saved Arrays
PROC GLOBAL
// declare an array initialized from the saved array file
array AgeSRDeck(2,6) save;
PROC AGE
if AGE = notappl then
// if the value for age is not valid...
// assign the value from the hot deck based on sex and relationship
impute(AGE,AgeSRDeck(SEX,RELATIONSHIP));
else
// otherwise...
// update the value of the hot deck
AgeSRDeck(SEX,RELATIONSHIP) = AGE;
endif;
Interpret Reports
After specifying your file(s), a progress dialog bar will be displayed as CSBatch works its way through your data file, then
TextViewer is launched and generate the following listing report:
The "CSPro Process Summary" gives you information about the records and cases. In this example:
• 29143 records were read from the input file, which represent 100 % of the input file
• "Ignored" is the sum of "Unknown" and "Erased" cases.
• "Unknown" represents number of cases with invalid record type code.
• "Erased" represents number of records with a "~" character
• The list report contains 47063 messages of which: 36 messages were defined by the user; 9004 are warning messages
generated by the system; and 38023 are error messages generated by the system
• "Level 1" indicates that the following statistics are for Level 1 only. If the application had more than one level, the
statistics will be displayed for each level.
• The input data file included 4872 cases (questionnaires) of which: there were "0" case with bad structure, meaning that
no required records were missing; and 4872 cases were written to the output file ("Level Post").
This assumes that CSPro was installed in the default directory. Your PFF file must have a .pff extension.
You can create a PFF file in one of two ways: either [1] create one with a text editor (such as Notepad or Wordpad), or
[2] have it generated automatically for you by launching your batch application from within the CSPro Designer. The file
will have the same name as your application, but with a .pff extension instead of .bch. For example, if your batch
application was named MyEdits.bch, the generated PFF would be named MyEdits.pff. You can also manipulate PFF
files in logic by using the pff object.
The following section shows the options available to you in a CSBatch PFF file. A PFF file is not case sensitive, so you
can use any combination of upper and lower case text.
[Run Information]
The [Run Information] block is required and must appear exactly as shown in the example above.
[Files]
The [Files] block is required. A description of the files, not all of which have to be specified, is as follows:
[ExternalFiles]
If the [ExternalFiles] block is present, it means that a second (or more) dictionary was linked to the batch application.
In the example above, LOOKUP_DICT is the dictionary name, and LookupFile.dat is the name of the data file that
contains the lookup codes.
[Parameters]
The [Parameters] block is optional. This section defines parameters for the batch run.
ViewListing=determines whether you see the batch run report. If this entry is missing or set to Always, then you
Pa ge 245 of 684 How to ...
will see the generated report. Other available options are OnError, in which case you will see the listing report
only if an error occurred during the run, or Never, in which case you will never be shown the generated report.
ViewResults=determines whether or not the impute and/or write file are displayed in Text Viewer at the end of the
run. The available choices are Yes or No. If the entry is missing, the resultant file(s) will be displayed by default.
For more information on these files, see Run a Batch Edit Application.
ListingWidth=allows you to control the number of characters outputted to the listing file before forcing the start of
a new line. This is set to 120 characters by default. If your screen resolution and/or printer permits, it may be
useful to increase the width of the listing file. This only applies to text listing files.
MessageWrap=determines whether or not summary messages displayed at the end of a listing file will be
outputted on several lines when they are too long to fit on one line. The default option is No, in which case the
messages are truncated so that they fit on one line. This only applies to text listing files.
ErrmsgOverride=allows you to override the default behavior of the errmsg function. The override only affects
errmsg functions used in the code without a case or summary specifier. The option No maintains the default
behavior (displaying the messages for each case, as well as in the summary). The option Summary only
displays the message in the summary. The option Case displays the message for each case but not in the
summary.
InputOrder=allows you to override the default order in which cases are processed in a data file. The default option,
Sequential, means that cases will be processed from the first case to the last case, in file order. The option
Indexed overrides this order and cases will be processed in alphabetical order of the case IDs. Text data files
can contain duplicates case IDs if processed in sequential order, but not in indexed order.
Parameter=allows you to pass in a string to your program. The parameter can be any length, although the string
that retrieves the value in your program (via the sysparm function) must be large enough to accommodate it. Once
the parameter is retrieved, it can be parsed by your program for further usage.
Language=specifies the initial language of the program. The parameter must match the name of a language
specified in the dictionary or message file.
OnExit=specifies a PFF file to run after the application closes. This can be useful, for example, if you want to
relaunch a menu program after processing data.
Before coding begins, a complete set of edit specifications and a formatted listing of the data diction ary should be
available. A team leader should be appointed; this person should have a thorough understanding of the CSPro language.
The leader should develop the naming conventions and standards that are to be followed by other members of the team.
During the course of program development, questions will arise about the edit specifications, and it is important to involve
subject-matter specialists to resolve these issues.
This section outlines a step-by-step approach to developing a CSPro program. It assumes that a data dictionary has
been developed. If this is not the case, then the first step is to build the data dictionary using the Data Dictionary module
and to check the dictionary carefully against original specifications.
An editing system may contain more than one CSPro program. It is not unusual to have at least two CSPro programs in
the edit system: the first is used to find and report structure errors which must be resolved before the data moves on to
the next stage. These include missing records, duplicate records, or other incompatibilities with requirements which are
considered essential to the processing. The second program is the more traditional value-validation and consistency
edits.
Typically, during the testing phase, the report by case is very useful, because it permits detailed examination of the
effects of the logic coded. After testing, it is usually not used because of the volume of reports generated.
Names for variables to be defined in the program should have unique prefixes. The standard could be that variables
Team members should code the edit specifications and review the code with the team leader. The team leader should
then incorporate the new code into the CSPro program. All syntax errors should be corrected before more code is added
to the main program. It is important to maintain earlier versions of the program as insurance against problems that may
creep in.
When hotdeck imputations are used to correct invalid or inconsistent items, it is particularly important to ensure that the
hotdeck arrays are being updated under the appropriate circumstances and with valid values. This is the most common
error in program construction, and one way to avoid it is by requiring that the edit specifications explicitly indicate where,
in the sequence of actions, hot decks should be updated.
The output file produced by the Run program step should be "clean," assuming imputations were performed to correct all
invalid and inconsistent data. To verify that the CSPro program does not contain contradictory logic, that it corrects
errors properly, and that it does not introduce any new errors, the Run program step should be rerun with the "clean"
(output) data file as input. In this rerun, no errors should be found and the edit statistics should reflect this. If errors are
found, then the logic of the CSPro program must be corrected.
Remember: Just because the CSPro program is free of syntax and logic errors is no guarantee that all the required
editing is being done and is being done correctly! An error-free CSPro program cannot compensate for incomplete edit
specifications. Among the test most commonly omitted tests when creating edit specifications are: making sure there is
one and only one head of household, and making sure the head of household is of an acceptable age.
As soon as enough batches have been edited to form a larger geographic area (perhaps all enumeration areas within a
district), tabulations should be run at the higher geographic level (in this case, district) to see if any inconsistencies are
apparent. If so, it will be necessary to pause and discover the reason for the inconsistencies. This may involve modifying
and re-testing the CSPro program until the data are producing satisfactory results. Each time the CSPro program is
modified, all previously-edited batches must be re-edited (always starting with the same original data files!) so that final
data will all have been edited and corrected following the same logic.
Parts of a Table
Shown below are the terms used by the CSPro software to reference the parts of a table.
Left, Center, Right Header: Optional text printed at the top left, center, and/or right of each printed page (above
the title).
Left, Center, Right Footer: Optional text printed at the bottom left, center, and/or right of each printed page
(below any footnotes).
Secondary Stub Head [Not shown]: When stub text is displayed on both sides [left and right] of the print
table, this text is displayed.
Tally: Characteristics associated with the numbers that appear in a table. These include percents, averages,
medians, etc.
Subtable: When more than one set of categories is crossed with another characteristic, then each is
considered a subtable. A table with only one set of stubs and column headings can also be considered as a
subtable. (See below: a comma in the table title separates subtable item labels.). A subtable is a portion of a
larger table that can have its own settings for universe, weights, value tallied and unit of tabulation different from
other subtables in the same table. When hidden parts in the view menu is enabled, a colored box will be drawn
around each subtable.
Test Edits
Although CSBatch automatically produces edit statistics which report the extent of the editing it performed on a file, the
subject-matter specialist and/or computer programmer has the option of using tabulation applications to analyze a data
file before and after editing to examine the extent of the changes to a file, and to review relationships between data items
in the edited file. Tabulation applications can be used to determine whether the edits did the job they were intended to
do.
Test Tables
Tabulations that are destined for publication and for distribution to outside users must be carefully verified to ensure that
the values presented are without error. Tabulation applications are currently designed for use as a custom tabulation
package but it can also be useful as a means of comparing the results produced by other tabulation software. In this
case, the relative rigidity and transparency of tabulation specifications means that there will be virtually no errors
attributable to programming faults; if differences are encountered between the tables produced by CSPro and tables
produced by other software, the tables produced by CSPro are likely to be closer to the "true" counts. In any case,
tabulation applications can be an effective tool to create publication-quality tabulations or for cross-checking numbers
produced by other software.
For example, this table has Urban/Rural and Mother Alive as independent variables; Sex and Literacy as dependent
variables. In addition, this table could be produced by 'area'; usually this is a geographic item but it could be any item
that meets certain conditions. The 'area' item is basically an extra tabulation variable.
The user can elect to display tabulations in terms of actual counts or as percentages of the total, and has the option of
counting special and/or undefined values. The counts in tabulations may be unweighted or weighted. Users can produce
a table on a subset of a file using the universe option, or select an item or numeric value to be used as a weighting factor
during tabulation.
The selection of an item as a row or column variable will affect the table layout.
See also: Create a Table
Restrict a Universe
When you define a universe, CSPro will only tabulate data records in the questionnaires that meet the conditions
stipulated by you. The universe specification acts as a filter, as the tables produced use only a subset of the data file's
records. Therefore, values in the table may be lower than they would be with no universe specified, since the universe
restricts the data available for tabulation. Note that each table has its own universe definition, but that a given universe
specification may be extended to all tables in an application.
See also: Preferences and Default Formats, Modifying Preferences, Load and Save Formatting Preferences
CSPro Tables (.tbw): Lets you save tables so they can be used later by the CSPro Table Viewer.
Rich Text Format (.rtf): Lets you save your tables so they can be used later by word processors such as Word.
See also: Save Tables for the Table Viewer, Copy Table to Other Formats
See also: Save Tabulations in Different Formats, Select and Copy Table Data to Other Applications
This feature is very useful for moving tables from one application to another. If you have many tables to produce, you can
divide them into several applications each with a small number of tables, then use copy and paste table specification to
assemble them into a single application at the end. If you do this, you must be careful to use the same data dictionary in
each application.
This feature is also useful if you need to add a table in the same application which is very similar to an existing table.
From the Tabulation Tree tab, right-click over any of the tree entries. A popup dialog box will appear. Select Copy
Table Spec or Paste Table Spec.
From the Tabulation menu, select Edit, then Copy Table Spec or Paste Table Spec.
From the table itself, right-click anywhere over the cells of a table. A popup dialog box will appear. Select Copy
Table Spec or Paste Table Spec.
Subtables are created automatically as the user builds the table. Every time an independent variable is dropped onto the
rows or columns of the table, new subtables are created as appropriate.
Subtables are designated as part of the larger tabulation using colored outlines when viewing hidden parts. Each
subtable has the same properties available to the entire tabulation, i.e., universe, weight, value tallied and unit of
tabulation. These properties may be set differently for each subtable within a table.
See Also: Add a Variable to a Tabulation, Using Subtables
• Click on the toolbar, or from the File menu, select New. The following dialog box will appear.
• A default name of the data dictionary describing the data file is given. You can use this name or change it. If you give
the name of a dictionary file that already exists, that data dictionary will be used by the application. If you give the
name of a dictionary that does not exist, a new data dictionary will be created.
Create a Table
To create a table, do the following:
Select the Dictionary [Dicts] tab to make the dictionary file structure visible.
Expand the tree until the item(s) you wish to use for row or column variable(s) appears on the tree.
Drag the desired dictionary item and drop it in the table window. Where you drop it on the table will determine
whether it is used as a row or column variable. For a row item drop it on the left side below the generated "Table
nn". For a column drop it on the top but a little below the generated "Table nn". If you imagine a diagonal line
connecting the top-left corner of the table window to the bottom-right corner, dictionary items dropped below and
to the left of the line will become row variables and items dropped above and to the right of the line will become
row variables.
This can be repeated with additional items. The key to placement for additional items is whether they are dropped
next to or on top of existing items. This is indicated by the "+" or "x" displayed with the item while it is being
dragged.
An item dropped next to an existing item is indicated by the "+" while dragging.
An item dropped on top of an existing item is indicated by the "x" while dragging.
The dropped item will be crossed with the existing item to make sub-groupings, e.g., Total, Literate, Illiterate under Total,
Male and Female.
To delete a row/column variable, left-click on one of its value names, then drag and drop it back on the dictionary
tree. In the 'x' example above, if "Sex" is deleted, then "Literacy" will also be deleted. However, "Literacy" can be
deleted without affecting "Sex".
The rows/columns that are created when you add an item to a table come directly from the value set defined in the
dictionary for that item. If you would like to have different rows/columns created for a particular item, you can either
modify the value set in the dictionary or create an additional value set for the item and use the new value set instead. To
use a different value set, rather than dragging the item itself onto the table, drag the value set instead. For items with
more than one value set, the value sets appear underneath the item in the dictionary tree. For more information see
Implications of Data Dictionary Value Sets.
Optional Definitions
• Define the universe for the table.
• Define the area for the table.
• Select the value Item or constant you want to tabulate.
• Select the weight Item or constant
• Add percentages.
When two variables are selected for the same dimension (row or column), the first one selected becomes the
independent variable and the second one becomes the dependent variable. (See the discussion on row/column variables
for more details.)
See also: Tabulations, Add a Variable to a Tabulation, Remove a Variable from a Tabulation
Notice these are like two separate tables side-by-side; 1) Marital Status by Sex and 2) Marital Status by Literacy. There
is a total of 6 columns (3 for Sex plus 3 for Literacy). Note also the wording in the generated title.
Compare the above to the following:
If you choose an item with occurrences as a row or column variable, all occurrences of that item will be tabulated across
all corresponding records in the data file. For example, if you choose MyItem, both MyItem(1) and MyItem(2) would
be tabulated across all records.
If you choose a specific occurrence of an item as a row or column variable (e.g., MyItem(2)), only that occurrence will
be tabulated across all corresponding records.
You may also use items with occurrences as the value or weight item. Note that when the unit of tabulation is not the
The Tally Attributes (Variable) Dialog Box contains two lists: on the left is the list of available calculations and on the
right is the list of calculations selected for the current variable.
To add a new calculation, select the type of calculation in the list on the left by clicking on it and then click the "Add"
button. The selected calculation will appear in the list on the right.
Each calculation has its own set of options such as percent type for percents, number of tiles for n-tiles, etc. To modify
the options for a calculation, first select the calculation by clicking on it in the selected list, and then click the "Options"
button to bring up the options dialog box.
To remove an existing calculation, click on the calculation in the list of selected calculations on the right and click the
"Remove" button. The calculation will be removed from the list of selected calculations.
The order that the calculations appear in the selected list will be the same order in which they appear in the table. That
is, if you have 'Total' before 'Median' in the selected list, then in your table, you will have a "Total" row or column before
the row/column containing the median. To change the order of the calculations, choose a calculation in the selected list
by clicking on it and click either the "Up" or "Down" button to change its position in the list.
The following calculations are available:
Counts: Displays the counts (frequencies) for each value in the value set. Adding counts to the variable will add a row or
column to the table corresponding to each value in the value set for the variable.
Type: Selects how to calculate percents for the selected variable. Specifically, this gives the 'base' (100%) or
denominator for the percent. One of three options can be selected:
Total - use the total count for the subtable as a base for each percent.
Row - use the total count for the subtable row as the base for the percents in each row. If there is a total
column, it will contain '100 %'s.
Column - use the total count for the subtable column as the base for percents in each column. If there is a
total row, it will contain '100 %'s.
Interleaved/Separate: Selects whether or not the percents are alternated with the counts. For percents to be
interleaved, counts must be present directly before or directly after the percents in the selected list. When percents
and counts are interleaved, the order in which the counts and percents appear in the selected list determines
whether the first column/row will be a percent or a count -- i.e., if counts appear first in the selected list, then the
first row or column will be a count followed by a percent.
Interleaved percents
Separate percents
Total Percent: Displays a single row or column with the total for all values in the value set expressed as a percentage of
the row, column or subtable total. Total percent has the following options:
Type: The type of percent (row, column or total as for percents above). You may also choose '(same as
percents)' to force the total percent to use the setting from the percents.
Type: Specifies the type of variable – either discrete or continuous. This should be set to "Discrete" for
discrete variables (such as number of rooms) and to "Continuous" for continuous variables (such as age or
income). The median is approximated using linear interpolation based on a frequency distribution for the
variable. For discrete variables, 0.5 is subtracted from the result to reflect the fact that values in a category
are all equal to the lower limit of the category whereas for continuous variables values in a category are in
between the lower and upper limits of the category.
Calculation ranges: Controls which frequency distribution is used to calculate the median. If "Use value set
ranges for calculation" is checked, then the median will be approximated using a frequency distribution for
the value set that was dropped onto the table. Otherwise, you can specify the ranges to use for the
distribution by entering the min, max and interval. CSPro will generate ranges of width 'interval' starting at
'min' and going up to 'max'. For example, a "min" of 0, "max" of 20 and "interval" of 5 will generate the
following four ranges:
0 up to but not including 5
5 up to but not including 10
10 up to but not including 15
15 up to but not including 20
If the value set you use does not have very many ranges, it is better to generate your own ranges in order to
get a closer approximation of the median. For example, using a value set with only three categories for age
such as 0-14, 15-49 and 49+ will generally result in a very poor approximation of the median, while using 5-
year groupings will generally result in a good approximation. For an even better approximation, turn off "Use
value set ranges for calculation" and specify a min of 0, a max of the largest age in the population and an
interval of 1. For variables such as "income" or "price of a house", you may need to experiment with different
sets of ranges to come up with a good approximation for the median.
Mode: Displays the mode (value with the greatest number of occurrences in the data set) for each row or column.
Standard deviation: Displays the standard deviation for each row or column.
Variance: Displays the variance for each row or column.
N-tiles: Displays percentiles for each row or column. N-tiles has the following options:
Calculation ranges: Controls which frequency distribution is used to calculate the n-tiles. If "Use value set
ranges for calculation" is checked, then the n-tiles will be approximated using a frequency distribution for the
value set that was dropped onto the table. Otherwise, you can specify the ranges to use for the distribution
by entering the min, max and interval. CSPro will generate ranges of width "interval" starting at "min" and
going up to "max". For example, a "min" of 0, "max" of 20 and "interval" of 5 will generate the following four
ranges:
0 up to but not including 5
5 up to but not including 10
10 up to but not including 15
15 up to but not including 20
If the value set you use does not have very many ranges, it is better to generate your own ranges in order to
get a closer approximation of the n-tiles. For example, using a value set with only three categories for age
such as 0-14, 15-49 and 49+ will generally result in a very poor approximation, while using 5-year groupings
will generally result in a good approximation. For an even better approximation, turn off "Use value set
ranges for calculation" and specify a min of 0, a max of the largest age in the population, and an interval of
1. For variables such as 'income' or 'price of a house', you may need to experiment with different sets of
ranges to come up with a good approximation for the n-tiles.
Minimum: Displays the minimum value for each row or column.
Maximum: Displays the maximum value for each row or column.
Proportion: Displays the proportion of a specified subset of the counts for the variable as a percent or ratio of the total
for the variable. You could also use proportion, for example, on the age variable, to display the percentage or fraction of
the population with age between 15 and 49. You could also proportion on a yes/no variable such as 'owns a television' to
show the percentage or fraction of the population that owns a television. The proportion is calculated by taking a subset
of the values in the value set for a variable, computing the total for that subset and then dividing that total by the total for
all values in the value set. The result may be displayed as a percent or as a ratio. The numerator used to calculate the
proportion is the total number of values that match the specified subset of the value set and the denominator is the total
number of counts for all values in the value set. For example, in the case of the proportion of people between 15 and 49,
the numerator would be the total number of people between 15 and 49 and the denominator would be the total number of
people of any age.
Proportion has the following options:
Range: Specifies the set of values whose counts will be used in the numerator of the percent or ratio and as
the total. Range can be a single value (e.g., 3), multiple values separated by commas (e.g., 1, 2, 5, 17) and it
may also include value ranges using colons (e.g., 1, 2, 5:10, 25:30).
To show the percentage of the population between 15 and 49 you would add "proportion" to the age variable, set the type
to "Percent" and set the range to 15:49. To show the fraction of the population that owns a television, add "proportion" to
the variable "owns a television", set the type to "Ratio" and the range to 1 (assuming 1 means "yes").
Note that the values calculated for median, mode and n-tiles are dependent on the value set used. For more information
see Implications of Data Dictionary Value Sets.
See Also: Hide or Change the Position of the Total, Add Percents to a Table, Add Summary Statistics to a Table
Note that if you choose Tally Attributes (Subtable), the subtable that you clicked on will automatically be selected in this
list.
This dialog displays all units that can be applied to all of the subtables. Choose the unit from the drop down and click
"OK". Potential units will not be included if they cannot legitimately be applied to one or more of the subtables. For
example, if one subtable contains an item from a multiple record (e.g., "sex" on the person record), then only the
multiply occurring record, and multiply occurring items on that record, are considered legitimate units to be applied to all
subtables.
Value Tallied: Optional name of a numeric item in the dictionary whose value would be tallied into the data cells instead
of the unit (one) tally. For example, setting the value tallied to 'children ever born' would count the actual number of
children instead of counting the number of cases with a given number of children. This can also be a numeric constant.
Weight: Optional name of a numeric item in the dictionary that contains the inflation factor (weight) for a survey case.
For each tally, this value will be tallied instead of the unit (one) tally. This can also be a numeric constant.
Universe: An expression that restricts the data used for tabulation. When you define a universe, CSPro will tabulate only
those data records in the questionnaires that meet the conditions stipulated by you. The "universe" specification acts as
a filter, as the tables produced use only a subset of the data file's records. Therefore, values in the table may be lower
than they would be with no universe specified, since the universe restricts the data available for tabulation.
Tab Logic: Optional CSPro logic statements that may be used to modify the values of variables in the case during
tabulation. This is used mainly for recodes of existing main dictionary variables into new variables used in the table. For
information see Table Logic (tablogic). Logic may be entered directly in the associated edit box or you can click on the
Edit button to bring up a larger window in which to modify the logic.
PostCalc: Optional CSPro logic statements that may be used to modify values in the table after the tabulation is
complete. This logic can be used to add values to the table calculated from the tabulated data such as ratios. For more
information see Introduction to Table Post Calculation. Logic may be entered directly in the associated edit box or you
can click on the Edit button to bring up a larger window in which to modify the logic.
Special Values: Allows you to customize which special values are counted in the selected table. By default, only
special values included in the value sets being tallied are considered. By checking "use custom special values" you can
specify which, if any special values to include in the tabulation regardless of the special values contained in the value
sets. For example, if the value set used in a subtable contains the special values notappl and missing, by default
undefined and default values will not be included in the tabulation. Enabling custom special values and checking notappl
while leaving the others unchecked will cause only notappl values to be included in the tabulation.
Lowest Break Level: Allows you to limit the geographical areas used for area processing for just the selected table.
The lowest break level is the lowest geographical area that will be used in the table. For example, if your file includes
data at the province, district and village level but you set the lowest break level to district, then only province and district
will appear in the selected table while all three levels will appear in other tables in the file.
See Also: Create Multiple Subtables, Define a Universe for a Table, Add Weights to a Table, Tabulate Values Instead of
Frequencies, Include/Exclude Special Values in a Variable
To add a new table simply press the Add button on the toolbar. You'll notice a new table is created with the name
"Table #" (where # represents the number of the table—if this is the 5th table in your table set, it will initially be
named "Table 5"). Finish the definition of the added table by adding dictionary items and specifying any universe
definitions or other tabulation parameters desired.
You can also add a table by right-clicking anywhere in a table and selecting Add Table from the popup menu.
Alternatively, you can add a table from the Edit menu: select Add Table.
Insert Table
You can insert a table before another table by right-clicking anywhere in a table or on the table name in the table tree,
then selecting Insert Table from the popup menu. Alternately, you can insert a table by clicking on the icon, or
selecting Insert Table from the Edit menu.
Any table inserted into the existing table set will always be placed before the currently displayed table, which we will call
"Table N". You'll notice a new tab is created with the name "Table #A" (where # is N-1). If you insert another table before
this one, it will be created as "Table #AA". If you insert another table before "Table N", it will be created as "Table #B". In
both cases # is N-1. Once created, finish the definition of the inserted table by adding dictionary items and specifying
any universe definitions or other tabulation parameters desired.
Delete Table
The table that is currently on view in the table window is always the one affected when you choose to delete. So if the
table you want to delete is not displayed in the table window, you must first make it visible. Either select it from the table
tree on the left or by using the table paging arrows in the Tabulation toolbar.
You can then delete the table by right-clicking anywhere in the table or on the table name in the table tree, then
selecting Delete Table from the popup menu. Alternately you can use the icon on the toolbar or select Delete
Table from Edit menu. You will need to confirm that you actually want to delete the table.
Press the Run Tabulation button or press Ctrl+R. If the application has been modified, you will need to save the
changes in order to run.
The run will request the following files as appropriate:
Input Data: The data file(s) being tabulated. There is NO required extension for CSPro data files. Multiple input data files
can be selected using the browse button.
Area Names: Only used for applications with area processing. An Area Names File is used to associate the area codes
Note that you can also stop the processing by using the "Cancel" button.
If there are no warnings or errors, the application will display the resulting tables. If your application has generated
messages, these will be displayed via a listing in Text Viewer but the results will still be shown in your tables.
You can change the name of a table using the table properties dialog. Right-click on the table in the "Tables" tree and
choose "Properties". Edit the name of the table and click "OK". Note that the system will not allow you to have two
tables with the same name.
You can also change the name and label of the table application by right-clicking on the table application in the "Tables"
tree and choosing "Properties". The tables application is always located at the top of the tree. Edit the name and label of
the application and click "OK."
In order to create subgroupings of categories already defined in the table, e.g., by sex, drag and drop the subgroup item
(sex) anywhere on the existing value set. (See 'Sex' in column below)
In order to create an additional set of row/column categories, drag and drop the item or value set below the existing rows
[for additional set of rows] or to the right of the existing columns [for additional set of columns]. (See Place of Birth and
Martial Status below)
If an independent item is removed then the associated dependent item will also be removed.
You can type a universe condition statement, as CSPro logic, in the Universe box. Or, you can double click on an item
name, press a relation button, and double click on a value.
Double click on the item in the left-hand box that you want to use in the universe condition. The item name will
appear appended to the text in the universe condition below.
Click a relationship button ( =, <>, >, >=, <, <= ). The relation will appear appended to the text in the universe
condition below.
Double click on a value in the right-hand box. The value, not the text label, will appear appended to the text in the
universe condition below.
You may enter several conditions using the and / or. You can also add parentheses to modify the order of
evaluation of the conditions.
To delete the universe simply erase the contents of the universe condition.
Press OK when you have completed the universe condition.
You can apply a universe to all the tables, by pressing the Apply All button to the right of the Universe box in the
Tally Attributes dialog box
Examples:
To restrict your table to females of reproductive age, you might state:
P03_SEX = 2 and P04_AGE in 12:49
To restrict your table to heads of households who are economically active, you might state:
P02_RELATION = 1 and ECON_ACTIVE = 1
Use the Apply All button to the right of the box if the weight is for the entire table and you want to propagate it to all
tables currently defined. It will replace any existing weight in other tables.
Notes:
If the value of the weight data item or expression is not numeric when a tally is to be made, the tally is NOT done. This is
equivalent to a tally of zero.
If the weight expression is not valid it must be corrected before the Tally Attributes dialog box can be closed.
See also: Tabulate Values and/or Weights, Tally Attributes for a Table
To activate the Special Values portion of then menu, check the Use custom special values box (as shown).
After that select the types of special values you want to add to/remove from the value sets of each item in the table by
checking the appropriate box.
Once Use custom special values is checked, this setting overrides any special values in the value sets for items in the
table. If Use custom special values is checked, any of the special values that are checked in the tally attributes dialog
will be displayed for every item in the table regardless of whether or not the value set for that item contains the special
value. In addition, only the special values that are checked in the dialog box will be displayed in the table even if they
appear in the value set for the an item in the table. For example, if your value set contains Notappl and you check Use
custom special values but leave Notappl unchecked then the columns/rows corresponding to the Notappl value in
your value set will be removed from the table. If your value set contains no special values, and you check Use custom
special values and Notappl, then a row or column for Notappl will be added for each variable in your table.
Note: Undefined values count includes blanks if Notappl values is not checked. If it is checked then those counts are
taken out of the "Undefined values" count and tallied separately. Similar statements apply to other special values.
To remove the total row or column, select the total by clicking on it in the list of selected calculations on the right hand
side of the dialog, and then click on the Remove button to delete the total.
To move the total after the counts, select the total by clicking on it in the list of selected calculations on the right hand
side of the dialog, and then click on the Down button to move it until it is after the counts.
The default options are set in the Preferences and Default Formats dialog.
Type
Total – Percents are based on total cell for the table (% cell = (associated cell value * 100) / total of all the cells
in the table).
Row – Percents are based on total for the row (%cell = (associated cell value * 100) / total of all the cells in the
row).
Column – Percents are based on total for the column (%cell = (associated cell value * 100) / total of all the cells
in column).
Interleaved/Separate
By default, percents and counts are interleaved; that is, for each value in the value set, the frequency for that value is
listed in the column/row directly preceding or following the percent for that value. Percents may also be separate, in
which case all the frequencies are grouped together and all the percents are grouped together.
Separate Percents
For percents to be interleaved, the percents must directly follow or directly precede the counts in the list of selected
calculations in the tally attributes dialog.
You can change the position of the percents relative to the other selected calculations (including the counts) by selecting
the percents in the list on the right-hand side of the dialog and clicking the "Up" or "Down" buttons.
To show percents only (percents without counts), add the percents as described above and set the options to
"Separate". Then select the counts in the list of selected calculations on the right-hand side of the dialog and click the
"Remove" button to delete the counts. Finally, do the same to remove the "Total".
To show percents only with a number row, add the percents and remove the counts as above, but do not remove the
"Total".
The Defaults are set in the Preferences and Default Formats.
By default, data in rows and columns containing percents will be displayed with one decimal place (e.g., 12.3). To
change the number of decimal places see Change the Number of Decimal Places Displayed.
You can also show the percentage of the counts for a subset of the values in a value set (e.g., percent of people with age
between 15 and 49) using proportions. See the section on proportions under Tally Attributes for Variable for more
information.
It is assumed that users know the meaning and relevance of any statistics that are selected. They are only meaningful
for data that represents a true numeric value, e.g., age, income, hectares, etc.).
To exclude a table from the run, first make sure the table is selected and appears on the screen. From the Edit menu,
select Exclude Table from Run, or right-click on the table name on the table tree on the left and select Exclude from
Run. The icon next to the table in the tree on the left will show a red X when the table is excluded.
If a table is already excluded, follow the same steps to change the setting back to included in the run. The red X will
disappear.
To exclude all the tables but the one currently selected, select the Exclude All But This option. This is a useful during
testing when you want to run only one table at a time. To undo any exclusions, select the Include All Tables option.
Check your value set to make sure that no single value is repeated and that no ranges overlap. Of course, in some
cases, you may want to have overlapping ranges in your value set, for example if you want to show subtotals.
See Also: Implications of Data Dictionary Value Sets, Special Values
The Format (Table Element) Dialog Box has the following settings:
Hide: When checked, the selected element will not be displayed in the table. If a column head or row stub is selected,
then the associated column or row will also be hidden. Note that stub heads, titles, and data cells may not be hidden.
Hide when All Cells in Row are Zero: Available only for stubs. When checked, if all cells in the row contain the value
zero, then the row will be hidden.
Custom Text: When checked, the default text generated by CSPro for the selected element will not be displayed. You
may add your own text for the selected element by double-clicking on the element and typing in the new text. Note that if
you double-click on a table element and add your own text without first checking the custom text box, CSPro will
automatically check it for you. You can uncheck this box to replace your new text with the default CSPro-generated text.
Font: Displays the current font and font size for the selected element. You may switch to a different font and/or font size
by clicking on the change button.
Text Color: Shows the current color used to display the text of the selected element. To change the text color, click on
the square displaying the current color.
Fill Color: Shows the current color used to display the cell background of the selected element. To change the fill color,
click on the square displaying the current color.
Indent: Allows you to set the indentation of the selected element from the right and left edges of the cell. To set the
(Thick lines extended into cells from a stub ('30 to 39 years') and a column head ('Separated'))
Note that the lines between rows and columns are shared between multiple elements. For example, the vertical line
between two columns is both the left edge of one column and the right edge of the other column. It may also be the edge
of one or more spanners. At times, you may need to change the line setting in more than one of these elements in order
to change a line. For example, in order to make the vertical line between two columns disappear, you may have to set
the left line of one column to 'none' and the right line of the other column to 'none'. You may also have to set the left or
right line of one or more spanners to 'none'.
Extend Text Color into Cells: When this is checked for a column head or stub, the text color for the selected element
is applied to all cells in the associated column or row. For a stub, all cells in the row of the stub will share the text color
of the stub. For a column head, all cells in the column underneath the head will share the text color of the column head.
Extend Fill Color into Cells: When this is checked for a column head or stub, the background color for the selected
element is applied to all cells in the associated column or row. For a stub, all cells in the row of the stub will share the fill
color of the stub. For a column head, all cells in the column underneath the head will share the fill color of the column
head.
Extend Indentation into Cells: When this is checked for a column head or stub, the indentation setting of the selected
element is applied to all cells in the associated column or row. For a stub, all cells in the row of the stub will share the
indentation of the stub. For a column head, all cells in the column underneath the head will share the indentation of the
column head.
Span Cells: This setting applies only to captions. When this is checked for a caption, the caption spans the entire row.
By default, captions are placed in the first column of the table along with the stubs, and the cells containing them are the
same size as the cells containing the stubs. This means that there are empty cells in all other columns of the caption
row. When "span cells" is checked, the empty cells in the caption row are removed and the caption is placed in one
large cell that spans the entire row. This allows you, for example, to center or right-justify the caption in the entire row
Pa ge 286 of 684 Forma ng Ta bles
(by setting the alignment).
Decimal Places: Sets the number of decimal places used to display numeric values for the selected element. If applied
to a data cell, this sets the number of decimal places used for that individual cell. If applied to a column head or stub it
sets the number of decimal places for the entire row or column. This may only be applied to data cells, column heads
and stubs.
Use the right-click to access the Format (Stub) menu. Change the indentation to .5 in. then select OK. The result is
below.
Similar actions can be performed on columns by selecting multiple column heads. You can also select multiple captions
or spanners.
You cannot change the format for one row or column in only one sub-grouping but not the others. You can, however,
achieve the same result using multiple subtables with universes.
See also: Change the Way Numbers are Displayed
Stub Leadering: Allows you to set a pattern such as '….' or '_ _ _ _' to appear in each row of the table between the end
of the stub text and the start of the next column. You may choose different patterns for the stubs on the left and right
sides of the table. Note that stub leadering is only visible in print preview and on the printed page.
Display Leading Zeros: Controls whether or not zero should be displayed before the decimal point of a decimal number
between zero and one. Leading zeros are displayed by default.
Measurement System: Allows you to switch between US (inches) and metric (centimeters) measurements for page
margins. The measurement is used when specifying margins in the Format Print (Table) dialog as well as indentation in
the Format (Table Element) dialog. The measurement system is set to US by default.
Zero Mask: Allows you to change the text that is displayed for data cells that contain a value of exactly zero. By default
the zero mask is "-", so any data cell containing a value of zero is displayed as "-" rather than "0". If you want zero to be
displayed, remove the zero mask text.
Rounded to Zero Mask: Allows you to change the text that is displayed for data cells that contain a value which is
rounded off to zero because it cannot be displayed using the number of decimal places for that data cell. For example, if
the value in a data cell is calculated to be 0.002 but the number of decimal places for that cell is set to 2, the rounded to
zero mask will be displayed in that cell. Having the zero mask and rounded to zero masks be different makes it clear
which values are exactly zero and which ones are rounded off. By default the rounded to zero mask is "*". If you want
zero to be displayed, remove the rounded to zero mask text. Note that you can change the number of decimal places for
The Format Print (Table) Dialog Box has the following settings:
Stubs: Controls where stubs will be placed on the printed page. There are four options:
Left side, standard (default): place stubs along the left side of every page. If the table is wide enough so that
some of the columns go onto a second page, stubs will be placed on the left sides of both pages.
Left and right sides, standard: place stubs on both the left and right sides of the page for every page. If the table
is wide enough so that some of the columns go onto a second page, stubs will be placed on the left and right
sides of both pages.
Left and right sides, facing pages: place stubs along the left side the first page of facing pages and on the right
side of the second page of facing pages. If the table is wide enough so that some of the columns go onto a
second page, stubs will be placed on the left side on the first page and the right side of the second page. If the
table columns all fit on one page then this is the same as Left side, standard.
Boxhead Frequency: Controls where the boxheads will be placed on the printed pages. The box head is the area at the
top of the table that contains the stub heads, column heads and spanners. You can set the boxhead frequency to any of
the following:
Top of each page (default): Places a boxhead at the top of every page of the table.
No boxheads: No boxheads will be placed at all.
Top of table only: Only place the boxhead on the first page of the table and not any of the other pages of the
table.
Page Margins: Allows you to set the top, bottom, left and right margins of the printed pages for the selected table. By
default, margins are measured in inches although you can change it to centimeters by choosing "metric measurement"
in the Format (Application) Dialog Box. Use the "Make Equal" button to set all four margins to the currently selected
number.
Folio Text: Allows you to set header and footer text for each page of the selected table. The header and footer each
You can modify the fonts used for the header and footer by clicking the Font buttons. The top Font button applies to the
header and the bottom button applies to the footer.
You can place the current date, current time, page number and file name(s) in any of the header or footer sections by
clicking on the appropriate buttons below the header and footer edit boxes or by adding the following text into the
appropriate header/footer edit box:
By default, CSPro places the filename in the left header, the date in the right header and the page number in the center
footer.
Start Page: Sets the page number to use on the first page for page numbering. If this is set to 1, then the first page of
the table will be page 1; the second page will be page 2, etc… If it is set to default then the first page of the selected
table will be one greater than the last page of the previous table in the file so that the numbers continue from one table to
the next with no gaps. For the first table in the file, if the start page is set to default, then the page numbering for that
table will start at page 1.
Views of Tables
Beside Print Preview, there are two views of every table. However, unless the table has hidden rows or columns, or
multiple subtables, the two views are basically the same. One view, called the "Design view" shows the hidden rows or
columns, and the other view, called the "Display" or "Data" view, does not. Users can switch between the two views by
using Ctrl+D or checking/unchecking "Hidden parts" under the View menu.
Most often, users will see the Design view of the table. This is the one that shows hidden stubs or columns. Such stubs
or columns are shaded in gray.
In example the "Total" stub is hidden.
When a table is first created, you see the design view of the table. When a table is run, the display view is automatically
shown and hidden parts of the table are no longer visible. In order to switch back the design view (for example, so that
you can select and unhide a part of the table that is hidden), use Ctrl+D or check "Hidden Parts" on the view menu.
In addition to showing hidden parts, the design view also marks subtables with a colored outline while the display view
does not. It is often useful to switch to the design view in order to be able to more easily set the tally attributes for a
subtable. When the colored outline of subtables is displayed, you may right-click inside the outline and choose Tally
Attributes (Subtable).
See Also: Using Print Preview, Hide or Show a Row or Column
You can now change the text of the table title. If you want a second line of text, press Ctlr+Enter to create a new line of
text.
To go back to the original default text, right click on the text you want to change, select Format (table part), uncheck
the Custom Text check box, then press OK.
Note that if you customize the text of a stub or column header in a sub grouping, the text will be changed for all
corresponding rows or columns in the sub grouping. In the example below, the text for the column header Male has been
changed in one column and the change is reflected in the other sub groupings.
If you wish to change the text in only one sub grouping, you will have to use to multiple subtables with universes.
See Also: Change the Automatically Generated Text
Similarly, to hide a column, right-click on the column head for the column you wish to hide, select "Format (Column
Head)"
To have CSPro automatically hide a row if all values in the row are zero, right-click on the stub for the row and choose
"Format (stub)". In the "Format (stub)" dialog, check the box "Hide when All Cells in Row are Zero". When you run the
table, the row will be hidden if all cells are zero. To view the row again, in order to able to select it, switch to the design
view. To switch to the design view, use Ctrl-D or select "Hidden Parts" from the View menu.
You can hide multiple rows if they contain all zeros at the same time by first selecting the rows and then right-clicking
and choosing "Format (Stub)" as above. Multiple selection is done in the usual Windows manner by left-clicking the
mouse while using the Shift or Ctrl key. Using the Ctrl key adds each cell in the table that is clicked on to the selection.
Using the shift key adds all the cells in between the two cells clicked on to the selection.
Rows or columns containing percentages are automatically designated as having one decimal place.
If you change the number of decimal places in a data cell to a number other than the default setting (indicated by the
word "Default" in the pulldown), then the setting for the data cell will override any settings made for rows or columns
Click to add a check mark to either the pagenote or endnote and then click OK. This will add a new row at the bottom of
the table. Double click on this new row and type in the text for your footnote.
A pagenote will be displayed at the bottom of every page when the table is printed while an endnote will only be
displayed on the last page of the printed table.
To remove a pagenote or endnote, uncheck the corresponding box in Format (Table).
Format (Table) allows you add a pagenote or endnote to an individual table. To add an endnote or pagenote for all tables
in a file, use the preferences.
See also: Add Header/Footer Text to a Table
To place predefined text in a text box, you can either type it directly or click in the text box where it should go and then
click the button corresponding to the desired item. Predefined text strings may be mixed with other text, as in the right
header of the example above. To delete text in the header or footer, delete it from the corresponding edit box.
Separate fonts are available for headers and footers. You can set these fonts using the font button to right of the text
boxes.
If page numbering is used you may set the starting page number, the number to use for the first page of the table, in the
"Start Page" text box. By default, the starting page number will be 1 for the first table in the file and for all subsequent
tables in the file it will be one greater than the last page of the preceding table.
Using the Format Print (Table) dialog will set the header and footer text for the current table. You may set the header and
footer text for all tables in the file using the preferences.
Note: Headers and Footers are visible only in Print Preview and on the printed page.
See also: Add a Footnote (Pagenote or Endnote)
Add a Subtitle
To add a subtitle to a table, right-click anywhere on the table and choose Format (Table) or choose Format (Table) from
the Edit menu to launch the Format (Table) dialog.
Click to add a check mark next to "Subtitle" then click OK. This will add a new row in the table directly underneath the
title. Double click on this new row and type in the text for your subtitle. You can then change the format (font, color,
alignment, etc…) of your subtitle by right-clicking on it and choosing Format (Subtitle).
To remove the subtitle row, uncheck the corresponding box in Format (Table).
By default, no stub leadering is displayed. To add stub leadering to a table, right-click anywhere on the table and choose
Format (Table) or choose Format (Table) from the Edit menu to launch the Format (Table) dialog.
Choose the desired pattern from the Stub leadering pulldown menu. If you have chosen to display both left and right
stubs, you can set different leadering patterns for each one.
Note: Stub leadering is only visible in Print Preview and on the printed page.
Using the Format Print (Table) dialog will stub leadering for the current table. You may set stub leadering for all tables in
the file using the preferences.
Add Borders
To add borders to a table, right-click anywhere on the table and choose Format (Table) or choose Format (Table) from
the Edit menu. This will launch the Format (Table) dialog.
Borders outline the table on the printed page. Different borders are available for each side of the table.
Individual Data cells cannot be hidden but you can replace their contents with Custom Text. Of course, the number of
decimal places, fonts, alignment, and indentation can be changed using the options in the Format (Data Cell) dialog
above.
You can hide or otherwise modify entire rows or columns of numbers. Select the rows or columns to be modified
(multiple selection is done using Shift and/or Ctrl keys), right click and use Format (Stub) for rows or Format (Column
Head) for columns.
The settings that have an effect on the numbers in the rows or columns associated with the stub or column head are in
the "Associated Cells" section. These check boxes allow you to extend the format of the stub/column head into all the
cells in the corresponding row/column. Of course, the number of decimal places affects only the data cells themselves
and not the column head or stub.
See Also: Formatting Row, Column, or Cell Data, Hide or Show a Row or Column
Every occurrence of the default text will be replaced by the alternate text. In the example, every "total" text in every table
will be replaced by "All persons".
If the text is not appropriate in certain places it can be replaced by "custom text ".
Note: Any text can always be replaced by custom text.
Changing the automatically generated text is particularly useful when generating tables in languages other than English.
You can translate all of the automatically generated text into your language in the Format (Application) dialog and will be
updated in all tables in your file.
To change the font, click on the 'Change' button and select the font characteristics from the available list.
Indentation is given in inches (in.) or centimeters (cm.) depending upon the choice of Measurement System in the
Format (Application) dialog. Indentation affects where in the text box, the text starts (Left) and/or ends (Right).
Alignment is the placement of text in the text box. The horizontal options are left(-justified), center, or right(-justified).
Vertical choices (shown) are top, middle, and bottom.
See Also: Preferences and Default Formats
The column header "No" has thick lines set for left and right
and extend into cells is checked
You can also set the left and right borders of a spanner and have them extend into the leftmost and rightmost columns
under the spanner. This is done the same way it is done for individual columns. Right-click on the spanner, choose
Format (Spanner), set the left and right lines and ensure that Extend Lines into Cells is checked.
By default, stubs and columns have no borders, but spanners have thin lines on the left. This means that cells in the
leftmost columns under a spanner will have left borders. You can change these default settings by using the
Preferences.
See Also: Formatting Row, Column, or Cell Data, Add Borders to a Table
The result:
Note: The horizontal alignment of the caption was also changed to "center" to demonstrate the size of the associated
text box.
By default, captions do not span fields. You can make all captions in all tables in the file span fields using Preferences
and Default Formats.
Note: There is NO "undo" for this operation. All custom entries are lost.
Left side, standard (default): place stubs along the left side of every page. If the table is wide enough so that
some of the columns go onto a second page, stubs will be placed on the left sides of both pages.
Left side, facing pages: place stubs along the left side of the page but only for the first page of facing pages. If
the table is wide enough so that some of the columns go onto a second page, no stubs will be placed on the
second page. If the table columns all fit on one page then this is the same as Left side, standard.
Left and right sides, facing pages: place stubs along the left side the first page of facing pages and on the right
side of the second page of facing pages. If the table is wide enough so that some of the columns go onto a
second page, stubs will be placed on the left side on the first page and the right side of the second page. If the
table columns all fit on one page then this is the same as Left side, standard.
Use the Format Print (Table) dialog to set the boxhead frequency for a single table. Use Preferences and Default
Formats to set the default boxhead frequency for all tables in the file.
See Also: Viewing Multiple and Facing Pages
No File Processing: It is also possible to produce tables without creating an area names file. Select the levels of
geography desired but after clicking on the Traffic Light button in CSPro to run your tables, leave the <Area Names> field
blank. CSPro will use codes from the data file instead of names for each level of geography. For example, if province
Gluten's code is 20, the tables will show code 20 instead of the name Gluten. This can be useful for generating a list of
every geographic code found in a data file.
See also: Area Names File,
File Format
The following is an excerpt from the area names file for Popstan, which you can find in the Examples folder. The first
section identifier, [Area Names], indicates the type of file, followed by the CSPro version number.
Pa ge 309 of 684 Crea ng Ta bles by Geogra phic Area
Next, the [Levels] section provides the names of the geographic hierarchy (levels) in major-to-minor order.
Finally, the [Areas] section gives the correspondence of the geographic codes (major-to-minor order) found in the data
with a name for the geographic area. More detail follows the listing of the area name file.
If the data file contains area codes that do not have corresponding names in the area names file then the unmatched
area codes will be displayed as a text string in place of the missing name.
[Area Names]
Version=CSPro 7.5
[Levels]
Name=Province
Name=District
[Areas]
X X = Popstan
1 X = Artesia
1 1 = Dongo
1 2 = Idfu
⋮
2 X = Copal
2 1 = Baja
2 2 = Bassac
⋮
3 X = Dari
3 1 = Argentina
3 2 = Benlata
3 3 = Bristol
⋮
The line following the [Areas] section contains the codes and name of the "total" area ("country" in our case). It is
considered the "Grand Total" level and denoted by 'X' values for each level of the area hierarchy. In the example, the first
'X' represents the Province code and the second 'X' represents the District code. Basically, an 'X' value is similar to a
"wildcard" match, so any value in this field is acceptable (and thus part of the area).
Following the total area name is the set of codes and names for the lowest valued major level, "province" in our example.
The lowest code for a province is the '1' associated with Artesia. Again, an 'X' value is given for the District code since
any code here is acceptable. Next, we must give codes and name for all districts in Artesia starting with the lowest code
value.
Note that each line for this province begins with '1' since the province code must be combined with the district code to
uniquely distinguish this district. Dongo is district '1' of Artesia (province '1'). Data for a questionnaire with Province code
= 1 and District code = 1 will be tallied for Dongo District.
When districts for Artesia are all listed (codes and names), start with the next lowest province code followed by its
districts. The process is repeated for each province.
Area codes must be listed in ascending sort order from major to minor.
If the area name file has only one level, e.g., province, then only one code would be given. If three levels were needed,
e.g., province, district, village, then three codes would be required. As always, 'X' represents the wildcard match.
The indentation associated with the names above is for illustrative purposes only. Area names shown in tables will not be
indented, but indenting the file can make it easier to read.
Also, codes within the [Areas] section can be separated by commas, spaces, or a combination of both. Any of the
following are acceptable to define an item at the district level:
Hiding Tables
To hide tables for certain level of geographies, preface the area name label with a tilde (~). For areas with this label, a
table will still be created (and stored in the .tbw table file), but it will not be displayed and will thus be hidden. For
example, if a table shows states (provinces) and counties (districts) but some states only contain one county, it may be
desirable to suppress the repetition of the data at both the state and county level. For example:
X X = United States
1 X = Maryland
1 1 = Montgomery Country
1 2 = Prince George's County
2 X = District of Columbia
2 1 = ~ District of Columbia
3 X = Virginia
3 1 = Arlington County
3 2 = Fairfax County
Without the tilde, a table would be displayed for District of Columbia as both a state and a county. Adding the tilde leads
to the desirable behavior of only displaying one table for that level of geography.
As an example from above, the original tables [created from the data file] would be an urban table for each district in
each province and a rural table for each district in each province.
Standard Consolidation would:
Add the appropriate urban and rural tables to create the province-district table.
Add the appropriate district tables to create the province table.
Add all province tables to create the 'country' table.
This procedure produces four levels of tables: Urban/Rural, District, Province, and Country (given in minor to major
order).
The Lowest Level allows users to select the most minor level to be produced. Counts at this level and higher will
appear in the tabulation but any lower level counts will be discarded.
Custom option allows some freedom in defining the aggregation or consolidation scheme. See Custom Consolidation for
more information.
See also: Area Processing, Create an Area Names File
Area Captions
Area Captions are, as the name indicates, row captions in which the corresponding area name is placed for each table.
Area captions are placed just above the total line of the first stub group in the table. An area caption is denoted by the
"%AreaName%" text.
If the table is a 'one-row' table, e.g., it has only column items but no stub groups, the area name caption replaces the
'Total' stub that would otherwise be present and the area caption is 'hidden'.
The area caption has the same formatting options as other captions, available by right clicking on the caption and
choosing Format (Area caption). You can hide area captions, as in the example above, you can change the fonts, make
them span rows, etc…
Custom Consolidation
Custom Consolidation allows the aggregation of lowest level tables to create almost any reasonable higher level. Users
define the consolidation scheme starting from the Standard consolidation, shown below.
Using Custom Consolidation you can also create user-defined schemes for combining the lowest level tables by adding
new rows to the consolidation scheme:
Each row of the scheme must have a CSPro 'name' for the set of tables to be created, listed under Area Level Name
above.
In the example above, the following tables would be created:
One Country table
One urban table and one rural table for the entire country
A Province table for each province
One urban table and one rural table for each province
A District table for each district in each province
One urban table and one rural table for each district in each province.
In addition, you may place conditions in the cells of the consolidation scheme grid. These conditions are used to modify
how the tables replaced for a given level (the column) are consolidated for a particular consolidation scheme (row).
The options are:
Blank – Any code in this position is included.
Each – Each different code in this position creates a separate table.
Single value – One table created for this value of the area level
Replacement formula – One 'summary' is table created for all areas that the meet the condition and the
replacement code is substituted for the area code for the summary table. [start:end = replacement]
The TOT_URB table will include all tallies that had a value of '1' for the UR area level. The area codes on this table will be
Pa ge 315 of 684 Crea ng Ta bles by Geogra phic Area
Province=X, District=X, UR= 1.
The PROV_1_4 table will include all tallies that had Province codes of 1 through 4. The area codes for this table will be
Province=99, District=X, UR= X. Since other province codes are 1 to 15, the table associated with '99' would be
displayed after any for those coded. (In the example above NO province tables were created hence they are not
displayed)
In order to use these codes – a corresponding name should be in the area name file.
Note: When custom consolidation is used it is the user's responsibility to make sure the consolidation scheme is
reasonable and that it works in the desired manner. As usual, every aspect of a data processing system should
be tested for correctness.
The 'All' selection represents the usual display. If area codes are found in the data file that are not defined in the Area
Name File then those codes will be listed. The areas are listed in sort order, i.e., the same order that they are defined in
the area names file.
Note that this only changes which area is currently displayed. Tables are still produced for all areas originally specified
and all of these areas will still appear when the tables are printed or saved. To only produce tables for certain
geographies either set the lowest break level for a table, or use custom consolidation.
The spacing or sizing of rows or columns in Display view is NOT carried over to Print Preview.
See the View menu for other combinations of keys that will allow you to move between tables.
There is also a "Goto ..." option available from the Edit menu and from the right-click menu which allows you to go
directly to a particular page, table or area within a table.
See Also: Print Preview Options
To modify the height of a row place the cursor below the row text until it changes to a vertical double-sided arrow, left
mouse click and drag the (usually invisible) horizontal bar to the new position.
If this option is checked and a change is made to the width of the "Male" column in any of the three panels then the
width of all three will be modified. If the option is unchecked then only the specific column selected will have its width
modified.
Automatically Fit columns across each page
If checked this option will "spread" the table columns for all tables across the entire printed page. If unchecked, the
columns for all tables will be displayed in the minimal width possible given text, font, etc.
An example of checked:
Next page after Page Break applied. This page starts with the "15-19 years row"
Print Setup
This is the usual dialog box for Print Setup. It should be something like the following:
What is important is that this information is used to create the Print Preview of your tables. Changes made here will
If an action is taken and you want to "undo" it then right click in the Print Preview window. This gives the following menu:
Each "Object Format" on the left has a corresponding menu. Select the object in the tree then you can set the 'default'
settings for each of the available options. The only difference is that 'Use Default ' check box may be missing since this
is the menu that sets those defaults. It is important to note that if a setting is changed here it will be applied to all tables
that use the default settings even the tables that have already been defined.
For example: If you select blue as "Text Color" for Spanner then all spanners that exist in tables already defined (and
use the default setting) will now be displayed in blue as well as spanners in tables yet to be created.
For the meanings of the various options in each menu see the related topic.
See Also: Tally Attributes for a Variable, Formats for a Part of a Table, Formats for a Table, Formatting Row, Column, or
Cell Data , Formats for an Application , Formats for Printing
The Load option is used to change preferences to those in your CSPro Table Format (.tft) file. For this option, enter the
name of the .tft file or browse to find it. Once the file is loaded the revised preferences will be in force.
These "Default" settings are contained in Preferences and Default Formats. In any Tabulation application the initial set of
Preferences is established by CSPro. They can be reset to other settings through "Load Preferences…" under the File
menu.
Sets of user-defined preferences are contained in CSPro Table Format (.tft) files. Once the file has been created by the
"Save Preferences…" option under the File menu, it can made available to other users in the same manner that any
other file is shared.
See Also: Loading and Saving Preferences
Use the Select All button if all tables are needed. Otherwise, select the individual table(s) that you would like to save in
a single Table Viewer file. (Multiple tables are selected in the usual manner with the Shift and/or Ctrl keys.)
After selection press OK.
In the Save As dialog box enter the name of the Table Viewer file to be created or browse to select the file to be
replaced. Table Viewer files must have the .tbw extension.
See Also: Saving Tables as Text, HTML and Rich Text Format
Enter the name of the file to be created or browse to select the name of the file to be replaced.
Note: ONLY ONE table at a time can be in Rich Text (.rtf) or HTML (.htm) format. Tab delimited format supports saving
multiple tables in a single text file.
- Click on the toolbar; or from the Edit menu, select Copy; or press Ctrl+C.
If the tabulations use area processing, the table viewer files can also be imported into a CSPro Table Retrieval System
(TRS). This is a much more structured way to distribute tables, especially for large numbers of tables (or documents, for
that matter).
See also: Save Tables for the Table Viewer, Saving Tables as Text, HTML or Rich Text, Introduction to Table Viewer,
Introduction to Table Retrieval Setup
To copy and paste a single table or part of a table follow the procedure outlined in Select and Copy Table Data to Other
Applications.
To copy multiple tables, use the one of the file format options given for "Save Tables" (Tab Delimited Text, HTML, or Rich
Text). These options are available for single tables or multiple tables. In some cases, you may need to save single tables
in individual files to achieve the desired results.
See also: Select and Copy Table Data to Other Applications, Saving Tables as Text, HTML or Rich Text
There are two ways to add a new column. We can add a new value to the existing value set for the existing variable or we
can drag a new variable onto the table next to the existing one. Adding a new value to the value set creates a new
column under the existing spanner while adding a new variable creates a new column under a new spanner.
Male/Female Ratio column added as new value to value set for the variable Sex. The new column is under the Sex spanner.
Male/Female Ratio column added by dragging a new variable onto the table. The new column is under a separate spanner.
First, we must add the new column to the table as described in Adding Rows and Columns For Post Calculation.
Once the new column has been created, all that is left is to add the postcalc logic. Postcalc logic in CSPro tabulation
applications is similar to working with arrays in a CSPro batch edit or data entry application. To access an individual cell
in a table, use the table name followed by the indices of the row and column in parentheses:
<table name>(<row>, <column>)
where:
<table name> is the name of the table.
To see the names of the tables, click on the Tables tab in the bottom left of the CSPro window to show the
Tables Tree and then select "Names in Trees" from the View menu or press Ctrl+T. This toggles between
showing the table titles and the table names in the Tables Tree.
Note that column and row numbers start at zero, so the first column is column zero, the second column is column is
column 1, etc…
To enter postcalc logic for a table, open the Tally Attributes (Table) dialog. You can then either type postcalc logic
directly into the postcalc area or click on the "Edit" button next to the postcalc area to bring up a larger window to type
in.
Postcalc logic can contain numeric constants as well as table values. For example if you wanted men per 100 women
rather than male to female ratio, you could use the following logic:
TABLE1(0,3) = 100 * TABLE1(0,1) / TABLE1(0,2);
This multiplies the result of the division by 100 to give the number of men per 100 women.
In fact, postcalc logic can contain nearly any of the statements and functions available in program logic in batch edit and
data entry applications.
See also: Post Calculation For Rows, Columns and Ranges, Row and Column Indexing for Post Calculation
For information on how to add the additional column, see the previous section Adding Rows and Columns For Post
Calculation.
The postcalc logic for assigning rows and columns is similar to that of individual cells, however there is a simplified
Pa ge 340 of 684 Ta ble Pos t Ca lcula on
syntax for working with cell ranges, rows and columns. You could set each cell individually with multiple statements as
follows:
TABLE1(0,3) = TABLE1(0,1) / TABLE1(0,2);
TABLE1(1,3) = TABLE1(1,1) / TABLE1(1,2);
…
TABLE1(2,3) = TABLE1(2,1) / TABLE1(2,2);
TABLE1(20,3) = TABLE1(20,1) / TABLE1(20,2);
However, that would take a lot of code. Instead, you can specify this one statement using a range or a wild card. To use
a range, give the lower and upper limits separated by a colon:
TABLE1[0:20,3] = TABLE1[0:20,1] / TABLE1[0:20,2];
This means divide the cell in column 1 by the cell in column 2 and put the result in the cell in column 3 for each row from
0 to 20. You can also use a wildcard (an asterisk) to specify an entire row or column:
TABLE1[*,3] = TABLE1[*,1] / TABLE1[*,2];
Using an asterisk in place of the row index means that the operation applies to all rows in the table. In this case it is the
same as specifying the range 0:20 since the table has 21 rows. An asterisk can also be used in place of the column
index to specify every column in a table:
TABLE1[1,*] = TABLE1[2,*]; { copy row 2 into row 1 }
Note that when working with rows, columns and ranges, you must use square brackets "[]" rather than parentheses "( )".
Parentheses may only be used when specifying individual cells.
You can only assign ranges or wildcards to each other if the dimensions of the ranges match. For example:
TABLE1[0:2, 0:3] = TABLE1[3:5, 0:3];
copies one 3 by 4 region of the table to another 3 by 4 region. However, the following code will fail since it attempts to
copy a 3 by 4 region to a 3 by 3 region:
TABLE1[0:2, 0:2]= TABLE1[3:5, 0:3]; { This does not work ! }
See also: Post Calculation For Individual Cells, Row and Column Indexing for Post Calculation
When a table includes percents, the row and column indices are a bit more complicated. Although the percent
rows/columns are interleaved with the rows/columns for the counts, the percent rows/columns are numbered after the
counts. In other words the index for the first percent row or column in a subtable always starts after the last count row or
column. For example, in the table below, the rows for the values of marital status under the male caption are counted
one after another as indices 0 through 4 and are then followed by the percent rows for marital status as indices 5 through
9. The first percent row is at index 5 which follows the last count row (Never Married) at index 4.
See also: Post Calculation For Individual Cells, Post Calculation For Rows, Columns and Ranges
This launches the program CSTab.exe to run with the parameters specified in the PFF file MyTabs.pff. Note that using
Start /wait is not strictly necessary; it simply ensures that the command does not terminate until CSTab.exe has
finished processing. This is useful when there are other commands that follow which depend on CSTab completing before
they can be executed.
You can create a PFF file in two ways:
Run the tabulation from CSPro. It will save the *.PFF file it generates in the same folder with your tabulation
application. The *.PFF will have the same name as the application with .PFF appended. Rename and modify this
file with a text editor (such as Notepad or Wordpad).
Create a new *.PFF file using a text editor.
The following shows an example of a tabulation PFF file. Note that a PFF file is not case sensitive. You can use any
combination of upper and lower case text.
[Run Information]
Version=CSPro 7.5
AppType=Tabulation
Operation=All
[Files]
Application=.\MyTabs.xtb
InputData=.\MyData.dat
Listing=.\MyTabs.xtb.lst
AreaNames=.\MyAreaNames.anm
OutputTBW=.\MyTables.xtb.tbw
[Parameters]
Pa ge 344 of 684 Run Produc on Ta bula ons
ViewListing=OnError
ViewResults=Yes
The [Run Information] block is required and must appear exactly as shown in the example above.
The [Files] block is required and defines all files used in the tabulation run. A description of the files is as follows:
Application = the tabulation edit application you created
InputData = the data file to be tabulated -- If there is more than one input data file, insert multiple InputData
lines.
Listing = a report of the tabulation processing
AreaNames = the areanames file used only if there is area processing
OutputTBW = the output formatted tables
If any required files are not coded or are missing, the file association dialog box will be displayed allowing you to fill in or
change the missing file names.
The [Parameters] block is optional. If allows to specify additional aspects of the tabulation run.
ViewListing = specifies how the tabulation run listing is displayed. If ViewListing is missing, Always is
assumed.
Always - the listing is always displayed
OnError - the listing is displayed only when an error or an invalid subscript warning occurred
Never - the listing is never displayed
ViewResults = specifies whether or not the formatted tables file (*.TBW) is displayed in TableViewer at the end
of the run. If ViewResults is missing, Yes is assumed.
Yes - the tables are displayed
No - the tables are not displayed
Run in parts allows you perform each of these processes separately. Run in parts is also used when you need to save
the intermediate .tab files for later use.
To run in parts interactively, from the File menu, select Run in Parts. Then select the process you want to run. (The
Consolidate process will be grayed out if you are not using area processing in your application.)
See also: Run Tabulate Interactively, Run Consolidate Interactively, Run Format Interactively, Run Tabulate in Batch,
Run Format in Batch, Run Consolidate in Batch
Input Data: The data file(s) being tabulated. There is NO required extension for CSPro data files. Multiple input data files
This launches the program CSTab.exe to run with the parameters specified in the PFF file MyTabs.pff. Note that using
Start /wait is not strictly necessary; it simply ensures that the command does not terminate until CSTab.exe has
finished processing. This is useful when there are other commands that follow which depend on CSTab completing before
they can be executed.
The [Run Information] block is required and must appear exactly as shown in the example above.
The [Files] block is required and defines all files used in the tabulation run. A description of the files is as follows:
Application = the tabulation application you created
InputData = the data file to be tabulated -- If there is more than one input data file, insert multiple InputData
lines.
Listing = a report of the tabulate process
TabOutputTAB = the output table matrices file
If any required files are not coded or are missing, the file association dialog box will be displayed allowing you to fill in or
change the missing file names.
The [Parameters] block is optional. If allows to specify additional aspects of the tabulation run.
ViewListing = specifies how the tabulation run listing is displayed. If ViewListing is missing, Always is
assumed.
Always - the listing is always displayed
OnError - the listing is displayed only when an error or an invalid subscript warning occurred
Never - the listing is never displayed
See also: Run Consolidate in Batch, Run Format in Batch
Input TAB: The tabulated table matrices file(s) to be consolidated. Multiple input *.TAB files can be selected using the
browse button. The *.TAB extension is required.
Output TAB: The consolidated table matrices file created [or replaced] from the input TAB file(s). This TAB name must
be different from the input name and the extension is required.
Listing File: The report generated for the run showing area codes found in the data and tables available for those codes.
Blank 'codes' indicate summary levels. The .LST extension is not required.
This is a sample of a listing file for a Consolidation process:
Process Messages
PROVINCE DISTRICT Table Names
-------- -------- -----------
TABLE1 TABLE2 TABLE3 TABLE4
1 TABLE1 TABLE2 TABLE3 TABLE4
1 1 TABLE1 TABLE2 TABLE3 TABLE4
1 2 TABLE1 TABLE2 TABLE3 TABLE4
1 3 TABLE1 TABLE2 TABLE3 TABLE4
1 4 TABLE1 TABLE2 TABLE3 TABLE4
1 5 TABLE1 TABLE2 TABLE3 TABLE4
. . .
This launches the program CSTab.exe to run with the parameters specified in the PFF file MyTabs.pff. Note that using
Start /wait is not strictly necessary; it simply ensures that the command does not terminate until CSTab.exe has
finished processing. This is useful when there are other commands that follow which depend on CSTab completing before
they can be executed.
You can create a PFF file in two ways:
Run the tabulation from CSPro. It will save the *.PFF file it generates in the same folder with your tabulation
application. The *.PFF will have the same name as the application with .PFF appended. Rename and modify this
file with a text editor (such as Notepad or Wordpad).
Create a new *.PFF file using a text editor.
The following shows an example of a PFF file for the consolidation process. Note that a PFF file is not case sensitive.
You can use any combination of upper and lower case text.
[Run Information]
Version=CSPro 7.5
AppType=Tabulation
Operation=Con
[Files]
Application=.\MyTabs.xtb
ConInputTAB=.\MyData.dat.tab
Listing=.\MyTabs.xtb.con.lst
ConOutputTAB=.\MyData.dat.con.tab
Pa ge 349 of 684 Run in Pa rts
[Parameters]
ViewListing=OnError
The [Run Information] block is required and must appear exactly as shown in the example above.
The [Files] block is required and defines all files used in the tabulation run. A description of the files is as follows:
Application = the tabulation application you created
ConInputTAB = the input table matrices file – These are the output from the tabulate process. If there are
multiple input table matrices files, insert multiple ConInputTab lines.
ConOutputTAB = the output table matrices file
Listing = a report of the consolidate process
If any required files are not coded or are missing, the file association dialog box will be displayed allowing you to fill in or
change the missing file names.
The [Parameters] block is optional. If allows to specify additional aspects of the tabulation run.
ViewListing = specifies how the tabulation run listing is displayed. If ViewListing is missing, Always is
assumed.
Always - the listing is always displayed
OnError - the listing is displayed only when an error occurred
Never - the listing is never displayed
See also: Run Tabulate in Batch, Run Format in Batch
Input TAB: The tabulated or consolidated matrices file. If application does not have area processing, it was created in
the Tabulate process, if it has area processing was performed, it was created in the Consolidate process. The .TAB
extension is required.
Area Names: Only used for applications with area processing. An Area Names File is used to associate the areas
codes in the TAB file with their descriptive text. The .ANM extension is required.
Output TBW: The CSPro Table Viewer file created by merging the matrices from the input TAB file, the table format
specifications from the tabulations application, and the optional area names. The .TBW extension is required.
See also: Run Tabulate Interactively, Run Consolidate Interactively, Introduction To Table Viewer
This launches the program CSTab.exe to run with the parameters specified in the PFF file MyTabs.pff. Note that using
Start /wait is not strictly necessary; it simply ensures that the command does not terminate until CSTab.exe has
finished processing. This is useful when there are other commands that follow which depend on CSTab completing before
they can be executed.
You can create a PFF file in two ways:
Run the tabulation from CSPro. It will save the *.PFF file it generates in the same folder with your tabulation
application. The *.PFF will have the same name as the application with .PFF appended. Rename and modify this
file with a text editor (such as Notepad or Wordpad).
Create a new *.PFF file using a text editor.
The following shows an example a PFF file for the format process. Note that a PFF file is not case sensitive. You can
use any combination of upper and lower case text.
[Run Information]
Version=CSPro 7.5
AppType=Tabulation
Operation=Format
[Files]
Application=.\MyTabs.xtb
FormatInputTAB=.\MyData.dat.con.tab
AreaNames=.\MyAreaNames.anm
Listing=.\MyTabs.xtb.fmt.lst
OutputTBW=.\MyTables.xtb.tbw
[Parameters]
ViewListing=OnError
ViewResults=No
The [Run Information] block is required and must appear exactly as shown in the example above.
The [Files] block is required and defines all files used in the tabulation run. A description of the files is as follows:
Application = the tabulation application you created
If any required files are not coded or are missing, the file association dialog box will be displayed allowing you to fill in or
change the missing file names.
The [Parameters] block is optional. If allows to specify additional aspects of the tabulation run.
ViewListing = specifies how the tabulation run listing is displayed. If ViewListing is missing, Always is
assumed.
Always - the listing is always displayed
OnError - the listing is displayed only when an error occurred
Never - the listing is never displayed
ViewResults = specifies whether or not the formatted tables file (*.TBW) is displayed in TableViewer at the end
of the run. If ViewResults is missing, Yes is assumed.
Yes - the tables are displayed
No - the tables are not displayed
See also: Run Tabulate in Batch, Run Consolidate in Batch
In the menu subtables are listed as they appear in the "big" table, from left to right then from top to bottom. The "Names"
of the subtables are created from the value sets present. If an item has only one value set then the name of the item
itself appears, e.g., P03_SEX in example. If an item has more than one value set then the name of the value set appears,
e.g., P04_AGE_VS2 in the example.
Any attributes entered in the "Entire Table" menu except "Unit Tallied" are also incorporated into the subtables. Entries
in this menu apply to all subtables in the table.
In the Tally Attributes (Table)
If an item or value is given for "Value Tallied" or "Weight" it is also MULTIPLIED by any corresponding value give
in any subtable.
Any entry in the "Universe" box is combined with any universe in a subtable using "and". In other words, both
universe criteria must be met in order for a tally to be made.
Of course, any checked "Special values" or "Lowest Break Level" (applicable only for area processing) always
refer to the entire table including any subtables.
With the exception of attributes applying to entire table, individual subtables can each be assigned different weights,
universes, etc.
See Also: Create Tabulations with Multiple Variables, Tally Attributes for a Table
If the "unit of tally" is either non-repeating record then only tally is made per case. If the "unit of tally" is the occurring
item AGES, then a tally is made for each occurrence of age.
See Also: Tabulations Using Relations
Complete – piped water inside the unit, and a private toilet inside the unit and bathing facilities inside the unit.
Some but not all –one or more of the three conditions above, but not all three.
None – none of the above conditions.
The variable "complete plumbing" does not exist in the main dictionary. It can be however, be determined based on the
values of the following three variables that are in the dictionary:
Rather than adding new variables to your existing dictionary, you can add new variables to the working storage
dictionary. The working storage dictionary is a second dictionary that is created automatically when you create a
tabulation application. This dictionary is similar to other dictionaries, however there is no data file associated with it. The
variables in the working storage dictionary must be set by program logic.
In this example, we will add the new "complete plumbing variable" to the working storage dictionary. The working storage
dictionary appears just below the main dictionary in the dictionary tree. Adding a new variable to the working storage
dictionary is the same as adding a variable to any dictionary. Right-click on the "Working Storage Record" under the
working storage dictionary in the dictionary tree and choose "Add Item".
Now drag the new variable onto the table just as if it were a variable in the main dictionary.
The above logic will be executed once for each housing record to set the value of complete plumbing for that household.
This will result in the following table:
If "Age" value set is dropped for the Person record and the "Work last week?" value set is dropped from the LFS record
then the following table is created.
Since the LFS record is the primary link, the tallies will be done for each occurrence of the LFS record. (This assumes
that every LFS record in the case can be linked to a Person record in the case and that no "universe" has been
specified.)
Note: The "Unit Tallied" must be the relation defined in the data dictionary. CSPro will automatically set the default to the
relation if such a relation exists.
Then drag the modified value set onto the table. For additional clarity you can format the stubs corresponding to the
subtotals to make them standout. In the table below, the font for the subtotal stubs is set to bold.
To create such a table, we need a value set that contains only the two categories divorced and not divorced. Creating the
category for "divorced" is simple, it includes only the value 2. The "not divorced" category, however, needs to contain the
value 1 (Married) plus the values 3, 4 and 5 (Separated, Widowed, Never Married). These values do not make up a single
range. In order to create the category you must create two entries in the value set, one that contains the value 1 and the
second that contains the values 3 through 5. The first entry must contain the label for the category and the label for the
second entry must be a single space character. CSPro recognizes the single space as a continuation of the previous
category rather than a separate category. In this case it will merge the value range in the second entry with that of the
first entry, creating a single category that includes the value 1 and the values 3 through 5.
One way to work around this is to modify the value set for Sex to remove the Male category. Note that removing the Male
category will also affect the Total (since males will no longer be counted). To fix this, add your own category to the Sex
value set called Total, which includes the values for Male and Female. Finally hide the system generated total in the
table (see Hide or Change the Position of the Total). The result is the following table:
A second approach is to use multiple subtables rather subgrouping. In our example you would drop the Literacy variable
twice onto the rows to make two separate subtables. This would create the following table:
Then edit the universe for the second subtable to only include males (see Restrict a Universe). In this case set the
universe to SEX=1. Next edit the universe for the third subtable to include only females (SEX=2). Note that you do not
need to edit the universe for the first subtable since you want to include both males and females (the entire population)
which is the default universe. Next, edit the captions of the subtables to be "Total", "Male" and "Female" (see Customize
Table Text). Finally right-click on the Literacy stub in the first (Total) subtable and set the font to bold. This results in the
following table:
Create a new value set that regroups the original values for marital status into these new categories.
Complete – piped water inside the unit (source of water = 1), and a private toilet inside the unit (type of toilet = 1)
and bathing facilities inside the unit (type of bathing = 1).
Some but not all – any one of the three variables (piped water, flush toilet, bathing facilities) inside the unit but
not all three.
None – all other cases.
To create the table for complete plumbing, use 3 subtables, one for each of the 3 possible values of the recoded variable
(complete, some but not all, none). Each of these subtables should have only one row. The easiest way to create a
Pa ge 369 of 684 Ta ble Tips a nd Tricks
subtable with one row is to drag a variable whose value set has only one category onto the table. If no such value set
exists, you can create one. In this case, create the value set on the variable "source of water". In fact, you could use any
variable on the same record as the variables we are using in the recode (the housing record in this case). Note that the
single category in this value set must include all valid values for the variable so that all housing units will be counted
when tabulating the variable.
Drag this new value set onto the table 4 times, once for each of three possible values of complete plumbing and once for
the total.
Since we only need one row for each subtable, hide the system generated total in each of the four subtables (see Hide or
Change the Position of the Total). Also hide the captions for each of the subtables (see Formats for a Part of a Table).
Now set the universes for the subtables to correspond to the appropriate values of the complete plumbing recode. The
first subtable will be the total so it should use the default universe, which includes all cases. The second subtable
represents "complete plumbing" and must include only those cases where piped water, flush toilet AND bathing facilities
all equal 1. The third subtable represents "some but not all" and must include the cases where one or two of the
variables are equal to 1 but not all of them. The fourth subtable represents "none" and must include all cases where none
of the variables are equal to 1. For more information on setting the universe on a subtable see Restrict a Universe. This
produces the following table:
Note that often it is preferable to create recoded variables in a batch edit program rather than during tabulation. This
recode could have been accomplished by adding a new variable to the dictionary for complete plumbing and writing a
batch edit program to set the value of this variable for each case. Then the new complete plumbing variable could be
dropped on the table directly rather than creating multiple subtables with universes. This makes the creation of the table
much simpler although it involves creating a batch edit application and writing a small amount of logic. Using batch edit
is best when the recoded variable will be used in multiple tables.
Another alternative would be to create the recoded variable in the working storage dictionary and use tablogic to set its
value to each case. See Table Logic (tablogic) for more information.
Universe
Tabulate only households where at least one child was born last year with the following universe (assuming that
CHILDREN_BORN_LAST_YEAR is a variable on the person record):
count(CHILDREN_BORN_LAST_YEAR) > 0
Restrict a table to only households with total household income of greater than $20,000 by using the following universe:
This assumes that PERSONAL_INCOME is a variable on the person record. The sum of the incomes of each person in
the household is the total income for the entire household.
Restrict a table to only households where the head of household is female with the following universe:
SEX(1) = 2
This assumes that the head of the household is always the first person record (occurrence 1). This will only be true if
your data entry program and/or edit program ensure this. If this is not true, then you would need a more complicated
expression such as:
This will be true only if there is a person in the household who is the head of household and is female. This assumes that
there is only one head of household in the household, which should be the case for properly edited data.
Value Tallied
Often fertility information is captured separately for male and female children and you wish to tabulate it for both sexes.
For example, you have variables for MALE_CHILDREN_BORN and FEMALE_CHILDREN_BORN but no variable for total children
born and you want to count the total number of children born. You can use the sum of the two variables in the value
tallied:
MALE_CHILDREN_BORN + FEMALE_CHILDREN_BORN
Note that if one or more of the variables is a special value, the counts will not be correct. This is because the sum of a
special value and a number is a special value.
See also: Restrict a Universe, Tabulate Values Instead of Frequencies, Tally Attributes for a Table
Function/Statement Description
abs Returns the absolute value of a numeric expression.
accept Returns the number of a choice from a list made by the data entry
operator.
adjlba Adjusts the lower bound of the CMC of an event based on an age.
adjlbi Adjusts the lower bound of the CMC of an event offset by a number of
months.
adjuba Adjusts the upper bound of the CMC of an event based on an age.
adjubi Adjusts the upper bound of the CMC of an event offset by a number of
months.
advance Moves forward field by field to a specified field during data entry.
alias Creates an aliased name for a dictionary item, typically to shorten or
standardize names.
alpha Declares alphanumeric variables used in the application.
array Declares an array object for storing a collection of numbers or strings.
ask Conditionally jumps forward to the next field during data entry.
audio Declares an audio object for recording or playing audio.
average Returns the average of an item that occurs multiple times.
dateadd Calculates a new date from a starting date and a period of elapsed
time.
datediff Calculates the difference between two dates.
datevalid Determines whether a date in the format YYYYMMDD is valid.
decompress Decompresses a .zip file, extracting some number of files.
delcase Marks a case for deletion in an external file based on a key.
delete Removes a record or item occurrence from the current case.
demode Returns the current data entry mode.
diagnostics Returns troubleshooting information about CSPro.
dircreate Creates a new directory with the given directory name.
dirdelete Deletes an empty directory or group of empty directories.
direxist Determines whether a directory exists.
dirlist Returns a list containing the file and directory listing of the specified
directory.
display Displays a message (this function has been superceded by errmsg).
do Executes one or more statements repeatedly while a logical condition
remains true or until a logical condition is no longer true.
key Returns the key (ID string) of the currently loaded case.
keylist Returns a list containing the keys (ID strings) of cases in a file.
killfocus Declares that the following statements are executed after the object
stops being active.
next Ends a do, while, or for loop early and continues execution with
the next iteration of the loop.
nmembers Returns the number of cases selected during a selcase operation.
noccurs Returns the number of occurrences for a repeating form or roster.
noinput Prevents input for the current field during data entry.
numeric Declares numeric variables used in the application.
OnChangeLanguage Provides control over actions to occur after a user has changed the
language.
OnChar Allows users to trap characters in order to perform special actions or
to change the action of the character.
onfocus Declares that the following statements are executed just before an
object becomes active.
OnKey Allows users to trap keystrokes in order to perform special actions or
to change the action of the key.
OnRefused Provides a way to override the selection of a refused value.
onoccchange Declares that the following statements are executed when the
group's current occurrence changes.
OnStop Provides control over stopping or exiting data entry.
OnSyncMessage Allows users to respond to a message sent by a Bluetooth client using
syncmessage.
OnSystemMessage Provides a way to override the displaying of system error messages.
open Opens a dictionary or external file.
savepartial Saves the current case as a partially added, modified, or verified case.
savesetting Saves, using an attribute-value pair, a setting retrievable with the
loadsetting function.
seed Initializes the random number generator to a particular starting
place.
seek Searches a multiply occurring item for an item that meets a condition.
seekmax Searches a multiply occurring item for an item with the highest value
that meets a condition.
seekmin Searches a multiply occurring item for an item with the lowest value
that meets a condition.
selcase Allows a data entry operator to select and load a case from an
external file.
set Sets the values of various system parameters.
set access Sets the order in which cases in a data file are processed.
set attributes Modifies the field properties in a data entry application.
set behavior Modifies parameters related to field validation or sets export
parameters.
set first Positions the first case in a data file as the next case to be processed.
set errmsg Modifies the way that error messages are displayed on desktop
CSEntry.
set last Positions the last case in a data file as the next case to be processed.
setcapturepos Allows the manual declaration of the window coordinates of extended
controls.
setcapturetype Sets the capture type associated with a field.
setcaselabel Sets a case's label (the text that appears in a case listing).
setfile Assigns a new physical file to a dictionary or declared file.
setfont Changes the default display font in data entry applications.
setlanguage Modifies the current language being used while in data entry mode.
setlb Returns the lower bound of the CMC of an event.
setocclabel Sets the occurrence label associated with a repeating dictionary item
or a roster.
setoperatorid Changes the operator ID associated with the data entry session.
setorientation Modifies the display orientation, allowing dynamic rotation of the
display.
setoutput Changes the output file where cases are saved in a batch application.
setproperty Modifies an application or field property.
valueset Declares a valueset object for storing the contents of a dynamic value
set.
view Launches the system's default viewer to display a file or website.
visualvalue Returns the value of a data item prior to its input.
warning Displays a "soft check" message only when the operator is not
advancing in the case.
when Executes a statement based on the value of one or more other
variables.
while Executes one or more statements repeatedly while a logical condition
remains true.
write Write to a text file.
writecase Writes a case from memory to an external file.
In addition to the list of reserved words below, there are a few reserved words used internally by CSPro. When you are
writing logic, reserved words are shown in blue, so if you attempt to create a variable using one of these reserved words,
you will know this name is not available when it turns blue.
In the table below, reserved words have been linked to the function of the same name, if one exists. If no link exists for a
word, it is either because there was more than one association for the word or the word is for internal usage only.
Deprecated Features
As part of a process to simplify and make the CSPro programming language more efficient, some features are going to
be deprecated. In future versions, these features may no longer be supported and in fact may be removed from CSPro
altogether. When compiling code, you will see deprecation warnings if you are using these features. To turn off these
warnings, use the Deprecation Warnings choices on the Options menu.
* These features do not have workarounds. If you think that you need this functionality, email [email protected]
and describe why this feature should not be deprecated.
Description:
The numeric statement declares temporary numeric variables used only in this application. They will not be saved to
a data file defined by a dictionary. A numeric variable is an integer or decimal number significant to 15 digits.
Local variables, with a limited scope, can also be declared in functions and PROCs. The variable will only exist and
be accessible from within the function or PROC where it is declared. (In the case of a PROC, it is available in all
events after it is declared. For example, a numeric declared in the preproc will still be accessible in the postproc.) A
local variable cannot have the same name as a globally declared variable.
Example:
PROC GLOBAL
numeric NumOfKids;
PROC CHILDREN
NumOfKids = NumOfKids + 1;
See also: Array Statement, Alpha Statement, String Statement, Ensure Statement
String Statement
Format:
string var-1[, var-2[..., var-n]];
Description:
The string statement is used to define alphanumeric variables temporarily used in the application. It is similar to the
alpha statement but, unlike variables created with that statement, variables created as strings have no fixed length.
Strings can be used anywhere an alpha variable can be used. (Strings were only introduced starting in CSPro 6.0 so
much of the documentation refers to alpha variables.) Strings start out with a length of 0 and then automatically resize
to match whatever is assigned to it. This feature make strings the preferred text variable to use, unless the length of
the string is needed for formatting and calculation purposes.
Examples:
PROC GLOBAL
alpha (10) myAlpha;
string myString;
PROC EXAMPLE
// myString starts out empty, with length 0
myString = "This is a test." // myString's length is now 15
myAlpha = "Hello";
myString = myAlpha; // myString's length is now 10: "Hello "
myString = "Hello"; // myString's length is now 5: "Hello"
Config Statement
Format
config variable1『 ,...,variableN』 ;
Description
The config statement declares a string variable. When the application begins, the variable's value is automatically set
to a value defined in the configuration settings. If there is no value defined, the compiler will issue a warning message and
the string variable will be blank. The statement must be coded in the PROC GLOBAL section of the application.
You can use a config variable anywhere in logic as you would a string variable.
Example
PROC GLOBAL
config serverUrl;
PROC EXAMPLE
syncconnect(CSWeb,serverUrl);
Alpha Statement
Feature Upgrade: Since CSPro 6.0, you can now use variable-length strings. You are encouraged to use strings instead
of fixed-width alpha variables whenever possible.
Format:
alpha [(len)] var-1[, var-2[..., var-n]];
Description:
The alpha statement is used to define alphanumeric variables used in the application. The len is the number of
characters in the variable. The len applies to all variables which follow in the same statement. If no len is given, 16 is
assumed. The maximum string length that can be declared is 8,192. If you attempt to assign to an alpha variable a
string that is longer than the variable's size, the string will be truncated from the right. Conversely, it you assign a
string that is shorter than the variable's size, the trailing character positions will be blank-filled.
The following two examples, using the declaration of x below, should clarify this point:
Example 1:
PROC GLOBAL
alpha(10) x,y;
PROC A1
x = "hi mom";
x will now equal "hi mom "
1234567890
Example 2:
PROC GLOBAL
alpha (3) reply;
alpha flag;
PROC Q5
if q5 = 1 then
reply = "Yes";
flag = "Y";
endif;
If the user attempts to assign the string "Not reported" to the variable "reply," CSPro would place the letters "Not" in
the variable and drop the remaining characters of the string.
See also: String Statement, Array Statement, Numeric Statement,
Relation Statement
Format:
relation relation-name primary to secondary-1 method
[to secondary-2 method] ... [to secondary-n method];
where method is
parallel | linked by arith-exp | where condition
Description:
The relation statement allows you define additional relations beyond those defined in the data dictionary.
The relation-name is a unique CSPro name which contains only letters, numbers, or the underscore ('_') character. It
must begin with a letter.
The primary is the name of a multiply occurring record or item. Items defined as secondary are linked to the primary
by the method specified.
The secondary is the name of a multiply occurring record or item which is linked to the primary.
The method type is specified by one of the keywords parallel, linked by, or where.
In the parallel method corresponding occurrences of the primary record or item and secondary record or item are
linked, that is first occurrences are linked, second occurrences are linked and so on.
In the linked by method the value of the arithmetic expression containing values from one record item is a pointer to
the occurrence in the other record or item.
In the where method the value of an item on the primary record is compared to the value of an item on the secondary
record. If the values are equal, the records are linked.
Example 1:
PROC GLOBAL
relation PERSON POP1 to POP2 parallel
to POP3 parallel;
Example 2:
PROC GLOBAL
relation MOTHER-CHILD CHILD to MOTHER linked by MOTHER_LINE;
Function Statement
Format
function 『 return_type』 function_name(『 parameter1_type parameter1_name, ..., parameterN_type
parameterN_name』 )
// statements
『 function_name = return_value;』
end;
Description
The function statement defines a user-defined function with the name function_name. Once defined, the function can
be called in other user-defined functions or in procedures throughout your application.
Numeric expressions, string expressions, objects (such as arrays and file handlers), and function pointers can be
passed to a user-defined function as arguments. These parameters are defined by specifying the variable type (e.g.,
parameter1_type) and the name of the variable (e.g., parameter1_name). If no type is specified, the variable will be
considered numeric. Preceding the parameter type with the optional keyword marks the parameter as an optional
parameter for which a caller does not need to supply a corresponding argument.
The names used in the parameter list of a function are local to the function. They may not be the same as names that
are defined in any dictionary or in PROC GLOBAL, but the names can be reused in other functions or procedures.
Numeric, string, and alphanumeric variables are local to the function. That is, if a variable is passed as an argument, its
value in the rest of the application will not be changed by actions within the function (this is called "pass by value"). On
the other hand, objects (such as arrays and file handlers) passed as arguments refer to the source variable and
interactions on the variable affect the source variable (this is called "pass by reference"). If you want to pass a numeric or
string variable by reference, you can use the ref keyword to signify that changes made in the function should affect the
source variable.
Other than arrays, parameters are defined the same way you would in other parts of logic. However, with arrays, you do
not define the size of the array because the array will match the size of the array passed as an argument. By default, an
array is one-dimensional. To specify more than one dimension, use parentheses with the number of dimensions specified
using commas. For example, a three-dimensional array must include (,,) after the parameter name. Within the
function, the function length returns the dimension sizes of the passed array.
Functions always return a value, either a numeric, alphanumeric, or string value. If return_type is not specified, the
function will by default return a numeric value. To assign the function's return value, assign a value to function_name. If
no return value is specified, the value default or a blank string is returned, depending on the return type. The return
value can also be specified using the exit statement.
Example 1
Example 2
PROC GLOBAL
function string GetNameOfFirstFemale()
numeric female_index = seek(SEX = 2);
if female_index > 0 then
GetNameOfFirstFemale = NAME(female_index);
endif;
end;
Example 3
PROC GLOBAL
function numeric SumList(list numeric numeric_list)
numeric summed_value = 0;
do numeric counter = 1 while counter <= length(numeric_list)
inc(summed_value, numeric_list(counter));
enddo;
SumList = summed_value;
end;
Example 4
PROC GLOBAL
function string CapitalizeFirstLetters(string text)
do numeric counter = 1 while counter <= length(text)
if counter = 1 or text[( counter - 1 ):1] = " " then
text[counter:1] = toupper(text[counter:1]);
endif;
enddo;
CapitalizeFirstLetters = text;
end;
See also: User-Defined Functions, Optional Function Parameters, Passing Function Arguments by Reference,
Additional Examples of User-Defined Functions, Exit Statement
Description
User-defined functions can have optional parameters, which you can specify in two ways. The first way is to use the
optional keyword before the parameter type. This works with all variable types (numerics, strings, and objects) except
for arrays, which cannot be used as optional parameters. The second way is by using an = sign when specifying the
parameter. This works for numeric and string parameters only.
When you call the function, you do not need to supply arguments for optional parameters. If not otherwise defined with
the = sign, a number will be set to notappl and a string will be set to a blank string. Any object (such as a hashmap,
list, etc.) will be set to its default state (an empty hashmap, a blank list, etc.). Once you specify that a parameter is
optional, then all parameters after it must also be optional.
Using an = Sign
Using an = sign is for numeric and string parameters only and cannot be used for objects. It is useful for assigning a
numeric constant or string literal as the default value:
When using the = sign, you do not need to use the optional keyword, but you can specify it if you want.
Example 1
PROC AGE
errmsg("%s is %d years younger than the head of household (%s)",
GetName(), AGE(1) - AGE, GetName(1));
Example 2
PROC GLOBAL
// This function calculates the age based on the birth date and the enumeration date.
// If the birth date is not passed to the function or is not set, the function returns
// a calculated age of 998 which is used to indicate that the age could not be calculated.
// If the enumeration date is not passed to the function, the function calculates the
// date based on the default date of March 11, 2020; otherwise it uses the date passed
// as a function argument.
function numeric CalculateAge(optional numeric birth_date, optional numeric enumeration_date =
20200311)
if special(birth_date) then
exit 998;
else
exit datediff(birth_date, enumeration_date, "y");
endif;
end;
// ...
DOB = 19520218;
ACTUAL_ENUMERATION_DATE = 20200401;
errmsg("%d", CalculateAge(DOB, ACTUAL_ENUMERATION_DATE)); // 68
errmsg("%d", CalculateAge(DOB)); // 67
errmsg("%d", CalculateAge()); // 998
Description
By default, numeric and string variables that are passed as an argument to a function are not changed by actions within
the function. This is called "pass by value." On the other hand, objects (such as arrays and file handlers) passed as
arguments refer to the source variable and interactions on the variable within the function affect the source variable. This
is called "pass by reference."
If you want to pass a numeric or string variable by reference, you can use the ref keyword to signify that changes made
in the function should affect the source variable. You cannot use ref with expressions, only with numeric and string
variables (such as dictionary items, array cells, list cells, etc.).
Example 1
function MyFunc(numeric numeric_value)
numeric_value = 999;
end;
// ...
numeric test_value = 100;
// the value of test_value (100) is passed (copied) to the function
// and the function has no effect on the contents of test_value
MyFunc(test_value);
errmsg("%d", test_value); // pass by value, prints 100
// by reference (ref) indicates the variable location itself is passed
// to the function; performing any operation on the variable inside the
// function is the same as performing that operation outside of the function
MyFunc(ref test_value);
errmsg("%d", test_value); // pass by reference, prints 999
Example 2
Recursive Functions
function numeric NumberDescendantsInHH(numeric ptrPerson)
numeric cntDescendants;
do numeric hhCtr = 1 while hhCtr <= totocc(PERSON_REC)
if LN_FATHER(hhCtr) = ptrPerson or LN_MOTHER(hhCtr) = ptrPerson then
inc(cntDescendants,NumberDescendantsInHH(hhCtr));
endif;
enddo;
NumberDescendantsInHH = cntDescendants;
end;
Function Pointers
PROC ANALYSIS
numeric cntTeenagers = FilteredCountInHH(IsTeenager);
numeric cntFertileWomen = FilteredCountInHH(IsFertileWoman);
See also: Function Statement, Optional Function Parameters, Passing Function Arguments by Reference
variable_name.function_name();
The following objects have functions that can be called on the object: array, audio, file, hashmap, list, map, pff,
SystemApp, and valueset. A list of available functions can be found in each of those summary pages.
One way to understand dot notation is to think of it as a shorthand way to specify what variable should be used when
performing an operation. For example, here are two ways to work through the elements of a list:
Unlike other CSPro functions, the functions that are associated with an object will not show as blue and they are not
reserved words.
Using a dot specifies that you are accessing a function or variable (property) of the object. For example, a value set
object contains two internal variables, codes and labels, both of which are lists, that can be accessed as in:
Description
The array statement creates an array with the name array_name. Only one array at a time can be declared with the
array statement. The array name must be unique and must contain only letters, numbers, or the underscore character.
The name must begin with a letter. You can declare arrays in PROC GLOBAL or locally in functions or procedures.
Arrays can be numeric, alphanumeric or string. By default an array is numeric, but the type can be modified by
specifying the array_type. If creating an alphanumeric array, the length of each array element can be specified by
definining the array_type as follows:
alpha 『 (array_length)』
The size of each dimension is specified by supplying a constant positive numeric value, dimension1 to dimensionN. An
array must have at least one dimension. CSPro supports arrays of an unlimited number of dimensions. A previously
defined numeric value can also be used to specify the dimension size. The length function can be used to query the
size of a dimension.
The initial values of the elements of an array can be set when declaring the array by listing each value, separated by a
comma. If some values are defined and then followed by ..., that set of values will be used over and over until the entire
array has been initialized.
With a numeric array, each element starts with the value 0. For alphanumeric and string arrays, each element starts as
a blank string. If using a numeric saved array, the initial array contents are default.
The optional keyword save indicates that the array values should be saved to a file and loaded from that file when the
program is run again. This allows you to maintain the values of the arrays across multiple runs of the same program.
When one or more arrays in the program are marked with save, the first time the application is run, a saved array file is
created and the values of the arrays are written to the file at the end of program execution. On consecutive runs of the
program, the initial values of the arrays are read in from the file. This is particularly useful for setting the initial values of
hot decks. In this scenario, the program is run twice. The first run fills the hot deck and saves the hot deck array to the
file. The second run loads the values saved from the first run and uses them as the initial values for the hot deck for
imputation. See Initialize Hot Decks in Program Logic for more information.
All arrays marked with save in the application are written to the same file. This file has the same name as the application
but with a .sva file extension appended to it (for example, MyApplication.bch.sva). Saved arrays are not supported in data
entry applications.
Example 1 (Numeric)
Example 2 (String)
PROC GLOBAL
array string Months(12) = "Jan", "Feb", "Mar", "Apr", "May", "Jun",
"Jul", "Aug", "Sep", "Oct", "Nov", "Dec";
function string FormatDate()
FormatDate = maketext("%d %s %d", sysdate("DD"), Months(sysdate("MM")), sysdate("YYYY"));
end;
Example 3 (Assignment)
numeric NumberProvinces = 12;
array numeric SexCountsByProvince(NumberProvinces, 2); // province by sex
PROC SEX
inc(SexCountsByProvince(PROVINCE, SEX));
PROC DISPLAY_COUNTS
do numeric counter = 1 while counter <= SexCountsByProvince.length(1)
errmsg("Province %d has %d males and %d females", counter,
SexCountsByProvince(counter, 1), SexCountsByProvince(counter, 2));
enddo;
See also: Array Object, HashMap Object, List Object, Numeric Statement, Alpha Statement, String Statement, Saved
Arrays File (.sva)
Array.Length Function
Format
i = array_name.length(『 dimension』 );
Description
The array.length function returns the size of an array dimension. When applied to a one-dimensional array, the
dimension does not need to be specified.
Return Value
The function returns the length of the array dimension.
Example
Array.Clear Function
Format
b = array_name.clear();
Description
The array.clear function resets all values of an array to their default value. For numeric arrays, this is 0, unless the
array is a saved array, in which case the value is default. For alphanumeric and string arrays, the default value is a
blank string.
Return Value
The function returns a logical value of 1 (true).
Example
array string HouseholdMemberNames(100);
// clear the array of household member names so that
// any previously stored names are removed
HouseholdMemberNames.clear();
do numeric counter = 1 while counter <= count(NAME)
HouseholdMemberNames(counter) = NAME(counter);
enddo;
numeric selection = showarray(HouseholdMemberNames, title("Select a Household Member"));
Description
The audio statement creates an audio object with the name audio_name. The audio name must be unique and must
contain only letters, numbers, or the underscore character. The name must begin with a letter. You can declare audio
objects globally in PROC GLOBAL or locally in functions or procedures.
Example
Audio.Load Function
Format
b = audio_name.load(audio_filename);
Description
The audio.load function reads the audio file named audio_filename and stores the audio data contained in the file in
an audio object. The loaded audio can then be played back using audio.play or combined with other audio using
audio.concat.
CSPro can load and play most audio files in any of the following formats: m4a, mp4, mp3, wav, 3gp, flac, ogg, mkv.
Return Value
The function returns a logical value of 1 (true) if the file was loaded and 0 (false) if the file could not be loaded.
Example
audio recording;
if recording.load("myrecording.m4a") then
recording.play();
endif;
Audio.Save Function
Format
b = audio_name.save(audio_filename);
Description
The audio.save function saves the audio recording in an audio object to the file specified by audio_filename.
Audio recorded using the functions audio.record and audio.recordInteractive is stored in m4a format (AAC
encoded in an MPEG-4 container). Audio in this format should be saved with the extension .m4a so that it will be
recognized by media players.
Return Value
The function returns a logical value of 1 (true) if the file was successfully saved or 0 (false) otherwise.
Example
PROC GLOBAL
audio recordingQ1;
PROC Q1
preproc
recordingQ1.record();
postproc
recordingQ1.stop();
string audioFilename = maketext("recording%v.m4a", HOUSEHOLD_ID);
recordingQ1.save(audioFilename);
Audio.Play Function
Format
b = audio_name.play(『 message』 );
CSPro can load and play most audio files in any of the following formats: m4a, mp4, mp3, wav, 3gp, flac, ogg, mkv.
However, only m4a files (AAC encoded audio in an MPEG-4 container) can be used with the functions audio.concat,
audio.length, audio.record, and audio.recordInteractive.
Return Value
The function returns a logical value of 1 (true) if the audio can be played and a logical value of 0 (false) otherwise.
Example
audio recording;
if recording.load("myrecording.m4a") then
recording.play();
endif;
See also: Audio Object, Audio.Load Function, Audio.Record Function, Audio.RecordInteractive Function
Audio.RecordInteractive Function
Format
d = audio_name.recordInteractive(『 message』 );
Description
The audio.recordInteractive function launches an interactive audio recorder that stores a recording to an audio
object. When the function is called, the audio recorder screen is displayed. It contains record, save, and clear buttons
that control the audio recording. If the optional string argument message is specified, the message will be displayed at
the top of the recorder screen. The audio.recordInteractive function does not return until the user has finished
recording and taps the save button. This is useful for synchronous recording, for example to have the interviewer record
the response to a single question. To record the audio in the background while the user continues to interact with the
survey, use the function audio.record.
The recorded audio is stored in the audio object and can be played back using audio.play or saved to a file using
audio.save. If the audio object already contains an audio recording, the recorded audio will be appended to the existing
recording. To replace the existing recording rather than adding to it, call the audio.clear function before calling
audio.recordInteractive.
Recorded audio is stored in m4a format (AAC encoded in an MPEG-4 container). Files should be saved with the
extension .m4a so that they will be recognized by media players.
Audio recording is only supported on Android. This function will do nothing on Windows.
Return Value
The function returns the length of the recorded audio in seconds (and fractional seconds) or default if there was an error
during recording.
Example
Audio.Record Function
Format
b = audio_name.record(『 seconds』 );
Description
The audio.record function starts recording audio in the background and stores the recorded audio in an audio object.
The call to record begins audio recording and returns immediately. Recording continues until the audio.stop function is
called. If the optional parameter seconds is specified, recording will stop after the specified number of seconds have
elapsed—or audio.stop—is called, whichever action comes first.
The recorded audio is stored in the audio object and can be played back using audio.play or saved to a file using
audio.save. If the audio object already contains an audio recording, the recorded audio will be appended to the existing
recording. To replace the existing recording rather than adding to it, call the audio.clear function before calling
audio.record.
Recorded audio is stored in m4a format (AAC encoded in an MPEG-4 container). Files should be saved with the
extension .m4a so that they will be recognized by media players.
Audio recording is only supported on Android. This function will do nothing on Windows.
Return Value
The function returns a logical value of 1 (true) if the recording is started successfully and 0 (false) otherwise.
Example
logic>
PROC GLOBAL
audio recordingQ1;
PROC Q1
preproc
recordingQ1.record();
postproc
recordingQ1.stop();
string audioFilename = maketext("recording%v.m4a", HOUSEHOLD_ID);
recordingQ1.save(audioFilename);
See also: Audio Object, Audio.Stop Function, Audio.RecordInteractive Function, Audio.Clear Function
Audio.Stop Function
Format
d = audio_name.stop();
Description
The audio.stop function ends the background recording session for an audio object. Background recording is started
with the function audio.record. The call to audio.stop should be made to finish the recording session started with
audio.record. This function is not used for interactive recording with audio.recordInteractive.
Return Value
The function returns the duration of the recorded audio in seconds (and fractional seconds) or default if the audio failed
to record.
Example
Audio.Concat Function
Format
b = audio_name.concat(audio_filename ‖ audio_object);
Description
The audio.concat function appends an audio recording stored in a file specified by the argument audio_filename or an
audio object specified by the argument audio_object to the current recording in an audio object. The result is a new
recording that combines both recordings and is stored in the initial audio object.
The audio.concat function only works with audio files stored in m4a format (AAC encoded audio stored in an MPEG-4
container). This is the format of files recorded in CSPro using the audio.record and audio.recordInteractive
functions. Audio files that come from sources other than CSPro will most likely not work with audio.concat.
Return Value
The function returns a logical value of 1 (true) if the recordings were concatenated successfully and a logical value of 0
(false) if there was an error combining the recordings.
Example 1
audio recording;
recording.load("recording1.m4a");
recording.concat("recording2.m4a");
recording.save("combined.m4a");
Example 2
Audio.Length Function
Format
d = audio_name.length();
Description
The audio.length function returns the length in seconds of the recording in an audio object.
The audio.length function only supports audio recorded as AAC encoded audio in an mp4 container (.m4a). This is the
format used for audio recorded by CSPro so it will work correctly for audio recorded using the audio.record or
audio.recordInteractive. The length function will return default for audio files recorded by other tools that use a
different format such as mp3 or wav.
Return Value
The function returns the duration in seconds (and fractional seconds) of the audio recording or default if the duration
cannot be computed.
Example
audio recording;
numeric seconds = recording.recordinteractive();
if seconds > 0 then
recording.save("recording.m4a");
else
errmsg("No audio recorded. Please try again");
reenter;
endif;
Audio.Clear Function
Format
b = audio_name.clear();
Description
The audio.clear function removes the current recording from an audio object making it empty.
Return Value
The function returns a logical value of 1 (true).
Example
Functionality
Function Description
barcode.read Reads a barcode and returns its value as a string.
In addition to barcode functions accessible using logic, you can generate a QR code that can be used to install
applications on mobile devices.
Example
PROC BLOOD_SAMPLE
preproc
if BLOOD_SAMPLE = "" then
BLOOD_SAMPLE = barcode.read("Scan the blood sample barcode");
endif;
Barcode.Read Function
Format
s = barcode.read(『 message』 );
Description
The barcode.read function starts an Android device's camera and allows the operator to scan a barcode. The barcode
can be in several formats, including QR codes. An optional string expression, message, is displayed on top of the
camera view and can be used to display instructions to the operator.
This function only works on Android devices and returns a blank string if used on another device.
Return Value
The function returns a string value containing the barcode's value. If the operator canceled before a barcode could be
successfully read, the function returns a blank string.
Example
Description
The file statement creates a file handler that is used to read from and write to text files that are not not associated with
data dictionaries. file_handler1 to file_handlerN are CSPro names that must be unique and must contain only letters,
numbers, or the underscore character. The names must begin with a letter. You can declare files globally in PROC
GLOBAL or locally in functions or procedures.
If declared globally, the physical name of the file can be specified in the Define File Associations dialog when the
application is run. The name can also be specified in a PFF file or at runtime by using the open or setfile functions.
Example
PROC GLOBAL
file VacantHouseholdsReportFile;
File.Open Function
Format
b = file_handler.open(file_name『 , update ‖ append ‖ create』 );
Description
The open function associates the file file_handler to a text file on the disk. The file_name argument is a string
expression containing the file name of the file to be associated with the file. This function is equivalent to the setfile
function.
Using the keyword update, append or create is optional. If no keyword is used, the file is opened in update mode.
If update is used, you are positioned at the beginning of the file. If append is used, the file's contents are not changed
and you are positioned at the end of the file. If create is used, all previous content in the file is removed and you are
positioned at the beginning of the file.
If create or append is used and the file does not already exist, a new empty file will be created. If update is used and
the file does not already exist, the function will fail and return 0.
Return Value
The function returns a logical value of 1 (true) if the new file is successfully assigned and 0 (false) otherwise.
File Example
Pa ge 410 of 684 File Object
file household_report_file;
household_report_file.open("Household Report.txt", create);
File.Close Function
Format
b = file_handler.close();
Description
The file.close function closes the text file associated with file_handler. You must close a file before using it in any
other application. If you do not close the file explicitly, it will be closed when the CSPro application terminates. This
function is equivalent to the close function.
Return Value
The function returns a logical value of 1 (true) if successful and 0 (false) otherwise.
Example
file report_file;
// ...
report_file.close();
view(filename(report_file));
File.Read Function
Format
b = file_handler.read(string_variable ‖ string_list);
Description
The file.read function reads one or more lines of text from the file associated with file_handler. After the read the file
pointer is positioned to the next line in the file. This function reads lines sequentially. This function is equivalent to the
fileread function.
If a string_variable is provided as a argument, a single line of text is read and placed in the variable, which is either a
temporary string variable or an alphanumeric dictionary item. If the dictionary item is longer than the line of text, blanks
will be added at the end. If the item is shorter, the line of text will be truncated.
Alternatively, a string list can be used as an argument. In this case, all remaining lines in the file are read and stored in
string_list.
Return Value
The function returns a logical value of 1 (true) if successful and 0 (false) otherwise.
File.Write Function
Format
b = file_handler.write(message『 , argument1, ..., argumentN』 );
Description
The file.write function writes one or more lines of text to the file associated with file_handler. This function is
equivalent to the filewrite function.
The message is either a string expression or a numeric message number that contains the text that is written to the file.
If the text contains any message formatters, the optional arguments argument1 to argumentN will be inserted into the
text. There are some additional options for file output:
If you want to break a line of text into two lines, place \n where you want the line divided.
If you want a text line to begin on a new page, place \f at the beginning of the text string.
To output \n or \f as text instead of a new line or a new page, use a double backslash (e.g., \\n).
Alternatively, the message can be a string list. If a list is provided, then each string contained in the list is written to the
file, allowing for the output of multiple lines of text with one function call.
Return Value
The function returns a logical value of 1 (true) if successful and 0 (false) otherwise.
Example
See also: File Object, File.Read Function, FileWrite Function, Message Formatting Options, Encode Function
Description
The hashmap statement creates a hashmap with the name hashmap_name. The hashmap name must be unique and
must contain only letters, numbers, or the underscore character. The name must begin with a letter. You can declare
hashmaps globally in PROC GLOBAL or locally in functions or procedures.
Hashmap values can be numeric or string. By default a hashmap is numeric, but the type can be modified by
specifying the value_type.
Unlike an array, a hashmap's dimensions are not of fixed size because hashmaps dynamically grow or shrink in size as
values are added or removed from the hashmap. A hashmap can have one or more dimensions, and when declaring the
hashmap you must specify the type of each dimension. Each dimension_type can be:
If no dimensions are specified, then the hashmap is created with a single dimension of type all.
When assigning a value to a hashmap, all keys will be created as necessary to store the value. However, when retrieving
a value, if the keys do not exist, you will get a runtime error and the value default or a blank string will be returned. If
you want to assign a default value for undefined keys, you can specify a default_value, which must be either a numeric
constant or a string literal (based on the value type).
Example 1
hashmap string simple_hashmap;
simple_hashmap("Kwanzan") = "Cherry Tree";
simple_hashmap(1603) = "Beginning of Edo period";
errmsg("%s", simple_hashmap(1868)); // runtime error (1868 is an undefined key)
Example 2
HashMap.GetKeys Function
Format
i = hashmap_name.getKeys(list_name 『 , key_value1, ..., key_valueN』 );
Description
The hashmap.getKeys function fills a list with the keys at a specific location in the hashmap. Each key_value argument
must be a numeric or string expression matching the type specified when declaring the hashmap. If no arguments are
provided, the first-dimension keys are returned; if one argument is provided, the second-dimension keys are returned; and
so on.
The argument list_name is a numeric or string list whose contents will be cleared and then filled with the hashmap's
keys. The type of the list must correspond to the type of keys at the specified location. If the dimension type is numeric,
then the list must be a numeric list. If the dimension type is string or all, then the list must be a string list. In the case
of all, any numeric values added as keys to the hashmap will be converted to strings before being inserted in the string
list.
Return Value
The function returns the number of keys added to the list.
Example
HashMap.Contains Function
Format
b = hashmap_name.contains(key_value1『 , ..., key_valueN』 );
Description
The hashmap.contains function returns whether or not a specific key exists in a hashmap. Each key_value argument
must be a numeric or string expression matching the type specified when declaring the hashmap.
Return Value
The function returns a logical value of 1 (true) if the hashmap contains the key, and 0 (false) otherwise.
Example
HashMap.Length Function
Format
i = hashmap_name.length(『 key_value1, ..., key_valueN』 );
Description
The hashmap.length function returns the number of keys at a specific location in the hashmap. Each key_value
argument must be a numeric or string expression matching the type specified when declaring the hashmap. If no
arguments are provided, the number of first-dimension keys is returned; if one argument is provided, the number of
second-dimension keys is returned; and so on.
Return Value
The function returns the number of keys or 0 if no keys exist at the specified location.
Example
HashMap.Remove Function
Format
b = hashmap_name.remove(key_value1『 , ..., key_valueN』 );
Description
The hashmap.remove function removes a specific key from a hashmap. Each key_value argument must be a numeric or
string expression matching the type specified when declaring the hashmap.
Return Value
The function returns a logical value of 1 (true) if the hashmap contained the key and it was removed, and 0 (false) if the
key did not exist.
Example
HashMap.Clear Function
Format
b = hashmap_name.clear();
Description
The hashmap.clear function removes all values from a hashmap.
Return Value
The function returns a logical value of 1 (true).
Example
Description
The list statement creates a list with the name list_name. The list name must be unique and must contain only letters,
numbers, or the underscore character. The name must begin with a letter. You can declare lists globally in PROC GLOBAL
or locally in functions or procedures.
Lists can be numeric or string. By default a list is numeric, but the type can be modified by specifying the list_type.
The initial elements of a list can be set on definition by listing each value, separated by a comma. A list can also be
initialized to the values of another list.
Example
// numeric lists
list DaysPerMonthTypical = 31, 28, 31, 30, 31, 30,
31, 31, 30, 31, 30, 31;
list numeric DaysPerMonthLeapYears = DaysPerMonthTypical;
DaysPerMonthLeapYears(2) = 29;
// string lists
list string MonthNames = "Jan", "Feb", "Mar", "Apr", "May", "Jun",
"Jul", "Aug", "Sep", "Oct", "Nov", "Dec";
See also: List Object, Array Object, HashMap Object, ValueSet Object
List.Add Function
Format
i = list_name.add(value ‖ list_name);
Description
The list.add function adds a single value (the numeric or string expression value) or a list of values (list_name) to the
end of a list. The type of the added value or list must match the type of the receiving list.
Return Value
The function returns the number of the values added to the list. If the list is read-only, the function returns default.
Example
List.Insert Function
Format
i = list_name.insert(index, value ‖ list_name);
Description
The list.insert function inserts a single value (the numeric or string expression value) or a list of values (list_name)
at the position in the list referenced by the numeric expression index. The type of the inserted value or list must match
the type of the receiving list. List index values are one-based, so the index must be between 1 and one past the length of
the list. If the index is 1, the values are inserted at the beginning of the list. If the value is list_name.length() + 1, the
values are inserted at the end of the list (as would happen with the list.add function).
Return Value
The function returns the number of the values inserted into the list. The function returns 0 if the index is not valid. If the
list is read-only, the function returns default.
Example
list string eligible_head_names;
do numeric counter = 1 while counter <= count(PERSON_REC)
if AGE(counter) >= 15 and USUAL_MEMBER(counter) = 1 then
// insert the names in sorted order
numeric insert_index;
do insert_index = 1 while insert_index <= eligible_head_names.length()
and eligible_head_names(insert_index) <= NAME(counter)
enddo;
eligible_head_names.insert(insert_index, NAME(counter));
endif;
enddo;
List.Seek Function
Format
i = list_name.seek(value, 『 starting_index ‖ @nth_occurrence』 );
Description
Pa ge 422 of 684 Lis t Object
The list.seek function returns the one-based index of the first occurrence in the list of value. The type of the value
must match the type of the list. If the numeric expression starting_index is provided, then the function starts searching
the for value starting at the index provided. Alternatively, if the @ symbol precedes the numeric expression
nth_occurrence, the function searches for the nth occurrence of the value.
Return Value
The function returns the one-based index of the value, if found, and 0 if no such value is in the list.
Example
list string keys_to_process;
// ...
if keys_to_process.seek(key(DICT_NAME)) = 0 then
skip case;
endif;
List.Remove Function
Format
b = list_name.remove(index);
Description
The list.remove function removes the value at the position in the list referenced by the numeric expression index. List
index values are one-based, so the index must be between 1 and the length of the list.
Return Value
The function returns a logical value of 1 (true) if the value is successfully removed, and 0 (false) otherwise, which
indicates that the index is not valid. If the list is read-only, the function returns default.
Example
list string eligible_heads;
// ...
numeric eligible_head_index = eligible_heads.show("Select the head");
if eligible_head_index <> 0 and eligible_heads.length() > 1 then
eligible_heads.remove(eligible_head_index);
numeric secondary_head_index = eligible_heads.show("Select someone who could also be a
head");
endif;
See also: List Object, List.Clear Function, List.RemoveDuplicates Function, List.RemoveIn Function
List.RemoveDuplicates Function
Pa ge 423 of 684 Lis t Object
Format
i = list_name.removeDuplicates();
Description
The list.removeDuplicates function removes duplicate values from a list, leaving the first instance of any duplicate in
the list.
Return Value
The function returns the number of duplicates removed from the list. If the list is read-only, the function returns default.
Example
list string women_names;
for POPULATION_RECORD where HH_SEX = 2 and HH_AGE in 12:95 do
women_names.add(HH_NAME);
enddo;
women_names.show("Women in Household");
// possible results: Maria, Maria, Linda, Maria, Pamela, Linda, Maria
numeric duplicates_removed = women_names.removeDuplicates();
women_names.show(maketext("Women in Household (%d duplicates removed)", duplicates_removed));
// possible results (4 duplicates removed): Maria, Linda, Pamela
See also: List Object, List.Clear Function, List.Remove Function, List.RemoveIn Function
List.RemoveIn Function
Format
i = list_name.removeIn(in_list);
Description
The list.removeIn function removes from a list any values that are specified in an in list. The values in in_list must be
numeric for a numeric list and string for a string list. Note that the in operator is case sensitive, so when using a string
list, specifying "Cindy" will not remove "cindy".
Return Value
The function returns the number of values removed from the list. If the list is read-only, the function returns default.
Example 1
list household_ages;
// ...
// values: 69, 57, 55, 33, 22, 44, 34, 5
household_ages.removeIn(0:15, 50:99);
// result: 33, 22, 44, 34
Example 2
See also: List Object, List.Clear Function, List.Remove Function, List.RemoveDuplicates Function
List.Clear Function
Format
b = list_name.clear();
Description
The list.clear function removes all values from a list, setting the length to 0.
Return Value
The function returns a logical value of 1 (true). If the list is read-only, the function returns default.
Example
PROC GLOBAL
list possible_ages;
PROC AGE
possible_ages.clear();
do numeric counter = 1 while counter <= curocc()
possible_ages.add(AGE);
enddo;
See also: List Object, List.Remove Function, List.RemoveDuplicates Function, List.RemoveIn Function
List.Length Function
Format
i = list_name.length();
Description
The list.length function returns the size of a list, indicating the number of values contained by the list.
Return Value
The function returns the length of the list.
Example
List.Show Function
Format
i = list_name.show(『 heading』 );
Description
The list.show function displays the values from a list and returns the index of the operator's selection. An optional
string expression, heading, specifies the title of the window that displays the values. The function is similar to the
accept function but with the options taken from the list's values.
Return Value
The function returns the index of the item selected: 1 for the first value, 2 for the second value, etc. The value 0 is
returned if the escape key (or back button) is pressed and none of the values is chosen.
Example
list string drink_options = "Water", "Milk", "Soda", "Coffee", "Tea";
numeric favorite_drink_index = drink_options.show("What is your favorite drink?");
if favorite_drink_index <> 0 then
errmsg("You like %s the most!", drink_options(favorite_drink_index));
endif;
See also: List Object, List.Sort Function, Accept Function, Show Function, ShowArray Function, ValueSet.Show
Function
List.Sort Function
Format
b = list_name.sort(『 ascending ‖ descending 』 );
Description
The list.sort function sorts the values within a list. An optional argument, ascending or descending, allows for the
specification of the sort order. If not specified, the default sort order is ascending. String values are sorted in case
sensitive order, so "c" is recognized as different from "C".
Return Value
The function returns a logical value of 1 (true). If the list is read-only, the function returns default.
Example
Pa ge 426 of 684 Lis t Object
list string children_names;
for POPULATION_RECORD where HH_RELATIONSHIP = 3 do
children_names.add(HH_NAME);
enddo;
children_names.show("Children in Household (before sort)");
// possible results: Betty, Ellen, Charlene, Donna, Danny, Donnie, Cindy, Elaine
children_names.sort();
children_names.show("Children in Household (after ascending sort)");
// possible results: Betty, Charlene, Cindy, Danny, Donna, Donnie, Elaine, Ellen
children_names.sort(descending);
children_names.show("Children in Household (after descending sort)");
// possible results: Ellen, Elaine, Donnie, Donna, Danny, Cindy, Charlene, Betty
Description
The map statement creates a map with the name map_name. The map name must be unique and must contain only
letters, numbers, or the underscore character. The name must begin with a letter. You can declare maps globally in PROC
GLOBAL or locally in functions or procedures.
Example
// Declare a map
map mymap;
// Add a marker to the map at latitude 38.84839, longitude -76.931098
mymap.addMarker(38.84839, -76.931098);
// Display the map
mymap.show();
Map.Show Function
Format
b = map_name.show();
Description
The map.show function displays the map named map_name. The map replaces the current question(s) on the screen. The
map will continue to be displayed until the user taps on the back button or the map.hide function is called from within a
callback function. The map.show function will not return until the user has finished interacting with the map. For this
reason you should always do the initial map setup (markers, buttons, base map, etc.) before calling map.show. If you call
any of the map functions directly after the call to map.show they will not be executed until after the map has been hidden.
You can, however, make modifications to the map while it is showing from logic in a callback function supplied to one of
the map functions such as setOnClick, addTextButton, addImageButton, setMarkerOnClick,
setMarkerOnClickInfoWindow, or setMarkerOnDrag.
Return Value
The function returns a logical value of 1 (true) if the map is shown successfully and 0 (false) if there is an error.
Example
Map.Hide Function
Format
b = map_name.hide();
Description
The map.hide function hides the map named map_name and returns to data entry. The hide function should be used in a
callback function supplied to one of the map functions such as setOnClick, addTextButton, addImageButton,
setMarkerOnClick, setMarkerOnClickInfoWindow, or setMarkerOnDrag. Calling hide will cause the earlier call to
map.show to complete and return control to the statement following the call to map.show.
Return Value
The function returns a logical value of 1 (true).
Example
Map.AddMarker Function
Format
markerId = map_name.addMarker(latitude, longitude);
Description
The map.addMarker function creates a new marker at the position defined by latitude and longitude and adds the new
marker to the map map_name. Initially the marker will have the default red and black marker icon and no description. You
can customize the marker icon using the function setMarkerText or setMarkerImage. You can add a description of the
marker using setMarkerDescription. The description will appear in the marker list and in a popup when the user taps
on the marker.
Return Value
The function returns the identifier of the new marker which may be used in subsequent calls to removeMarker,
setMarkerText, setMarkerImage, setMarkerDescription setMarkerOnClick, setMarkerOnClickInfoWindow,
setMarkerOnDrag, setMarkerLocation, getMarkerLatitude, or getMarkerLongitude.
Example
// Declare a map
map mymap;
// Add a marker to the map at latitude 38.84839, longitude -76.931098
numeric markerId = mymap.addMarker(38.84839, -76.931098);
// Set marker popup text
mymap.setMarkerDescription(markerId, "CSPro Team at the U.S. Census Bureau");
// Show the map
mymap.show();
Description
The map.removeMarker function removes a marker from the map map_name. markerId should be the id returned by
map.addMarker when the marker was added to the map.
Return Value
The function returns a logical value of 1 (true) if the marker was found and was successfully removed and 0 (false) if there
is an error.
Example
PROC GLOBAL
// Function to allow to modify the marker text, marker description or remove the marker
function editMarker(map m, numeric markerId)
numeric sel = accept("Edit marker", "Text", "Description", "Remove");
if sel = 1 then
string newText = prompt("Enter new icon text");
if newText <> "" then
m.setMarkerText(markerId, newText);
endif;
elseif sel = 2 then
string newDescription = prompt("Enter new icon description");
if newDescription <> "" then
m.setMarkerDescription(markerId, newDescription);
endif;
elseif sel = 3 then
m.removeMarker(markerId);
endif;
end;
PROC SHOW_MAP
preproc
// Declare a map
map mymap;
// Add a marker to the map at latitude 38.84839, longitude -76.931098
numeric markerId = mymap.addMarker(38.84839, -76.931098);
mymap.setMarkerDescription(markerId, "US Census Bureau");
// Call edit marker when user taps on popup info window
mymap.setMarkerOnClickInfoWindow(markerId, editMarker(mymap, markerId));
// Display the map
mymap.show();
Map.ClearMarkers Function
Format
Description
The map.clearMarkers function removes all the markers currently on the map named map_name.
Return Value
The function returns a logical value of 1 (true).
Example
Map.SetMarkerDescription Function
Format
b = map_name.setMarkerDescription(markerId, description);
Description
The map.setMarkerDescription function sets the text shown in the popup info window when the marker is tapped. The
description is also shown in the list of markers displayed below or to the left of the map.
Example
// Declare a map
map mymap;
// Add a marker to the map at latitude 38.84839, longitude -76.931098
numeric markerId = mymap.addMarker(38.84839, -76.931098);
// Set the popup text when marker is tapped to
// display name of head of household and number of household
// members
string popupText = maketext("Head: %s %s Males: %d Females %d",
strip(HEAD_FIRST_NAME),
strip(HEAD_LAST_NAME),
MALE_MEMBERS, FEMALE_MEMBERS);
mymap.setMarkerDescription(markerId, popupText);
// Display the map
mymap.show();
Map.SetMarkerImage Function
Format
b = map_name.setMarkerImage(markerId, pathToImage);
Description
The map.setMarkerImage sets the icon for the marker with id markerId on map map_name. markerId is the id that was
returned by map.addMarker when the marker was added to the map. pathToImage is the path to a file containing the
image to display as the marker icon. Standard image files (PNG, JPEG, BMP) are supported.
If the marker icon was previously set with map.setMarkerText, calling map.setMarkerImage will replace the marker text
with the image.
Return Value
The function returns a logical value of 1 (true) if the marker was found and the image was set successfully and 0 (false) if
there is an error.
Example
// Declare a map
map mymap;
// Add a marker to the map at latitude 38.84839, longitude -76.931098
numeric markerId = mymap.addMarker(38.84839, -76.931098);
// Set the icon
mymap.setMarkerImage(markerId, "resources/marker_icon.png");
// Display the map
mymap.show();
Map.GetMarkerLatitude Function
Format
lat = map_name.getMarkerLatitude(markerId);
Description
The map.getMarkerLatitude function retrieves the latitude of the marker identified by markerId.
Return Value
The function returns the latitude of the marker in degrees if the marker was found and returns default if the marker id is
invalid.
Example
PROC GLOBAL
function onDragMarker(map dragMap, numeric markerId)
numeric lat = dragMap.getMarkerLatitude(markerId);
numeric lon = dragMap.getMarkerLongitude(markerId);
errmsg("Marker was moved to: %f, %f", lat, lon);
end;
PROC SHOW_MAP
preproc
// Declare a map
map dragMap;
// Add a marker to the map at latitude 38.84839, longitude -76.931098
numeric markerId = dragMap.addMarker(38.84839, -76.931098);
// Set function that is called on marker drag
dragMap.setMarkerOnDrag(markerId, onDragMarker(dragMap, markerId));
// Display the map
dragMap.show();
Map.GetMarkerLongitude Function
Format
long = map_name.getMarkerLongitude(markerId);
Description
The map.getMarkerLongitude function retrieves the longitude of the marker identified by markerId.
Return Value
The function returns the longitude of the marker in degrees if the marker was found and returns default if the marker id
Pa ge 434 of 684 Ma p Object
is invalid.
Example
PROC GLOBAL
function onDragMarker(map dragMap, numeric markerId)
numeric lat = dragMap.getMarkerLatitude(markerId);
numeric lon = dragMap.getMarkerLongitude(markerId);
errmsg("Marker was moved to: %f, %f", lat, lon);
end;
PROC SHOW_MAP
preproc
// Declare a map
map dragMap;
// Add a marker to the map at latitude 38.84839, longitude -76.931098
numeric markerId = dragMap.addMarker(38.84839, -76.931098);
// Set function that is called on marker drag
dragMap.setMarkerOnDrag(markerId, onDragMarker(dragMap, markerId));
// Display the map
dragMap.show();
Map.SetMarkerLocation Function
Format
b = map_name.setMarkerLocation(markerId, latitude, longitude);
Description
The map.setMarkerLocation function moves the marker identified by markerId to a location on the map with
coordinates latitude and longitude. latitude and longitude should be given in degrees.
Return Value
The function returns a logical value of 1 (true) if the marker was found and the location was set successfully and 0 (false)
if there is an error.
Example
Map.SetMarkerOnClick Function
Format
b = map_name.setMarkerOnClick(markerId, callbackFunction);
Description
The map.setMarkerOnClick function sets the function that is called when the user taps on the marker identified by
markerId. When the user taps on the marker the user-defined function callbackFunction will be called.
The callbackFunction is the name of a user function defined in the global procedure. The function name may optionally
be followed by function arguments which will be passed to the function when it is run. These arguments are evaluated at
the time that map.setMarkerOnClick is called rather than at the time the callback is run. This allows you to reuse the
same callback function for multiple markers and to customize the behavior of the callback by passing different arguments
to the callback function for each marker.
If no on-click function is set for a marker then when the marker is clicked, the default behavior is to center the view on the
marker and to display the marker's info window. After setting an on-click function with map.setMarkerOnClick function,
the popup info window is not shown when the marker is clicked. If you wish to still display the popup window and also
handle taps on the marker you can use the function map.setMarkerOnClickInfoWindow().
Return Value
The function returns a logical value of 1 (true) if the marker was found and the callback function was set successfully and
0 (false) if there is an error.
Example 1
Example 2
PROC GLOBAL
function launchHouseholdQuestionnaire(string caseid)
pff household_pff;
household_pff.setproperty("AppType", "Entry");
household_pff.setproperty("Key", caseid);
household_pff.setproperty("Application", "Household.ent");
household_pff.setproperty("InputData", "Household.csdb");
household_pff.setproperty("OnExit", "Menu.pff");
household_pff.exec();
end;
PROC SHOW_HOUSEHOLDS_ON_MAP
preproc
// Declare a map
map mymap;
// Loop through all households in the listing dictionary
forcase LIST_DICT do
// Create a marker for household at latitude and longitude from listing file
numeric markerId = mymap.addMarker(LI_LATITUDE, LI_LONGITUDE);
// Set the marker text to the household number
mymap.setMarkerText(markerId, maketext("%v", LI_HOUSEHOLD_NUMBER));
// When marker is clicked launch household questionnaire application
// and prefill the ID items
string householdCode = key(LIST_DICT);
mymap.setMarkerOnClick(markerId, launchHouseholdQuestionnaire(householdCode));
endfor;
// Show the map
mymap.show();
Description
The map.getLastClickLatitude function retrieves the latitude of the last point on the map that the user tapped. This is
often used in the callback function passed to map.setOnClick to get the coordinates of the point that was just tapped.
The tapped latitude is only recorded if a user taps on the map itself rather than on a marker.
Return Value
The function returns the latitude of the tapped point in degrees or notappl if the user has not tapped a point on the map.
Example
PROC GLOBAL
// Declare a map
map mymap;
// Declare id for map marker
numeric markerId;
// This function is called when user taps on map.
// It moves the marker to the location that the user tapped.
function mapClicked()
numeric lat = mymap.getLastClickLatitude();
numeric lon = mymap.getLastClickLongitude();
mymap.setMarkerLocation(markerId, lat, lon);
end;
PROC SHOW_MAP
preproc
// Add a marker to the map at latitude 38.84839, longitude -76.931098
markerId = mymap.addMarker(38.84839, -76.931098);
// Set function that is called when the user taps on the map to move the marker
// to the new position
mymap.setOnClick(mapClicked());
// Display the map
mymap.show();
Map.GetLastClickLongitude Function
Format
long = map_name.getLastClickLongitude();
Description
The map.getLastClickLongitude function retrieves the longitude of the last point on the map that the user tapped. This
Return Value
The function returns the longitude of the tapped point in degrees or notappl if the user has not tapped a point on the
map.
Example
PROC GLOBAL
// Declare a map
map mymap;
// Declare id for map marker
numeric markerId;
// This function is called when user taps on map.
// It moves the marker to the location that the user tapped.
function mapClicked()
numeric lat = mymap.getLastClickLatitude();
numeric lon = mymap.getLastClickLongitude();
mymap.setMarkerLocation(markerId, lat, lon);
end;
PROC SHOW_MAP
preproc
// Add a marker to the map at latitude 38.84839, longitude -76.931098
markerId = mymap.addMarker(38.84839, -76.931098);
// Set function that is called when the user taps on the map to move the marker
// to the new position
mymap.setOnClick(mapClicked());
// Display the map
mymap.show();
Map.SetMarkerOnClickInfoWindow Function
Format
b = map_name.setMarkerOnClickInfoWindow(markerId, callbackFunction);
Description
The map.setMarkerOnClick function sets the function that is called when the user taps on the popup info window for the
marker identified by markerId. The popup info window is shown when the user taps on the marker and it displays the
marker description set by calling map.setMarkerDescription. After tapping on the marker to display the info window,
when the user then taps on the popup, the user-defined function callbackFunction will be called.
The callbackFunction is the name of a user function defined in the global procedure. The function name may optionally
be followed by function arguments which will be passed to the function when it is run. These arguments are evaluated at
the time that map.setMarkerOnClickInfoWindow is called rather than at the time the callback is run. This allows you to
reuse the same callback function for multiple markers and to customize the behavior of the callback by passing different
arguments to the callback function for each marker.
Example 1
PROC GLOBAL
function clickedOnMarkerInfoWindow()
errmsg("You clicked on a marker popup");
end;
PROC SOMEITEM
// Declare a map
map mymap;
// Add a marker to the map at latitude 38.84839, longitude -76.931098
numeric markerId = mymap.addMarker(38.84839, -76.931098);
// Set function that is called on marker info window click
mymap.setMarkerOnClick(markerId, clickedOnMarkerInfoWindow());
// Display the map
mymap.show();
Example 2
See also: Map Object, Map.AddMarker Function, Map.SetMarkerOnClick Function, Map.SetMarkerDescription Function
Map.SetMarkerOnDrag Function
Format
b = map_name.setMarkerOnDrag(markerId, callbackFunction);
Description
The map.setMarkerOnDrag function sets the function that is called when the user drags the marker on the map marker
identified by markerId. Markers may be dragged by first long pressing on them and then sliding them to a new position.
The callbackFunction is called at the end of the drag when the user lifts up their finger to release the marker.
The callbackFunction is the name of a user function defined in the global procedure. The function name may optionally
be followed by function arguments which will be passed to the function when it is run. These arguments are evaluated at
the time that map.setMarkerOnDrag is called rather than at the time the callback is run. This allows you to reuse the
same callback function for multiple markers and to customize the behavior of the callback by passing different arguments
to the callback function for each marker.
Return Value
Example
PROC GLOBAL
function onDragMarker(map dragMap, numeric markerId)
numeric lat = dragMap.getMarkerLatitude(markerId);
numeric lon = dragMap.getMarkerLongitude(markerId);
errmsg("Marker was moved to: %f, %f", lat, lon);
end;
PROC SHOW_MAP
preproc
// Declare a map
map dragMap;
// Add a marker to the map at latitude 38.84839, longitude -76.931098
numeric markerId = dragMap.addMarker(38.84839, -76.931098);
// Set function that is called on marker drag
dragMap.setMarkerOnDrag(markerId, onDragMarker(dragMap, markerId));
// Display the map
dragMap.show();
See also: Map Object, Map.AddMarker Function, Map.SetMarkerOnClick Function, Map.GetMarkerLatitude Function,
Map.GetMarkerLongitude Function
Map.SetMarkerText Function
Format
b = map_name.setMarkerText(markerId, text『 , backgroundColor, textColor』 ]);
Description
The map.setMarkerText sets the icon for the marker with id markerId on map map_name to an icon with the specified
text and color. The icon will contain the text surrounded by a small outline. markerId is the id that was returned by
map.addMarker when the marker was added to the map. text is the phrase to display as the marker icon. You can
optionally set the color of the text and the background color of the icon by specifying the backgroundColor and
textColor. Colors can be given by the names of common colors ("blue", "red", etc.) or using an HTML RGB color
specification that starts with "#" such as "#404040". If no colors are specified the text will be black on a white
background.
map.setMarkerText sets the text used in the marker icon. The icon text should be kept short to avoid making the map
crowded. To associate a longer description with a marker use map.setMarkerDescription instead to display additional
text in the marker list and in a popup on the map when the user taps the marker.
If the marker icon was previously set with map.setMarkerImage, calling map.setMarkerText will replace the marker
image with the new text.
Return Value
The function returns a logical value of 1 (true) if the marker was found and the text was set successfully and 0 (false) if
there is an error.
Example
// Declare a map
map mymap;
// Add a marker to the map at latitude 38.84839, longitude -76.931098
numeric markerId = mymap.addMarker(38.84839, -76.931098);
// Set the icon
mymap.setMarkerText(markerId, maketext("%v", HH_NUMBER), "white", "black");
// Display the map
mymap.show();
Map.SetOnClick Function
Format
b = map_name.setOnClick(markerId, callbackFunction);
Description
The map.setOnClick function sets the function that is called when the user taps on the base map away from any
markers. The callbackFunction is the name of a user function defined in the global procedure that will be called when
the user taps on the map. The function name may optionally be followed by function arguments which will be passed to
the function when it is run. These arguments are evaluated at the time that map.setOnClick is called rather than at the
time the callback is run.
To retreive the coordinates of the map location that was tapped use the functions map.getLastClickLatitude and
map.getLastClickLongitude.
If the user taps on a marker rather than on the map itself, callbackFunction is not called. To capture taps on markers
use the map.setMarkerOnClick function.
Return Value
The function returns a logical value of 1 (true).
Example
Map.AddImageButton Function
Format
buttonId = map_name.addImageButton(pathToImage, callbackFunction);
Description
The map.addImageButton function adds a new button to the map with the icon from the image file pathToImage.
pathToImage is the path to a file containing the image to display on the button. Standard image files (PNG, JPEG,
BMP) are supported. Map buttons are displayed on the right hand-side of the map. The new button will be added below
any existing buttons. When the user taps on the button, the user-defined function callbackFunction will be called.
The callbackFunction is the name of a user function defined in the global procedure. The function name may optionally
be followed by function arguments which will be passed to the function when it is run. These arguments are evaluated at
the time that map.addTextButton is called rather than at the time the callback is run. This allows you to reuse the same
callback function for multiple buttons and to customize the behavior of the callback by passing different arguments to the
callback function for each button.
Return Value
The function returns the identifier of the new button which may be used in subsequent calls to removeButton.
Example
Map.AddTextButton Function
Format
buttonId = map_name.addTextButton(text, callbackFunction);
Description
The map.addTextButton function adds a new button to the map with the label text. Map buttons are displayed on the
right-hand side of the map. The new button will be added below any existing buttons. When the user taps on the button
the user-defined function callbackFunction will be called.
The callbackFunction is the name of a user function defined in the global procedure. The function name may optionally
be followed by function arguments which will be passed to the function when it is run. These arguments are evaluated at
the time that map.addTextButton is called rather than at the time the callback is run. This allows you to reuse the same
callback function for multiple buttons and to customize the behavior of the callback by passing different arguments to the
callback function for each button.
Return Value
The function returns the identifier of the new button which may be used in subsequent calls to removeButton.
Example
Map.RemoveButton Function
Format
b = map_name.removeButton(buttonId);
Description
The map.removeMarker function removes a marker from the map map_name. buttonId should be the id returned by
map.addTextButton or map.addImageButton when the button was added to the map.
Return Value
The function returns a logical value of 1 (true) if the button was found and was successfully removed and 0 (false) if there
is an error.
Example
PROC GLOBAL
// Declare a map
map mymap;
// Id of button that will be shown on map
numeric buttonId;
// Function to remove a button from a map
function removeButton()
mymap.removeButton(buttonId);
end;
PROC SHOW_MAP
preproc
// Add a button that removes itself when it is clicked
buttonId = mymap.addTextButton("remove me", removeButton());
// Display the map
mymap.show();
Map.ClearButtons Function
Format
b = map_name.clearButtons();
Description
The map.clearButtons function removes all the buttons currently on the map named map_name.
Return Value
The function returns a logical value of 1 (true).
Example
PROC GLOBAL
numeric showAddHousehold = true;
numeric showFilter = true;
// User function that will be called on button click
// to update the buttons
function updateButtons(map myMap)
// Clear existing buttons
myMap.clearButtons();
if showAddHousehold then
myMap.addTextButton("Add", addHousehold(myMap));
endif;
if showFilterButton then
myMap.addTextButton("Filter", filterHouseholds(myMap));
endif;
myMap.addTextButton("Done", closeMap(myMap));
end;
See also: Map Object, Map.AddTextButton Function, Map.AddImageButton Function, Map.ClearMarkers Function
Map.SetBaseMap Function
Format
b = map_name.setBaseMap(mapType ‖ pathToOfflineMap);
Description
The map.setBaseMap function changes the base map that is displayed for the map map_name. Maps can use either
online base maps, which requires an Internet connection to load maps, or can use offline base maps, which requires a
map file stored on the device.
The first form of the function sets the base map to one of the online map types based on the value of mapType:
Normal Standard road map showing streets and outlines of some buildings.
The second form uses an offline map in a file named named pathToOfflineMap stored on the device. For details on
supported file types for offline maps see offline maps.
Return Value
The function returns a logical value of 1 (true) if the base map was changed successfully and 0 (false) otherwise (for
example, if the offline map is missing or invalid).
Map.SetTitle Function
Format
b = map_name.setTitle(title);
Description
The map.setTitle function sets the text that is displayed directly above the map to the value of the string expression
title. This can be used to display an instruction to the user such as "Tap on the map at the location of the household."
By default the instruction is empty and is not shown. To remove the title call map.setTitle("") function with an empty
string.
Return Value
The function returns a logical value of 1 (true).
Example
Map.ShowCurrentLocation Function
Format
b = map_name.showCurrentLocation(show_condition);
Description
The map.showCurrentLocation function enables or disables display of the device's current location on the map
map_name. If show_condition is non-zero, the current location is displayed on the map as a blue dot. As the user moves
around, the blue dot also moves to follow the user's position. In addition, there is a button in the top left corner of map
that zooms the map to the user's current location. If show_location is zero, both the blue dot and associated button are
hidden. Display of the current location is enabled by default.
Return Value
The function returns a logical value of 1 (true).
Example
// Declare a map
map mymap;
// Disable display of current location
mymap.showCurrentLocation(0);
// Display the map
mymap.show();
Map.ZoomTo Function
Format (Point)
b = map_name.zoomTo(latitude, longitude『 , zoom_level』 );
Format (Rectangle)
b = map_name.zoomTo(min_latitude, min_longitude, max_latitude, max_longitude『 , padding』 );
Description
The first form moves the map so that the point with coordinates (latitude, longitude) is at the center of the screen. If the
optional zoom_level is specified, the map is also zoomed to the specified zoom level. zoom_level is a number
representing the level of detail to show. Zoom level one fits the entire world on the screen and each consecutive zoom
level is twice as detailed as the preceding level. Zoom level 15 shows streets and zoom level 20 shows individual
buildings. The minimum and maximum zoom levels supported will depend on the base map being used. If no zoom level
is specified, the current zoom level is maintained.
The second form pans and zooms the map to fit the rectangular region of the map with corners at (min_latitude,
min_longitude) and (max_latitude, max_longitude). If the optional padding is specified, the map will be zoomed less
so that the rectangular region will fit onto the screen with extra padding on each side. This can be used so that the
edges of the rectangle do not line up exactly with the edge of the screen. The padding is specified as a percentage of
the width of the screen. A padding of 5 would add padding equal to 5% of the device screen width.
If no initial view is set by calling map.zoomTo before the map is shown, then the initial view is determined as follows:
If there are any markers, the view is set to fit all of the markers onto the screen.
Otherwise, if an offline map file that specifies an extent is used as a base map, the view is set to the extent of the
offline map file.
Otherwise, the initial view is the entire world.
Return Value
The function returns a logical value of 1 (true) if the map is zoomed/panned and 0 (false) if there is an error (for example if
the rectangle is invalid).
Example 1
// Declare a map
map mymap;
// Set initial view to center on the point 38.84839,-76.931098 at street level
mymap.zoomTo(-15.430227, 28.308464, 15);
// Display the map
mymap.show();
Example 2
// Declare a map
map mymap;
// Set initial view to fit a rectangular region with 15% padding
mymap.zoomTo(38.841546, -76.937428, 38.853679, -76.911550, 15);
// Display the map
mymap.show();
Offline Maps
The default base map shown is the Google Maps road map which requires an Internet connection. You can instead use
an offline map from a file stored on your device. Offline map files store map tiles which are images of pieces of a map at
various levels of detail. Rather than downloading map images from Google Maps, CSPro will read the images directly
from an offline map file.
MBTiles An open format orginally developed by Mapbox but now widely used by many other systems.
MBTiles can be exported from QGIS. You can also download MBTiles files created from Open
Street Map data from the Humanitarian Open Street Maps project at
https://www.hotosm.org. Only MBTiles containing image tiles (PNG or JPG) are supported.
Vector tiles are not supported.
ArcGIS Tile Package Tile packages exported from ArcGIS. See the ArcGIS Help for details. Tile packages must be
created using the ArcGIS Online/Bing Maps/Google Maps tiling scheme to be compatible with
CSPro.
File Size
Since offline map files may contain hundreds or thousands of images, they can become very large. The size of the file
will depend on the extents of the map and the level of detail. It is not possible to store a map of the entire world, or even
of an entire country at a high level of detail on most devices. When creating your map files you will need to adjust the
extents and level of detail to ensure that the map files will not exceed the available storage space on your device. If your
survey will be done over a large geographic area you may need to use separate maps for each interviewer or each team
that cover only their assigned area.
The level of detail of an offline map is a big component of the file size. Levels of detail in offline maps start at zero where
the entire world is covered by a single image tile. Each successive level contains four times the number of images as the
previous level so level one contains four tiles, level two contains 16 tiles, level 3 contains 64 tiles, etc. Level 20 is detailed
enough to see individual buildings. At level 20 the entire world requires over a trillion tiles. By removing the last level of
detail of your map you reduce the size of the file by close to two-thirds.
Pa ge 451 of 684 Pa th
Path
Path Namespace
The path namespace is a set of functions used to perform operations on a path or to get information from a path
specification.
Functionality
Function Description
path.concat Combines multiple strings into one string representing a file path.
path.getDirectoryName Extracts the directory name from a file path.
path.getExtension Extracts the file extension from a file path.
path.getFileName Extracts the file name from a file path.
path.getFileNameWithoutExtension Extracts the file name without its extension from a file path.
Example
string data_path = "C:\CSPro Data Files\Region 1\COVID-19 Survey.csdb";
string data_directory = path.getDirectoryName(data_path);
// result: C:\CSPro Data Files\Region 1\
string data_filename = path.getFilename(data_path);
// result: COVID-19 Survey.csdb
string data_extension = path.getExtension(data_path);
// result: .csdb
string data_filename_without_extension = path.getFilenameWithoutExtension(data_path);
// result: COVID-19 Survey
Path.Concat Function
Format
s = path.concat(path_string1『 , ..., path_stringN』 );
Description
The path.concat function concatenates the values of multiple string expressions (path_string1 + ... + path_stringN)
into one string representing a file path. The first argument can be either a string expression or one of the arguments used
in the pathname function. The combined paths are returned as an absolute, not relative, path.
Return Value
The function returns the concatenated path string.
Example
Pa ge 452 of 684 Pa th
string image_filename = path.concat(Application, "../Images", maketext("%v%v.jpg", CLUSTER,
HHNO));
Path.GetDirectoryName Function
Format
s = path.getDirectoryName(path_string);
Description
The path.getDirectoryName function returns the directory name for the path specified in the string expression
path_string.
Return Value
The function returns a string with the directory name extracted from the full path.
Example
string data_path = "C:\CSPro Data Files\Region 1\COVID-19 Survey.csdb";
string data_directory = path.getDirectoryName(data_path);
// result: C:\CSPro Data Files\Region 1\
string data_filename = path.getFilename(data_path);
// result: COVID-19 Survey.csdb
string data_extension = path.getExtension(data_path);
// result: .csdb
string data_filename_without_extension = path.getFilenameWithoutExtension(data_path);
// result: COVID-19 Survey
Path.GetExtension Function
Format
s = path.getExtension(path_string);
Description
The path.getExtension function returns the file extension for the path specified in the string expression path_string.
The extension will begin with a ".". If the file does not have an extension, a blank string is returned.
Return Value
The function returns the extension extracted from the full path.
Example
Pa ge 453 of 684 Pa th
string data_path = "C:\CSPro Data Files\Region 1\COVID-19 Survey.csdb";
string data_directory = path.getDirectoryName(data_path);
// result: C:\CSPro Data Files\Region 1\
string data_filename = path.getFilename(data_path);
// result: COVID-19 Survey.csdb
string data_extension = path.getExtension(data_path);
// result: .csdb
string data_filename_without_extension = path.getFilenameWithoutExtension(data_path);
// result: COVID-19 Survey
Path.GetFileName Function
Format
s = path.getFileName(path_string);
Description
The path.getFileName function returns the file name for the path specified in the string expression path_string. The file
name will include the extension of the file; to get the file name without the extension, use the
path.getFileNameWithoutExtension function.
Return Value
The function returns a string with the directory name removed from the full path.
Example
string data_path = "C:\CSPro Data Files\Region 1\COVID-19 Survey.csdb";
string data_directory = path.getDirectoryName(data_path);
// result: C:\CSPro Data Files\Region 1\
string data_filename = path.getFilename(data_path);
// result: COVID-19 Survey.csdb
string data_extension = path.getExtension(data_path);
// result: .csdb
string data_filename_without_extension = path.getFilenameWithoutExtension(data_path);
// result: COVID-19 Survey
Path.GetFileNameWithoutExtension Function
Format
s = path.getFileNameWithoutExtension(path_string);
Description
Pa ge 454 of 684 Pa th
The path.getFileNameWithoutExtension function returns the file name for the path specified in the string expression
path_string. The file name will not include the extension of the file; to get the file name with the extension, use the
path.getFileName function.
Return Value
The function returns a string with the directory name and extension removed from the full path.
Example
string data_path = "C:\CSPro Data Files\Region 1\COVID-19 Survey.csdb";
string data_directory = path.getDirectoryName(data_path);
// result: C:\CSPro Data Files\Region 1\
string data_filename = path.getFilename(data_path);
// result: COVID-19 Survey.csdb
string data_extension = path.getExtension(data_path);
// result: .csdb
string data_filename_without_extension = path.getFilenameWithoutExtension(data_path);
// result: COVID-19 Survey
Description
The pff statement creates a dynamic pff with the name pff_name. The pff name must be unique and must contain only
letters, numbers, or the underscore character. The name must begin with a letter. You can declare pffs globally in PROC
GLOBAL or locally in functions or procedures.
Example
PROC INTERVIEWER_MENU
// run the listing program in the interviewer's assigned cluster
if INTERVIEWER_MENU = 1 then
pff listing_pff;
listing_pff.load("Listing.pff");
listing_pff.setproperty("Key", maketext("%v", CLUSTER));
listing_pff.exec();
endif;
Pff.Load Function
Format
b = pff_name.load(pff_filename ‖ application_name);
Description
The pff.load function loads the contents from an existing PFF file into a pff. If supplying a string expression,
pff_filename, the properties from that PFF will be loaded. If supplying an application_name (the name of your
application, which generally ends with _FF), then the PFF used to run the current application will be loaded.
Return Value
The function returns a logical value of 1 (true) if the PFF file exists and was successfully loaded and 0 (false) otherwise.
Example 1
// run the household data entry application but change the name
// of the data file to the current cluster
pff household_pff;
household_pff.load("Household.pff");
household_pff.setproperty("InputData", maketext("%v.csdb", CLUSTER));
household_pff.exec();
Pff.Save Function
Format
b = pff_name.save(pff_filename);
Description
The pff.save function saves the contents of the pff object to the disk to a PFF file with the name supplied by the string
expression pff_filename. Because pff objects can be executed without saving them, this function is not necessary, but
may be useful if you want a copy of your PFF file on the disk.
Return Value
The function returns a logical value of 1 (true) if the PFF was successfully saved and 0 (false) otherwise.
Example
// create PFFs for running the listing application for each province
pff listing_pff;
listing_pff.load("Listing.pff");
do numeric counter = 1 while counter <= PROVINCE_VS.length()
listing_pff.setproperty("Description", maketext("List in %s", PROVINCE_VS.labels(counter)));
listing_pff.setproperty("Key", maketext("%02d", PROVINCE_VS.codes(counter)));
string new_pff_filename = maketext("Listing%02d.pff", PROVINCE_VS.codes(counter));
listing_pff.save(new_pff_filename);
enddo;
Pff.GetProperty Function
Format
s = pff_name.getproperty(property_name『 , string_list』 );
Description
Pa ge 457 of 684 Pff Object
The pff.getproperty function returns a string containing the current value associated with the pff's property specified
by the string expression property_name. PFF properties all have default values, so even if a property was not explicitly
defined in the PFF file, it may have a default value. Some properties, such as input data filenames for batch applications,
can have multiple values, and if you want to retrieve all of these values, you must supply a string_list, which will contain
zero, one, or more strings with the values.
Properties names can be prefixed with the section name, as explained in pff.setproperty, but it is generally not
necessary to do so when using this function.
The function can be used similarly to sysparm to retrieve properties defined in a PFF's [Parameters] section.
Return Value
The function returns a string with the property information. If the property is invalid, a blank string is returned.
Example
// show our custom report unless the PFF specifies ViewListing=Never
pff this_application_pff;
this_application_pff.load(CENSUS_FF);
if this_application_pff.getproperty("ViewListing") <> "Never" then
view("Disability Statistics.csv");
endif;
Pff.SetProperty Function
Format
b = pff_name.setproperty(property_name, property_value ‖ string_list);
Description
The pff.setproperty function modifies the value associated with the pff's property specified by the string expression
property_name. The property will be modified to the value given in the string expression property_value. If the
property_value is blank, the property will be reset to its default value. Alternatively, a string_list can be supplied
containing values for properties that have multiple values (such as input data filenames).
Properties names can be prefixed with the section name. For example, if you load an existing PFF that has a persistent
ID value defined for the item PROVINCE, you can code:
pff_name.setproperty("PROVINCE", "50");
However, if no such property was defined in the PFF, the pff.setproperty function will not know whether you are
modifying a value in the [DataEntryIds] section, the [ExternalFiles] section, or another section. To avoid this ambiguity,
when setting new custom properties, you can specify the section name when setting the property:
pff_name.setproperty("DataEntryIds.PROVINCE", "50");
Return Value
The function returns a logical value of 1 (true) because it will always successfully set a property. If the property name is
unknown, then the value is assigned to a new parameter (defined in a PFF's [Parameters] section).
Example 1
Example 2
PROC REPORT_SELECTION
// run the batch application to generate a report on...
pff report_pff;
report_pff.load("..\Report\Report.pff");
// ...the data file for the selected province and region
if REPORT_SELECTION = 1 then
report_pff.setproperty("InputData", maketext("..\Data\%v%v.csdb", PROVINCE, REGION));
// ...the data files for all the regions in the selected province
elseif REPORT_SELECTION = 2 then
report_pff.setproperty("InputData", maketext("..\Data\%v*.csdb", PROVINCE));
// ...the data files for the entire country
else
list string data_files_listing;
dirlist(data_files_listing, "..\Data", "*.csdb", recursive);
report_pff.setproperty("InputData", data_files_listing);
endif;
report_pff.exec();
Pff.Exec Function
Format
b = pff_name.exec();
Description
The pff.exec function starts another CSPro application using the properties defined in the pff object. If you modified any
of the pff object's properties, it is not necessary to save the file to disk when running it, as this function will save a
temporary file to the disk with the defined properties.
For data entry PFFs, the function executes the application and immediately closes the current application (without
saving any data). This is the same as running execpff with the stop argument. For other PFFs, the function executes
the application using the wait argument. That is, this function is equivalent to:
pff my_pff;
// ...
if my_pff.getproperty("AppType") = "Entry" then
execpff(filename(my_pff), stop);
else
execpff(filename(my_pff), wait);
endif;
execpff(filename(my_pff), wait);
Return Value
The function returns a logical value of 1 (true) if the new application was started successfully and 0 (false) otherwise.
Example
PROC INTERVIEW_SELECTION_MENU
pff application_pff;
// conduct the main household interview
if INTERVIEW_SELECTION_MENU = 1 then
application_pff.load("..\Household\Household.pff");
// conduct the agriculture interview
else
application_pff.load("..\Agriculture\Agriculture.pff");
endif;
// set the key to the household the user selected
application_pff.setproperty("Key", SELECTED_HOUSEHOLD_KEY);
// run the program
application_pff.exec();
Description
The SystemApp statement creates a SystemApp object with the name system_app_name. The SystemApp name must
be unique and must contain only letters, numbers, or the underscore character. The name must begin with a letter. You
can declare SystemApp objects globally in PROC GLOBAL or locally in functions or procedures.
Example
// play the training video in Windows Media Player
SystemApp windows_media_player;
windows_media_player.setArgument(pathconcat("../Videos/Census Training.mp4"));
windows_media_player.exec("wmplayer.exe");
SystemApp.SetArgument Function
Format
b = system_app_name.setArgument(argument『 , value』 );
Description
The SystemApp.setArgument function adds an argument to a SystemApp object. The argument will later be used when
the system application is executed. The string expression argument and the optional string or numeric expression
value are processed differently on Android and Windows platforms.
Return Value
The function returns a logical value of 1 (true).
Android Example
On Android, a system application is typically an Android activity. Each argument will be part of the "extras" passed in
the Intent Bundle that opens the activity. If value is a string expression, it will be put as a String extra associated with
the name argument. If value is a numeric expression, it will be put as a double extra associated with the name
argument. If value is not specified, a null String extra will be used. There is special processing for three argument
types:
Windows Example
On Windows, a system application is an executable program. On Windows, every argument is turned into a command
line argument passed to the executable. If an argument is provided without a corresponding value and the argument
contains spaces and is not wrapped in quotes, the argument will be surrounded by quotes so that it is processed as a
single argument by the executable. If a value is provided, then the argument and value are concatenated and will not be
surrounded by quotes. If value is numeric, it will be converted to a string.
SystemApp.Exec Function
Format
// Android format
b = system_app_name.exec(『 package_name『 , activity_name』 』 );
// Windows format
b = system_app_name.exec(executable_name);
Description
The SystemApp.exec function executes a system application associated with the SystemApp object. Any arguments
set using the SystemApp.setArgument function are passed to the system application. On Android, a system application
can return results which can be queried with the SystemApp.getResult function.
Return Value
The function returns a logical value of 1 (true) if the system application is started successfully and 0 (false) otherwise.
Android Example
On Android, the optional string expression package_name specifies the name of the Android application (package) that
will be executed. An optional string expression activity_name specifies the name of the activity within the package that
should be opened. If no arguments are provided, then the arguments specified using SystemApp.setArgument will be
processed by the Android operating system and, if there is a way to handle these arguments, then an appropriate
application will be opened.
Windows Example
On Windows, a system application is an executable specified by the string expression executable_name. Typically the
name is the full path of the executable, but for applications that have registered their name in the Windows Registry, you
only need to provide the executable file name.
SystemApp.GetResult Function
Format
s = system_app_name.getResult(argument);
Description
The SystemApp.getResult function returns a result from the system application that was executed using the
SystemApp object. The function only works for system applications executed on Android devices.
Return Value
The function returns a string value with the result associated with the string expression argument. If no result exists,
then the function returns a blank string.
Android Example
SystemApp fingerprint_reader;
fingerprint_reader.setArgument("prompt", maketext("Please take the fingerprints for %s",
strip(NAME)));
fingerprint_reader.exec("com.example.entry_add_ons",
"com.example.entry_add_ons.FingerPrintReaderActivity");
FINGERPRINT_FILENAME = fingerprint_reader.getResult("filename");
SystemApp.Clear Function
Format
b = system_app_name.clear();
Description
The SystemApp.clear function clears any stored arguments and results associated with a SystemApp object.
Example
PROC GLOBAL
SystemApp google_maps;
PROC LOCATION
// clear any previously used arguments
google_maps.clear();
// show barber shops near the location
google_maps.setArgument("action", "android.intent.action.VIEW");
google_maps.setArgument("data", maketext("geo:%f,%f?q=barber+shop", LATITUDE, LONGITUDE));
google_maps.exec("com.google.android.apps.maps");
Description
The valueset statement creates a dynamic value set with the name valueset_name. The value set name must be
unique and must contain only letters, numbers, or the underscore character. The name must begin with a letter. You can
declare value sets globally in PROC GLOBAL or locally in functions or procedures.
Value sets can be numeric or string. By default a value set is numeric, but the type can be modified by specifying the
valueset_type. When associating a value set with an item using setvalueset, the type must match the item type
(numeric for numeric items, and string for alphanumeric items).
Example
PROC MENU
onfocus
valueset menu_valueset;
// interviewer options
if USER_ROLE = 1 then
menu_valueset.add("Add New Household", 1);
menu_valueset.add("View Existing Households", 2);
// supervisor options
else
menu_valueset.add("View Household Report", 3);
endif;
menu_valueset.add("Quit", 99);
setvalueset(MENU, menu_valueset);
ValueSet.Add Function
Format
i = valueset_name.add(label『 , image_filename』 , from_code『 , to_code ‖ special_value』 );
Description
The valueset.add function adds one or more values to the end of a value set. The type of the added code or value set
must match the type of the receiving value set.
Pa ge 465 of 684 Va lue Set Object
In the first version, you provide a string expression label and either a numeric or string expression from_code, which
must match the type of the value set. An optional string expression, image_filename, provides the filename of an image
to be shown as a value set image. For numeric value sets, it is possible to specify a numeric expression to_code, which
creates a range, or a special_value, which is then associated with the from_code.
In the second version, you provide a valueset_name. This adds all of the values from that value set to the receiving value
set. If you specify a from_code or to_code, then only the values in that range will be added to the receiving value set.
Return Value
The function returns the number of the values added to the value set. If the value set is read-only, the function returns
default.
Example 1
PROC SUPERVISOR_MENU
onfocus
valueset string menu_valueset = SUPERVISOR_MENU_VS;
if CAN_SYNC = 1 then
menu_valueset.add(SYNC_MENU_VS);
endif;
forcase HOUSEHOLD_DICT do
menu_valueset.add("Review " + getcaselabel(HOUSEHOLD_DICT), key(HOUSEHOLD_DICT));
endfor;
setvalueset(SUPERVISOR_MENU, menu_valueset);
Example 2
PROC OCCUPATION_MINOR
onfocus
numeric first_occupation_code = OCCUPATION_MAJOR * 100;
numeric last_occupation_code = first_occupation_code + 99;
valueset occupation_minor_valueset;
occupation_minor_valueset.add(OCCUPATION_MINOR_VS1, first_occupation_code,
last_occupation_code);
setvalueset(OCCUPATION_MINOR, occupation_minor_valueset);
ValueSet.Remove Function
Format
i = valueset_remove.remove(code);
Description
The valueset.remove function removes all values from a value set that have a code matching the specified numeric or
string expression code. The type of the code must match the type of the value set.
When using a numeric value set with ranges—both from and to codes are defined—or defined special values, the
function will remove a value only if it is the from value or if it is the special value.
Example
PROC SECOND_FAVORITE_DRINK
preproc
valueset drink_valueset = FAVORITE_DRINK_VS;
drink_valueset.remove(FAVORITE_DRINK);
setvalueset(SECOND_FAVORITE_DRINK, drink_valueset);
ValueSet.Clear Function
Format
b = valueset_name.clear();
Description
The valueset.clear function removes all values from a value set.
Return Value
The function returns a logical value of 1 (true). If the value set is read-only, the function returns default.
Example
PROC GLOBAL
valueset name_valueset;
PROC LINE_NUMBER_OF_MOTHER
onfocus
name_valueset.clear();
// construct value set...
ValueSet.Length Function
Format
i = valueset_name.length();
Description
Return Value
The function returns the length of the value set.
Example
PROC HOUSEHOLD_SELECTION
onfocus
valueset households_vs;
// ...
if households_vs.length() = 0 then
errmsg("There are no households to select");
reenter MAIN_MENU;
endif;
ValueSet.Randomize Function
Format
b = valueset_name.randomize(『 exclude(exclude_code1, ..., exclude_codeN)』 );
Description
The valueset.randomize function scrambles the order of values in the value set. The function is useful when using
extended controls in a data entry application. Some survey applications choose to display the possible response
categories in a randomized way so as to minimize an enumerator or respondent's selection bias.
An optional exclusion list, exclude_code1, ..., exclude_codeN, allows you to prevent certain values from being given a
random order. This is useful for variables like Don't Know, which, after the randomization, would remain at the original
location in the list of values.
You can use the seed function to initialize the random number generation.
Return Value
The function returns a logical value of 1 (true).
Example
FAVORITE_FOOD_GROUP_VS1.randomize(exclude(5));
See also: ValueSet Object, Random Function, RandomIn Function, Seed Function, ValueSet.Sort Function
ValueSet.Show Function
Format
d = valueset_name.show(『 heading』 );
Description
The valueset.show function displays the labels from a value set and returns the code (or index) of the operator's
selection. An optional string expression, heading, specifies the title of the window that displays the values. The function
is similar to the accept function but with the options taken from the value set's labels. For numeric value sets, the
returned value corresponds with the selected label's code. For a string value set, the returned value is the index of the
selected label.
Return Value
The function returns the code (or index) of the item selected. If the selected value contains a special value, then that
value is returned. The value 0 is returned if the escape key (or back button) is pressed and none of the values is chosen.
Example
valueset respondent_query;
do numeric counter = 1 while counter <= count(NAME)
respondent_query.add(NAME(counter), LINE_NUMBER(counter));
enddo;
numeric respondent_line_number = respondent_query.show("Who in the household is responding to
questions?");
See also: ValueSet Object, Accept Function, Show Function, ShowArray Function, List.Show Function
ValueSet.Sort Function
Format
b = valueset_name.sort(『 ascending ‖ descending 』 『 by code ‖ label 』 );
Description
The valueset.sort function sorts the order of entries in the value set by either label or code. An optional argument,
ascending or descending, allows for the specification of the sort order. Another optional argument, by code or by
label allows you to specify whether to sort the value set's entries by code or label. By default the value set will be
Labels are sorted in case insensitive order, so "c" is considered the same as "C". On the other hand, codes for
alphanumeric value sets are sorted in case sensitive order, so "c" is recognized as different from "C".
Return Value
The function returns a logical value of 1 (true).
Example
valueset enumerators_vs;
forcase STAFF_DICT where STAFF_ROLE = 1 do
enumerators_vs.add(STAFF_NAME, STAFF_CODE);
endfor;
enumerators_vs.sort();
setvalueset(ENUMERATOR_LOGIN, enumerators_vs);
Description
The break statement exits a do, while, for, or forcase loop early and continues execution with the first statement after
the enddo.
Example
// find the spouse
numeric ptrSpouse,ctr;
for ctr in PERSON do
if P02_REL = 2 then
ptrSpouse = ctr;
break;
endif;
enddo;
Do Statement
Format:
do [[varying] var = expression] while/until condition [by expression]
statements;
enddo;
In the varying clause, the variable must be a numeric variable. The variable assignment is performed once, before the
first repetition of the loop. The varying keyword has no effect on the command, and so may be omitted. It is possible
to declare a numeric variable after the varying keyword, in which case the numeric variable's scope is the duration of
the do loop.
Example:
HEAD = 0;
do varying i = 1 until HEAD > 0 or i > totocc(PERSON)
if RELATIONSHIP(i) = 1 then
HEAD = i;
endif;
enddo;
This same example could be rewritten using the while condition as follows:
HEAD = 0;
do varying i = 1 while HEAD = 0 and i <= totocc(PERSON)
if RELATIONSHIP(i) = 1 then
HEAD = i;
endif;
enddo;
Exit Statement
Format
exit 『 return_value』 ;
Description
The exit statement terminates a procedure or user-defined function before normal processing is expected to end. When
the statement is executed, processing stops for the current procedure or function, and control is passed to the next
procedure or function.
When utilized in a user-defined function, an optional argument, return_value, sets the function's return value before
exiting the function. The type of the argument, either a numeric or string expression, must match the function's return
type.
Example 1
function numeric FirstWoman()
do numeric ctr = 1 while ctr <= HH_MEMBERS
if SEX(ctr) = 2 then
FirstWoman = ctr;
exit; // exit the function, we've found our first woman!
endif;
enddo;
FirstWoman = 0; // no woman was found
end;
See also: Universe Statement, Skip Case Statement, Stop Function, EndCase Statement
For Statement
Format
for 『 counter in』 multiple_occurring_group 『 where condition』 do
// statements
endfor;
Description
The for statement executes one or more statements repeatedly within a loop for each occurrence of a multiply occurring
group. The multiple_occurring_group can be a form, roster, record, item, relation, or even a dictionary (following a
selcase function call). The multiply occurring group controls how many times the for loop is executed. An optional
logical condition, referencing items in the group, can be supplied to restrict the cases processed by the loop.
The numeric variable counter contains the number of the current occurrence being examined. It cannot be changed
inside the loop, but it can be referenced. Its starting value is 1 and its ending value is determined by the number of
occurrences of the group. It is possible to declare a variable local to the loop by adding numeric before the counter
variable.
If the group name is a record, item, or relation, then the appropriate keyword record, item, or relation can be used
before the name.
The for statement should be coded outside of the group it references. In the example below, note that the code is
executed in PROC QUEST. It should not be located in PROC PERSON_REC_EDT or in a procedure for any of the data items
within the person record.
Example
See also: ForCase Statement, For (Dictionary) Statement, Do Statement, While Statement
ForCase Statement
Format
forcase dictionary_name 『 where condition』 do
// statements
endfor;
Description
The forcase statement executes one or more statements repeatedly within a loop for each case in a file opened as
external dictionary dictionary_name. An optional logical condition, referencing items in the external dictionary, can be
supplied to restrict the cases processed by the loop.
It is possible, by specifying dictionary access parameters after the dictionary name, to limit the cases that the forcase
loop processes.
Example
numeric incomplete_households;
forcase CENSUS_DICT where INTERVIEW_STATUS = 3 do
inc(incomplete_households);
endfor;
See also: For Statement, For (Dictionary) Statement, CountCases Function, LoadCase Function
Description
The for (dictionary) statement executes one or more statements repeatedly within a loop for each selection made during
The dictionary_name must be supplied and refers to an external dictionary associated with your application. An optional
argument, mark_type, indicates what kinds of cases to process, and can be one of the following values:
mark_type Description
marked Process only the cases selected by the operator during the selcase function call.
unmarked Process the cases not selected by the operator.
all Process all of the cases that could have been selected by the operator.
Example
selcase(OCCUPATION_DICT,"Plantation") multiple;
numeric maxEducation;
for OCCUPATION_DICT do
maxEducation = high(maxEducation,MAXIMUM_EDUCATION);
endfor;
If Statement
Format:
if condition then [ ] indicates that this part is optional.
statements;
[elseif condition then
statements;]
[else
statements;]
endif;
Description:
The if statement executes different statements based on the value of "condition". The condition following the if
command is evaluated. If the condition is true, then the statements following it are executed and execution moves to
the first statement after the endif keyword. If the condition is false, execution moves to the first elseif keyword or the
else keyword (if there are no elseif keywords).
The elseif blocks are evaluated in the same way as the first if block. When CSPro finds a condition that is true it
executes the statements following it and moves to the first statement after the endif keyword. If all the conditions are
false, the statements following the else keyword are executed. If none of the conditions are true and there is no else
keyword, execution moves to the first statement after the endif keyword without the execution of any statements
within the if statement.
Every if statement must contain an endif keyword. However, if multiple elseif keywords are nested within an if block,
they may be terminated with a single endif keyword. The statements within the if statement can be any number of
CSPro statements. If a condition contains an inequality (e.g., >, <, >=, <=) and one of the values tested in the
inequality is a special value (e.g., MISSING, REFUSED, NOTAPPL, or DEFAULT), the result of the condition is false
and execution skips to the statement following the else.
Example:
if X = 3 then
z = 6;
elseif x in 4:5 or y in 7:9,12 then
Next Statement
Format
next;
Description
The next statement ends a do, while, for, or forcase loop early and continues execution with the next iteration of the
loop. If the next iteration results in the termination of the loop, then execution will begin with the first statement after the
enddo.
Example
// fill an array with the occurrence numbers of all spouses
spouseNumber = 0;
for ctr in PERSON do
if P02_REL <> 2 then
next;
endif;
inc(spouseNumber);
spouseIndices(spouseNumber) = ctr;
enddo;
Universe Statement
Format
universe condition 『 case』 ;
Description
The universe statement determines whether to allow normal execution of a procedure or function based on the value of
condition. The condition following the universe command is evaluated. If the condition is true, then the statements
following it are executed. If the condition is false, processing stops for the current procedure or user-defined function, and
control is passed to the next procedure or user-defined function.
In batch editing mode, if the optional case keyword is specified and the condition is false, the program will stop
processing the case but will still write it to the output file.
Example 1
Example 2
PROC ROOF_TYPE
universe HHTYPE = 1 case;
// the universe is equivalent to:
if not HHTYPE = 1 then
endcase;
endif;
When Statement
Format
when expression1 『 :: expression2 :: expressionN』 ;
『 value1』 『 :: value2 :: valueN』 -> statement;
『 -> else_statement;』
endwhen;
Description
The when program control statement executes a statement based on the value of one or more other variables, combining
if statements with the power of recode. The statement is similar to statements in other programming languages (like
switch in C or when in Kotlin).
One or more expressions, expression1 to expressionN, are given, with each expression separated by two colons ::.
These expressions must evaluate to either a number or string. Based on the values of each expression, CSPro evaluates
each line between the when and endwhen, trying to match a line's values, value1 to valueN, with the evaluated
expressions. Once a line matches, a statement, which is given after the arrow ->, is executed and program control
moves to statements following the endwhen. Only a single statement can be provided per line.
Each value provided must evaluate to the same type (number or string) as its respective expression. A value omitted is
considered a match, and if no values are provided, then the optional else_statement is executed (assuming no preceding
line matched).
Example 1
Example 2
PROC EMPLOYMENT_STATUS
when EMPLOYMENT_STATUS;
1:2 -> skip to UNEMPLOYMENT;
<= 6 -> skip to INFORMAL_EMPLOYMENT
>= 7 -> skip to FORMAL_EMPLOYMENT;
endwhen;
While Statement
Format
while condition do
// statements
enddo;
Description
The while statement executes one or more statements repeatedly, in a loop, while the logical condition is true. Unlike
a do loop, a counter is not automatically incremented, so you must ensure the termination of the loop. The condition is
evaluated on each repetition of the loop before any of the statements are executed.
Example
setfile(inputFile,"Countries.txt");
string countriesLine;
while fileread(inputFile,countriesLine) do
// ...
enddo;
Description
The assignment statement sets a variable equal to the value of an expression. If the expression is numeric or conditional,
then the variable must be numeric. If the expression is a string expression, then the variable must be alphanumeric.
Examples
AGE = 10;
Q102 = PREV_AGE;
Y = sqrt(X);
NAME = "John Doe";
NAME = "John " + toupper("Doe");
Recode Statement
(Prior to CSPro 7.4, a different version of the recode/box command function existed. That version should no longer be
used as it will be removed in a future release.)
Format
recode expression1 『 :: expressionN』 -> destination_variable1 『 :: destination_variableN』 ;
value1 『 :: valueN』 -> result1 『 :: resultN』 ;
『 -> else_result1 『 :: else_resultN』 ;』
endrecode;
Description
The recode statement assigns a value to one or more output variables based on the value of one or more other input
variables. It can be used to rescale variables, to assign values to variables, or to create a composite variable from
existing variables. In many instances it is easier to use than writing multiple if statements.
One or more expressions, expression1 to expressionN, are given, with each expression separated by two colons ::.
These expressions must evaluate to either a number or string. Based on the values of each expression, CSPro evaluates
each line between the recode and endrecode, trying to match a line's values, value1 to valueN, with the evaluated
expressions. Once a line matches, one or more results, result1 to resultN, which are given after the arrow ->, are
assigned to destination_variable1 to destination_variableN, which can be dictionary items, working variables, array
values, list values, or the return values of a user-defined function. A destination variable can also be included among the
expressions. It is possible to declare a new numeric, alpha, or string destination variable as part of the recode
statement (see Example 2 below).
Each value provided must evaluate to the same type (number or string) as its respective expression, and each result
Pa ge 479 of 684 As s ignment Sta tements
must be the same type as the destination variable. A value omitted is considered a match, and if no values are provided,
then the optional else_result1 to else_resultN are used in the assignment (assuming no preceding line matched).
Example 1
recode AGE -> AGE_GROUP;
0:19 -> 1;
20:29 -> 2;
30:39 -> 3;
40:49 -> 4;
>= 50 -> 5;
-> 9;
endrecode;
Example 2
PROC AGE
recode RELATIONSHIP :: AGE(1) - AGE -> numeric valid_age_difference_with_head;
3, 4 :: < 12 -> false; // biological and step children must be 12+
years younger
6 :: > -12 -> false; // parents must be 12+ years older
7 :: < 24 -> false; // grandchildren must be 24+ years younger
-> true;
endrecode;
Example 3
function string GetDisplayName()
recode NAME -> GetDisplayName;
"" -> "(Undefined)";
-> strip(toupper(NAME));
endrecode;
end;
Example 4
recode INDUSTRY_SECTION -> min_division :: max_division;
"A" -> 1 :: 3;
"B" -> 5 :: 9;
"C" -> 10 :: 33;
// ...
endrecode;
For example, this shows the differences between using recode prior to CSPro 7.4 and using it starting with CSPro 7.4:
Format:
recode var-1 [:var-2 [:var-n]] => var-out;
[range-1] [:range-2 [:range-n]] => exp;
[range-1] [:range-2 [:range-n]] => exp;
: : :
[: [:]] => other-exp;
endrecode;
Example 2:
recode ATTEND : ED_LEVEL => EDUC;
2,notappl : => 1;
1:1 => 2;
1 : 2,3 => 3;
: => 9;
endrecode;
is equivalent to the following if statements:
Example 3:
recode UNITS : NUMBER => DAYS;
: notappl => notappl;
: missing => missing;
1: => NUMBER;
2: => NUMBER*7;
Pa ge 482 of 684 As s ignment Sta tements
3: => NUMBER*30;
4: => NUMBER*365;
: => missing;
endrecode;
Impute Function
Format:
impute (item-name, expression)
[stat (item-name1, item-name2,....,item-nameN)]
[title (alpha-expression)]
[vset (vset-number)]
[specific];
Column one: lists the values that were assigned during the imputations (1, 2, etc)
Column two: shows the frequency (that is, the total number of times) each value was
assigned (i.e., number '2' was assigned 193 times)
Column four: indicates what percentage each imputation represents from the total number of
imputations made (i.e., number '2' was imputed 193 times, representing barely
one percent (0.8) of the total number of imputations made)
Example:
impute(P04_AGE, TEMPAGE) title("Age updated via TempAge") vset(2);
Code Example:
A code example of this statement can be found in the Examples/Impute folder in the CSPro software folder.
SetValue Function
Format
b = setvalue(variable_name, value『 , occurrence1, ..., occurrence3』 );
Description
The setvalue function executes an assignment when the name of the variable to be assigned to is not known at
compilation time (when writing the application). The function searches for a variable with a name stored in the string
expression variable_name, and if such a variable is found, the function assigns to it the value stored in value. If the
variable is numeric, then value should be a numeric expression, and if the variable is alphanumeric, then it should be a
string expression. Optional arguments, occurrence1 to occurrence3, allow you to pass occurrence numbers to the
function. This function is especially useful when combined with a userbar button or an OnKey character sequence.
Return Value
Pa ge 484 of 684 As s ignment Sta tements
The function returns a logical value of 1 (true) if the variable name was found and the value was successfully changed.
The function returns 0 (false) if the variable was not found, if a string expression was assigned to a numeric variable, or if
a numeric expression was assigned to an alphanumeric variable.
Examples
// the following statements are equivalent but you would not use setvalue
// because the left-hand side of the assignment is already known
setvalue("NAME", "Amelia");
NAME = "Amelia";
setvalue("AGE", 20);
AGE = 20;
// below, the left-hand side of the assignment is not known at compilation time
setvalue(getsymbol(), missing); // set the current field to missing
setvalue(getsymbol(), "DK", 3); // set the third occurrence of the current field to "DK"
GetValue Function
Formats
d = getvalue(variable_name『 , occurrence1, ..., occurrence3』 );
d = getvaluenumeric(variable_name『 , occurrence1, ..., occurrence3』 );
s = getvaluealpha(variable_name『 , occurrence1, ..., occurrence3』 );
Description
The getvalue function returns the value of a variable when the variable name is not known at compilation time (when
writing the application). The function searches for a variable with a name stored in the string expression variable_name,
and if such a variable is found, the function returns the value of that variable.
The functions getvalue and getvaluenumeric return the value of numeric variables. The function getvaluealpha
returns the value of an alphanumeric variable. Optional arguments, occurrence1 to occurrence3, allow you to pass
occurrence numbers to the function.
Return Value
The function returns the variable's value if the variable name was found. If a numeric variable was not found, the functions
getvalue and getvaluenumeric return default. If an alphanumeric variable was not found, getvaluealpha returns a
blank string.
Example
errmsg("The current field's value is %d", getvalue(getsymbol()));
Format 2:
i = accept(heading, alpha-array | string-list);
Description:
The accept function displays a menu with the heading and list of choices ("opt-1", "opt-2", etc.). The operator can
move the down- or up-arrow keys to select the desired options and press Enter. The operator can also use the mouse
to click on the desired option.
The number of options will be calculated dynamically if an alpha array is passed to the accept function. Each element
of the array will be displayed until a blank string is found in the array. A string list can also be used for the options.
Return value:
The function returns the number of the option selected: 1 for the first option, 2 for the second option, etc. The value 0
is returned if the Esc key is pressed and none of the options is chosen.
Example 1:
PROC UR
preproc
I = 0;
do until I in 1:2
I = accept("Area Designation?", "Urban", "Rural");
enddo;
$ = I;
noinput;
Example 2:
PROC GLOBAL
array string acceptOptions(20);
PROC ACTION
preproc
numeric ctr;
do ctr = 1 while ctr <= count(NAME)
acceptOptions(ctr) = NAME(ctr);
enddo;
acceptOptions(ctr) = ""; // mark the end of the array
ACTION = accept("Select a Person to Interview",acceptOptions);
Description
The advance statement moves forward, field by field, to the specified field, executing logic in fields' preproc and
postproc events as it proceeds. It acts as though the Enter key were pressed repeatedly until either the specified field
appears or one of the procedures executed during the advance goes to a different field.
The optional field_name is either the name of a field or is a string variable specifying the name of the field. If the
field_name is not specified, the program attempts to advance to the end of the level.
The target field can be located in any record at the same level as the current field, but it cannot be located at a different
level. The field must be later on the path than the current field, meaning that it is a field that has not yet been entered. If
the field has already been entered, an error message will be displayed during data entry. If you do not know whether the
field is earlier in the data path, use the move statement.
Note that the advance statement behaves differently from the skip statement, moving over some number of fields, rather
than skipping past the fields.
Example
PROC CEB
preproc
// for girls aged less than 15, prefill in 0 for children ever born, but
// keep the information on path in case the keyer wants to change it
if AGE < 15 then
CEB = 0;
CEB_M = 0;
CEM_F = 0;
advance to MARITAL_AGE;
endif;
See also: Ask Statement, Move Statement, NoInput Statement, Reenter Statement, Skip Statement
Ask Statement
Format
ask if condition;
Description
The ask statement determines whether a field, roster, or form is eligible for data entry. The statement must be coded in a
preproc. The condition is evaluated, and if true, any additional logic is executed until the appropriate field is reached
and entered. If the condition is false, then control moves to the next field, roster, or form.
The ask statement is an inversion of the skip statement and can be written as:
See also: Advance Statement, Move Statement, Reenter Statement, Skip Statement
ChangeKeyboard Function
Feature Upgrade: Starting with CSPro 7.1, you should no longer use this function as it may soon be removed from
CSPro. To replicate the behavior of the function, you can use the setproperty function with the argument "Keyboard".
Format
i = changekeyboard(dictionary_symbol『 ,keyboard_id』 );
Description
The changekeyboard function modifies the keyboard input associated with a field on a form. The value
dictionary_symbol may be an item, group, form, or an entire dictionary. The numeric expression keyboard_id refers to
a keyboard ID (which can be determined using the Change Keyboard Input dialog box). A keyboard ID of 0 will reset the
field to the default keyboard for the application.
Return Value
The function returns the number of items whose capture positions were successfully changed. If a keyboard ID is not
specified, the function will instead return the current keyboard ID associated with a given field.
Example
PROC NAME
onfocus
// use the Dvorak keyboard to key in the name
changekeyboard(NAME,66569);
Connection Function
Format
b = connection(『 WiFi ‖ Mobile』 );
Description
The connection function determines whether a device has access to the Internet. If called with no arguments, then the
function determines whether the device can connect to the Internet via Wi-Fi or via a mobile network like 3G, 4G, or
GPRS. Optional arguments (WiFi and Mobile) allow you to determine if a given connection is valid on mobile CSEntry.
On desktop CSEntry, the optional arguments are ignored and the function returns whether a connection can be made via
a LAN, a Wi-Fi connection, or a mobile network connection.
Pa ge 488 of 684 Da ta Entry Sta tements a nd Func ons
Return Value
The function returns a logical value of 1 (true) if a connection exists and 0 (false) otherwise. The presence of a connection
does not guarantee access to the Internet, however. For example, a device could be connected to a Wi-Fi router that
does not have an Internet connection. The function would still return 1 in that case.
Example
// only sync when connected on Wi-Fi
if connection(WiFi) then
if syncconnect(Dropbox) then
syncdata(PUT, HOUSEHOLDQUESTIONNAIRE_DICT);
syncdisconnect();
endif;
endif;
DeMode Function
Format
i = demode();
Description
The demode function returns the data entry mode and is often used to limit the execution of certain statements to a
specific mode. For example, a variable may need initialization when the operator invokes add or verify mode, but can be
left unaltered for modify mode.
Return Value
The function returns an integer but there are also aliased values for the modes. There are three data entry operator
modes:
Example
if demode() = add then
INTERVIEW_START_DATE = sysdate("YYYYMMDD");
endif;
Display Orientation
Display Orientation
In a data entry application that uses a rich visual interface, you may want to exert programmatic control over the
orientation of the screen, e.g., whether the display is horizontal or vertical. For computers with video cards that support
changing the display orientation, which includes most new computers as well as tablet PCs, two functions can be used
to control the orientation. Calls to these functions are ignored when executed on mobile devices.
Format:
Description:
The getorientation function returns the current display orientation. The function will return one of four values:
0: The natural orientation of the display device
90: The display orientation is rotated 90 degrees from the natural orientation
180: The display orientation is rotated 180 degrees from the natural orientation
270: The display orientation is rotated 270 degrees from the natural orientation
SetOrientation Function
Format:
b = setorientation(numeric-expression);
Description:
The setorientation function changes the orientation of the display. The numeric expression must be one of the four
values listed above. The function returns 1 if the display orientation was successfully changed, 0 otherwise.
Example:
function rotateScreen()
numeric nextOrientation = getorientation() + 90;
if nextOrientation = 360 then
nextOrientation = 0;
endif;
setorientation(nextOrientation);
end;
EditNote Function
Format
s = editnote(『 dictionary_symbol』 『 ,operator_id』 );
Description
The editnote function displays the note entry dialog box for adding or changing the note for a field or other dictionary
symbol. You can use this function to force the collection of note text. The operator can always create or edit a note
manually by pressing Ctrl+N (on desktop CSEntry) or by clicking on the Notes button on mobile CSEntry.
If no arguments are supplied, the note for the current field is displayed for editing. The optional argument
dictionary_symbol allows you to edit the note for a dictionary item, record, or level. If using the dictionary name, the
case note is edited. Another optional string argument, operator_id, allows you to edit the note for a particular operator. If
no operator ID is supplied, then the note for the current operator will be edited.
Return Value
The function returns a string containing the note text. If there is no note, the length of the string will be 0.
Example
EndLevel Statement
Format
endlevel;
Description
The endlevel statement ends data entry for the current level of the current questionnaire. The effect of this statement
depends on where it is used. If it is used in a field, roster, or form procedure, all remaining procedures within that level
are skipped and control passes to the level postproc.
If the statement is executed in a level preproc or postproc, control passes to the postproc of the next-highest level. If it
is used in the highest-level postproc, control passes to the form file's postproc (if there is one), and then data entry is
terminated for the current case.
In system-controlled applications, CSPro will continue to add cases at the lowest level of a multiple-level dictionary until
it is told to stop by endlevel. Therefore, the statement should be used in the postproc of the lowest level to end data
entry at that level.
Example
if MORE_WOMEN = 0 then
endlevel;
endif;
EndGroup Statement
Format
endgroup;
Description
The endgroup statement finishes data entry for the current group (roster or multiply-occurring form) in a data entry
application. It can not be used in a batch application. If the statement is used in an item procedure, it causes an
automatic skip to the postproc of the current group/record. If the statement is executed in the preproc of the
group/record, the entire group/record is skipped and control passes to the group/record's postproc.
This function has superseded the endsect statement. Where endsect exists in an application, it will continue to work,
but users creating new applications should adopt the endgroup instruction.
Example
if KIDSBORN = 0 then
endgroup;
endif;
Pa ge 491 of 684 Da ta Entry Sta tements a nd Func ons
See also: EndLevel Statement
Enter Statement
Format
enter form_file_name;
Description
The enter statement allows the use of a secondary form file to capture data in a secondary data file.
The form_file_name is the name of the secondary form file that you want to use. The secondary form file must be part of
your data entry application.
Example
if V108 = 6 then
enter OTHERS_FF;
endif;
GetCaptureType Function
Feature Upgrade: Starting with CSPro 7.1, you should no longer use this function as it may soon be removed from
CSPro. To replicate the behavior of the function, you can use the getproperty function with the argument
"CaptureType".
Format
i = getcapturetype(field_name);
Description
The getcapturetype function returns the capture type currently associated with a field on a form. The field_name must
be located on one of the application's forms.
Return Value
The function returns the capture type, which is one of the following values:
Example
GetCaseLabel Function
Format
s = getcaselabel(dictionary_name);
Description
The getcaselabel function returns the label for the case currently associated with the dictionary dictionary_name. The
dictionary can be either the main input dictionary of a data entry application or an external dictionary.
Return Value
The function returns the label if one has been set, or a blank string if there is no label for the case.
Example
PROC LAST_FIELD
string end_query = maketext("Are you finished entering '%s'?", getcaselabel(CENSUS_DICT));
if accept(end_query, "Yes", "No") <> 1 then
reenter;
endif;
GetDeviceID Function
Format
s = getdeviceid();
Description
The getdeviceid function returns a string that can be used to uniquely identify a device. As programmed now, it returns
a MAC address on Windows and an ANDROID_ID on Android, but this may change in the future. It is theoretically
possible that this ID is not unique, but the chance of this happening without deliberate user invention is virtually
impossible.
Return Value
The function returns a string with the unique ID.
Example
GetImage Function
Format
s = getimage(item_name ‖ value_set_name, value);
Description
The getimage function returns the filename of a value set image. Based on the provided item or value set, the image
filename associated with the value will be returned. The value is either a numeric or string expression, based on the
type of the item. If an item_name is used instead of a value_set_name, then the function will search for the relevant
image in the item's current value set.
Return Value
The function returns a string containing the image filename, or a blank string if there is no value set image for the
provided value or if the provided value does not exist in the value set.
Example
PROC IMAGE_SELECTOR
string imageFilename = getimage(IMAGE_SELECTOR_VS1, IMAGE_SELECTOR);
view(imageFilename);
GetNote Function
Format
s = getnote(『 dictionary_symbol』 『 ,operator_id』 );
Description
The getnote function returns a string containing the note for a field or other dictionary symbol. If no arguments are
supplied, the note for the current field is returned. The optional argument dictionary_symbol allows you to query the
note for a dictionary item, record, or level. If using the dictionary name, the case note is returned. Another optional string
argument, operator_id, allows you to query the note for a particular operator. If no operator ID is supplied, then the note
for the current operator will be returned.
Example
string caseInformation = getcaselabel(CENSUS_DICT);
if length(getnote(CENSUS_DICT)) > 0 then
// add the case note to the case label for a full description of the case
caseInformation = caseInformation + " - " + getnote(CENSUS_DICT);
endif;
GetOperatorId Function
Format
s = getoperatorid();
Description
The getoperatorid function returns the operator ID for the current operator. The operator ID may have been entered by
the operator or passed as a parameter in the PFF file for the run.
Return Value
The function returns a string containing the operator ID assigned to or entered by the current operator. In batch mode it
returns a blank string.
Example
LF_USER_ID = getoperatorid();
if not loadcase(LOGINS_DICT, LF_USER_ID) then
errmsg("You do not have access to this system");
stop(1);
endif;
See also: SetOperatorId Function, GetDeviceID Function, GetUserName Function, GetBluetoothName Function
GetOS Function
Format
i = getos(『 detailed_information』 );
Description
The getos function returns a code that indicates what operating system is running the program. The codes are:
Return Value
The function returns the operating system code or, optionally, a text description of the operating system.
Example
string operating_system_name;
getos(operating_system_name);
errmsg("Code = %d, Text = %s", getos(), operating_system_name);
// the above code might display, for example:
// Code = 10, Text = Windows;6.1
// Code = 20, Text = Android;4.2.2
GetRecord Function
Format
s = getrecord(item_name_string);
Description
The getrecord function returns the name of the record that contains the item identified by the string expression
item_name_string.
Return Value
The function returns a string with the name of the record. If the item does not exist, the function returns a blank string.
Example
errmsg("SEX belongs to %s",getrecord("SEX")); // SEX belongs to POPULATION
errmsg("WATERSOURCE belongs to %s",getrecord("WATERSOURCE")); // WATERSOURCE belongs to
HOUSING
GetUserName Function
Format
s = getusername();
Description
This getusername function returns the login of the current user. On Windows this is the name of the user currently
logged in, whereas on Android this is the first account registered on the device.
Example
PROC USER_NAME
preproc
USER_NAME = getusername();
noinput;
See also: GetDeviceID Function, GetOperatorId Function, GetOS Function, GetBluetoothName Function
GPS Function
It is possible to take advantage of the functionality of a Global Positioning System (GPS) receiver when designing an
application for use on either a laptop or tablet with a GPS receiver or using either Android or Windows Mobile operating
systems. Manipulating the GPS receiver is done from within the program's logic.
Basic Example
gps(open); // on Android
gps(open,3,4800); // on a laptop or Windows tablet; COM3, 4800 baud
if gps(read,5) then // a successful attempt at a read, for up to five seconds
errmsg("Latitude is %f, longitude is %f",gps(latitude),gps(longitude));
else
errmsg("GPS signal could not be acquired");
endif;
gps(close);
Opening and Closing the GPS Receiver
b = gps(open | close);
Before using the device's GPS receiver, it is necessary to open a connection to the GPS unit. After making all necessary
GPS readings, close the connection. The function returns 1 if successful, 0 otherwise. If using CSPro on a Windows
laptop or tablet, it is necessary to specify the hardware settings of the GPS unit, specifically the number of the COM port
and the baud rate (see the above example).
Obtaining the Last Successful GPS Reading
b = gps(readlast);
The readlast command obtains the last successful GPS reading, a reading that might be very old. If the GPS unit has
been turned on for some time and the device is being used outdoors, it is likely that the reading is very fresh (recent), but
if, for example, an enumerator walks inside a building to conduct an interview, the reading may be minutes or hours old,
from the last time that the enumerator was outside. The function returns 1 if there was a successful previous reading, 0
otherwise.
Obtaining a New GPS Reading
b = gps(read);
b = gps(read,numeric-expression);
i = gps(read,numeric-expression,alpha-expression);
The read command obtains a new GPS reading in a specified time period (in seconds). If no time period is specified, the
program will pause for up to three seconds to obtain a reading. A reading time of up to ten minutes (600 seconds) may
be specified. An optional third argument displays a message while the program attempts to obtain a GPS reading. The
Pa ge 497 of 684 Da ta Entry Sta tements a nd Func ons
message box has a cancel button and if the user cancels the operation, the function returns -1. Otherwise the function
returns 1 if a reading was successful, 0 otherwise. Unlike the readlast command, a successful function call with read
guarantees a fresh GPS reading.
It is possible on Android devices to specify a desired level of accuracy for the reading. When specified, the function will
continue to take GPS readings until a reading at or below the level of accuracy is achieved. The function will still return 1
if, though timing out, a successful reading at an accuracy level greater than specified was achieved during the allotted
time period. This ensures that a GPS reading will be as accurate as possible, but with the assumption that any reading
is better than no reading. Use a second numeric expression to specify the level of the accuracy.
b = gps(read,time-period,accuracy-level);
i = gps(read,time-period,accuracy-level,alpha-expression);
Querying a Successful GPS Read
If the readlast or read commands returned successfully, the GPS system has valid values for latitude and longitude.
Further attributes of the reading can be queried though they are not guaranteed to be valid.
f = gps(latitude);
f = gps(longitude);
f = gps(altitude);
f = gps(satellites);
f = gps(accuracy);
f = gps(readtime);
Latitude and longitude return coordinates in degrees. Altitude returns the number of meters above sea level of the
reading. Satellites returns the number of satellites used to calculate the values in the last reading. Generally, the greater
number of satellites, the better the quality of the reading. Accuracy is a calculation of the precision of the last reading.
On Windows devices, an accuracy value of 1 is the most accurate and 50 is the least accurate reading. On Android
devices, the value signifies the accuracy of the reading, measured in meters. Readtime returns, in local time, the time of
the last successful reading. If the queried value (other than latitude and longitude) is not available, or if the last GPS read
was unsuccessful, the function will return DEFAULT.
Calculating Distances
The gps function can also calculate great-circle distances between two sets of GPS coordinates. Great-circle
calculations give a rough approximation of the distance between two points. The function returns the distance as
measured in meters.
f = gps(distance,numeric-expression,numeric-expression,numeric-expression,numeric-
expression);
numeric kilometersToCensusBureau =
gps(distance,38.846261,-76.929445,gps(latitude),gps(longitude)) / 1000;
HideOcc Function
Format
b = hideocc(roster_name ‖ form_name『 (occurrence_number)』 );
Description
The hideocc function hides an occurrence of a roster or form from showing in the mobile CSEntry case tree. It will also
hide the row from appearing on a roster on desktop CSEntry. If no numeric occurrence_number is supplied, then the
current occurrence for the roster or form will be hidden.
Return Value
The function returns a logical value 1 (true) if successful and 0 (false) otherwise.
Example
Pa ge 498 of 684 Da ta Entry Sta tements a nd Func ons
PROC KEEP_PERSON
if KEEP_PERSON = 0 then
hideocc(PERSON000);
endif;
Highlighted Function
Format
b = highlighted(field_name);
Description
The highlighted function is used to determine whether a field, field_name, is on the path (green) in system-controlled
mode, or has been passed through (green or yellow) either directly or by skips in operator-controlled mode. This can be
used to determine whether an item has been entered or skipped because of logic in system-controlled mode, or whether
it is before or after the high water mark in operator-controlled mode.
Return Value
The function returns 1 (true) if the field has been passed (green or yellow) and 0 (false) if the field is yet to be entered
(white).
Example
if highlighted(HOURS_WORKED) then
errmsg("Hours Worked = %d",HOURS_WORKED);
else
errmsg("Skipped or not entered yet!");
endif;
See also: VisualValue Function, Operator vs. System Controlled, Color of Fields in Data Entry User's Guide
IsPartial Function
Format
b = ispartial(『 dictionary_name』 );
Description
The ispartial function determines, for data entry input dictionaries whether a case was opened from a partial case, and
for external dictionaries where the case is currently marked as a partial save. By default the function looks at the case
loaded by the main data entry input dictionary, but an optional argument, dictionary_name, allows you to determine the
partial save status of cases in external dictionaries.
Return Value
The function returns a logical value of 1 (true) if the case was opened from a partial case or if it has been partially saved
during the data entry session and 0 (false) otherwise.
Example
IsVerified Function
Format
b = isverified(dictionary_name);
Description
The isverified function determines whether a case has been verified by double-keying. The function, with argument
dictionary_name, can look at cases in external dictionaries or in batch edit input dictionaries.
Return Value
The function returns a logical value of 1 (true) if the case was verified and 0 (false) otherwise.
Example
PROC CASE_MENU
if CASE_MENU = 1 then // verify case
if isverified(SURVEY_DICT) then
errmsg("The case has already been verified.");
reenter;
else
// ...
endif;
endif;
Move Statement
Format
move 『 to』 field_name 『 advance ‖ skip』 ;
Description
The move statement allows movement to a field without regard to whether it is before or after the current field. This is
particularly useful in an OnKey or userbar function because you may not know at what point the function will be called.
The field_name is either the name of a field or is a string variable specifying the name of the field.
Movement to a field before the current field acts exactly like a reenter statement. The action of move to a field after the
current field depends on the optional keywords advance or skip. If no keyword is specified, or if skip is coded, then
forward movement acts exactly like a skip statement, skipping past fields. If advance is coded, then forward movement
acts exactly like an advance statement, moving over fields.
See also: Advance Statement, Ask Statement, Reenter Statement, Skip Statement
NoInput Statement
Format
noinput;
Description
The noinput statement prevents input of a field during data entry. This command can be coded only in a preproc or
onfocus procedures.
When the statement is executed in a preproc, control passes directly from the field's preproc to the field's postproc,
executing the onfocus and killfocus procedures (if present) and performing the item range check, but not permitting
input of the field.
When the statement is executed in an onfocus, control passes directly from the field's onfocus to the field's postproc,
executing the killfocus procedure if present and performing the item range check, but not permitting input of the field.
The field is on the data entry path even though entry is prevented.
The effect of the noinput statement is similar, but not identical, to that of a protected field. If a noinput statement is
used, it is possible to back-tab to the field. It is not possible to back-tab to a field that is protected.
Example
PROC MARITAL_STATUS
preproc
if AGE < 12 then
MARITAL_STATUS = 1;
noinput;
endif;
Description:
The onchangelanguage global function allows you to execute certain code after the user has changed the CAPI
language via the CSEntry Options menu. This function must be placed in the Global Procedure.
Returned value:
The return value of the function is ignored.
Example:
function OnChangeLanguage()
setvaluesets(concat("_",getlanguage()));
end;
Description:
The onchar global function allows you to trap characters in order to perform special actions or to change the action of
the character. It can also be used to disable or remap characters. This function must be placed in the Global
Procedure.
If an onchar global function is coded, every character the operator types is sent to the onchar function for processing. If
the onchar function returns a value, then the return value is processed by the field as the character. If a statement in
the onchar function causes movement to another field within the case, then the movement is executed. If no onchar
function is coded, then characters are unmodified.
The key value is a number code identifying what character was typed using the keyboard. Its value can be used
within the function.
You can use the OnKey Character Map to determine the value of characters.
Calls to this function are ignored when executed on mobile devices.
Differences between onkey and onchar:
The onchar function differs from the onkey function. A keystroke contains information about the key pressed on the
keyboard, regardless of what this keystroke eventually maps to. A character refers to the final representation of one or
more keystrokes. This is important when using non-Latin languages that require multiple keystrokes to create one
character. For example, creating the Chinese character '马' using a Pinyin input system requires two keystrokes: 'm'
and 'a.' The code for this character is 39532. If typing such a keystroke, onkey will be called several times before
onchar is called. If both onkey and onchar functions exist, onkey will always be called before onchar is called.
The onchar function also returns values different from the onkey function for some Latin keystrokes. For example, with
Caps Lock off, if a keyer holds down Shift and types 'M,' onkey will return 1077 (1000 for the shift, 77 for 'm'). Onchar,
on the other hand, will return 77, the character code for 'M.' For a lowercase 'm,' onkey returns 77 and onchar returns
109, the character code for 'm.'
The onchar function does not return any information about whether any of the Shift, Ctrl, or Alt keys were held down
when the character was typed.
Returned value:
Like any global function, the onchar global function returns an integer value. The value should be either the value of the
character pressed (the same as the value passed to the function), a substituted character value (remapping the
character), or zero (0) to indicate that the character is to be ignored.
Example:
See also: Global Procedure, User Defined Functions, OnKey Character Map, Onkey Global Function
Description:
The onkey global function allows you to trap keystrokes in order to perform special actions or to change the action of
the key. It also can be used to disable or remap keys. This function must be placed in the Global Procedure.
If an onkey global function is coded, every keystroke the operator types is sent to the onkey function for processing. If
the onkey function returns a value, then the return value is processed by the field as the keystroke. If a statement in
the onkey function causes movement to another field within the case, then the movement is executed. If no onkey
function is coded, then keystrokes are unmodified.
The key value is a number code identifying what key was pressed on the keyboard. Its value can be used within the
function. See detailed description below.
You can use the OnKey Character Map to determine the value of characters.
Calls to this function are ignored when executed on mobile devices.
Returned value:
Like any global function, the onkey global function returns an integer value. The value should be either the value of the
key pressed (the same as the value passed to the function), a substituted key value (remapping the key), or zero (0)
to indicate that the key is to be ignored.
Example:
function OnKey(numeric x)
if x in 114:116 then { F3, F4, F5 }
{ don't allow these keys to work, eat the key }
OnKey = 0;
elseif x = 2069 then { Ctrl+E go to END_FIELD }
move to END_FIELD;
else
OnKey = x; { return rest of keys }
endif;
end;
For example, if Shift, Ctrl, and/or Alt are held down when A is pressed
Keys Code
A 65
Shift+A 1065
Ctrl+A 2065
Shift+Ctrl+A 3065
Alt+A 4065
Alt+Shift+A 5065
Alt+Ctrl+A 6065
Alt+Shift+Ctrl+A 7065
Numbers
For the number keys across the top of the keyboard, or on the numeric keypad when the "NumLock" button is
depressed, the following keyboard key value will be returned:
Key Code
0 48
1 49
2 50
3 51
4 52
5 53
6 54
7 55
8 56
9 57
Letters
For the letters a-z and A-Z, the following keyboard key values will be returned. Please note that the status of the
CapsLock key does not effect the code. However, if the Ctrl, Shift, and/or Alt keys are being held down, they will
change the code returned. See above.
Key Code
a 65
b 66
c 67
d 68
e 69
f 70
g 71
h 72
i 73
j 74
k 75
l 76
m 77
n 78
o 79
p 80
q 81
r 82
s 83
t 84
Pa ge 504 of 684 Da ta Entry Sta tements a nd Func ons
t 84
u 85
v 86
w 87
x 88
y 89
z 90
Function Keys
Key Code
F1 112
F2 113
F3 114
F4 115
F5 116
F6 117
F7 118
F8 119
F9 120
F10 121
F11 122
F12 123
Miscellaneous Keys:
Key Code
SysReq no code returned
Bksp 8
Tab 9
Enter 13 (with or without
num lock)
Break 19
Caps 20
Escape 27
Space 32
Page Up 33
Page Down 34
End 35
Home 36
Left Arrow 37
Up Arrow 38
Right 39
Arrow
Down 40
Arrow
Pa ge 505 of 684 Da ta Entry Sta tements a nd Func ons
Insert 45
Delete 46
Wnd 91 (flying window)
Scroll Lock 145
; 186
= 187
, 188 (comma)
- 189
. 190 (period)
/ 191
` 192 (accent)
[ 219
] 221
\ 220
' 222
Control Keys:
Key Code
Shift 1016
Ctrl 2017
Alt 4018
See also: Global Procedure, User Defined Functions, OnKey Character Map, OnKey Character Map
Description
OnStop is a special global function. It has no return value and must be placed in the PROC GLOBAL section just like any
other user-defined functions.
When defined, it provides control over what happens when the data entry operator tries to stop data entry using the ESC
key, the Stop button, pressing Ctrl+S, or attempting to exit data entry. When any of the above events occur, the
OnStop function is executed and no stop dialog (discard, save, cancel) occurs.
If an OnStop function has been coded in a data entry application, then when resuming a partial case, no resume dialog
("Do you want to go to last...") occurs. If special actions are required when entering a partial case, check whether a
partial case has been entered using the ispartial function and program the appropriate action. You can retrieve the
name and occurrence number of the last field entered (on a one-level application) by calling getsymbol(savepartial).
The OnStop function is not executed when the stop function is executed.
The OnStop function can be used to keep the operator from stopping data entry (see Example 1 below) or to allow
stopping only under certain conditions (see Example 2 below).
Example 1
PROC GLOBAL
function OnStop()
reenter;
end;
Example 2
Pa ge 506 of 684 Da ta Entry Sta tements a nd Func ons
PROC GLOBAL
function OnStop()
if getsymbol() in "FIRST_NAME", "LAST_NAME" then
reenter;
else
savepartial();
stop(1);
endif;
end;
See also: User-Defined Functions, Function Statement, Stop Function, SavePartial Function, IsPartial Function,
EndLevel Statement
Description
OnSystemMessage is a special global function. It is a function that gets called when a system message is issued. This is
similar to the OnKey or OnStop functions, which get called automatically by the running CSPro application, not because
of a call to the function in your logic. A system message is a message such as "Invalid subscript," not a user message
that comes from an errmsg statement.
The function must return a numeric value and you can provide from one to three parameters. If one numeric parameter is
provided, then it will receive the message number. If two numeric parameters are provided, the second numeric parameter
will get the message type (1 = error, 2 = warning). If a string parameter is provided, then it will receive the message text.
Return Value
Return 0 (false) to suppress the message. Returning anything other than 0 means that the message will be issued
(displayed to the enumerator in a data entry application or written to the listing file in a batch application).
Example 1
function OnSystemMessage(numeric message_number, numeric message_type)
// suppress all warning messages
recode message_type -> OnSystemMessage;
1 -> true;
2 -> false;
endrecode;
end;
Example 2
Prompt Function
Format
s = prompt(message『 , initial_value』 『 , prompt_type』 );
Description
The prompt function displays a dialog box showing the string expression message and provides a text box for an
operator to enter a string value. An optional string expression initial_value determines the starting value displayed to the
operator. Another optional argument, prompt_type, provides a way to customize the kind of entry permitted. Types
include:
Return Value
The function returns a string value containing the entered value. If the operator canceled without entering a string, the
function returns a blank string (even if an initial value was provided).
Example 1
string surveyPassword = prompt("Enter your password:",password);
Example 2
if ROOF_TYPE = 9 then // other (specify)
ROOF_TYPE_OTHER = prompt("Specify the kind of roof on the dwelling unit:", ROOF_TYPE_OTHER,
multiline);
endif;
Protect Function
Pa ge 508 of 684 Da ta Entry Sta tements a nd Func ons
Format
i = protect(symbol, protect_property);
Description
The protect function modifies the protected property of the field symbol. If symbol is a block, group, form, or dictionary,
the function will try to apply the protected property to all fields belonging to that symbol. The numeric expression
protect_property turns the protected property off (if zero) or on (if nonzero).
Return Value
The function returns the number of items whose protected property was successfully changed.
Example
PROC RELATIONSHIP
onfocus
// automatically set the first person to be the head of the household
// and protect the field so that it is not shown to the enumerator
if curocc() = 1 then
RELATIONSHIP = 1;
protect(RELATIONSHIP, true);
else
protect(RELATIONSHIP, false);
endif;
PutNote Function
Format
b = putnote(note_text『 ,dictionary_symbol』 『 ,operator_id』 );
Description
The putnote function sets the note for a field or other dictionary symbol, replacing the note with the text found in the
string expression note_text. If the note text is an empty string, the note will be removed. The optional argument
dictionary_symbol allows you to set the note for a dictionary item, record, or level. If using the dictionary name, the
case note is set. Another optional string argument, operator_id, allows you to set the note for a particular operator. If no
operator ID is supplied, then the note for the current operator will be set.
Return Value
The function returns a logical value of 1 (true) if successful and 0 (false) otherwise.
Example
PROC STATUS
// set the case note to a description of the interview status
putnote("Interview Status: " + getlabel(STATUS,STATUS),CENSUS_DICT);
Format
i = randomizevs(dictionary_symbol『 , exclude(exclude_code1, ..., exclude_codeN)』 );
Description
The randomizevs function scrambles the order of values in a value set. The function is useful when using extended
controls in a data entry application. Some survey applications choose to display the possible response categories in a
randomized way so as to minimize an enumerator or respondent's selection bias.
The dictionary_symbol may be the name of a value set, item, group, form, or an entire dictionary. If randomizing the
value set for an item, an optional exclusion list, exclude_code1, ..., exclude_codeN, allows you to prevent certain
values from being given a random order. This is useful for variables like Don't Know, which, after the randomization, would
remain at the bottom of the list of values.
Return Value
The function returns the number of items for which the value sets were successfully randomized.
Example
randomizevs(FAVORITE_FOOD_GROUP, exclude(5));
becomes
See also: Random Function, RandomIn Function, Seed Function, ValueSet.Randomize Function
Reenter Statement
Format
reenter 『 field_name』 ;
Description
The reenter statement is used to force the entry operator to reenter the contents of the current field or of a field that was
entered earlier. The optional field_name is either the name of a field or is a string variable specifying the name of the
field. If the field_name is not specified, the current field is reentered. The field to be reentered must be earlier on the data
path than the current variable. If it is not, an error message will be displayed during data entry. If you do not know
whether the field is earlier in the data path, use the move statement.
When a reenter statement is executed, the preproc of the reentered field will not be executed. The postproc of the
Example 1
PROC RELATIONSHIP
if curocc() = 1 and RELATIONSHIP <> 1 then
errmsg("The head of household must be entered on the first row.");
reenter;
endif;
Example 2
PROC DOB_YEAR
if DOB_YEAR = CensusYear and DOB_MONTH > CensusMonth then
errmsg("The date of birth cannot be after the census date. Reenter the month and
year.");
reenter DOB_MONTH;
endif;
See also: Advance Statement, Ask Statement, Move Statement, Skip Statement
SavePartial Function
Format
b = savepartial(『 clear』 );
Description
The savepartial function saves the current case as a partially added, modified or verified case. It is useful in a large
data entry application to perform intermediate backups. It can also be used to automatically perform a partial save when
the keyer stops the case before completing it. You can also have CSEntry automatically partially save cases.
The function can be coded only in the preproc or postproc of a field. The function cannot be used before all ID values for
the case have been entered. You can get the field name where the last partial save occurred by coding
getsymbol(savepartial).
An optional argument, clear, results in the removal of visual values before the data is saved. This behavior is the same
as would happen if the case was completed in a system controlled application. Generally you will want to maintain
skipped-over fields in case an interviewer decides to make changes to the case at a later point, so it is rare that you
would use this argument.
Return Value
The function returns a logical value of 1 (true) if the case was successfully saved as a partial case and 0 (false)
otherwise.
Example
PROC LINE_NUMBER
preproc
savepartial();
Description:
The selcase function allows a data entry operator to select and load a case from an external file. This function can
only be used in data entry applications. It searches the index of the external file named by "ext-dict-name" for all
cases whose keys match the criterion specified by "alphanumeric-expression." If two or more matching keys are
found, they will be presented to the entry operator in a display box. Using a highlight bar, the operator can select one
of the keys. The case identified by that key is then read into memory.
The "offset" tells CSPro the number of characters, beginning with the first character of the ID items for the external
file, that should be suppressed if multiple matches are found.
• alphanumeric-expression: can be a literal or a CSPro expression. The matching is case sensitive. If an empty string
is passed, all cases in the external file are returned.
• include: tells CSPro to list additional items from the specified dictionary in the display box.
• where: applies the logical expression to all cases returned by the selcase statement. The resulting display box will
only show cases in which the logical expression evaluated to true (returning a nonzero value).
It is possible, by specifying dictionary access parameters after the dictionary name, to limit the cases that the selcase
function processes.
Return value:
The function returns a logical value of true (1) if a case is found or selected by the entry operator and false (0)
otherwise.
Example 1:
OK = selcase(LOOKUP,concat(PROV, DIST));
Example 2:
OK = selcase(OCCUPATION_DICT, "Plantation");
will return cases whose key begins with "Plantation."
Example 3:
OK = selcase(OCCUPATION_DICT, "Plantation")
include(OCCUPATION_CODE, OCCUPATION_SUMMARY_LEVEL);
Format 1:
set attributes (field-1[, field-2, ..., field-N])
display | visible | autoskip | return | protect | hidden | native;
Format 2:
set attributes(field-1[, field-2, ..., field-N])
assisted on|off [(question, responses)];
One or more dictionary items can be named in the field list. If the dictionary name is used, all the fields in the
dictionary are affected. If a form name is used, all the fields on the form are affected.
In Format 1, only one attribute setting can be used in any single set attributes statement. The options are as follows:
visible If a field is hidden, its value will now be visible; if it was already visible, the setting
has no effect
autoskip This is equivalent to leaving the statically-set field property "Use Enter Key"
unchecked. If this option is used, the cursor automatically advances to the next field,
after the maximum number of characters have been entered. This option will override
any statically-set field property settings.
return This is equivalent to checking the statically-set field property "Use Enter Key." If this
option is used, the operator must press the <Enter> key to advance from the listed
field(s). This option will override any statically-set field property settings.
protect This is identical to the statically-set field property "protected." If a field is set to
'protect', the operator will not be able to enter it. If the field was already statically set
Pa ge 513 of 684 Da ta Entry Sta tements a nd Func ons
to "protected," the setting has no effect.
hidden If a field is visible, its value will now be hidden from view; if it was already hidden, the
setting has no effect.
native Regardless of what settings have been made dynamically in the program, if a field is
set to native, all field settings will revert to their initial, statically-set properties.
Example of Format 1:
set attributes (total_HH_income) protect;
Description of Format 2:
The set attributes statement with the assisted keyword switches on or off a popup responses box during data entry.
The values in the responses box come from the first value set in the data dictionary for that field. The user can either
select a response or type a response. This behavior is true in any CSPro data entry application.
One or more dictionary items can be named in the field list. If the dictionary name is used, all the fields in the
dictionary are affected. If a form name is used, all the fields on the form are affected.
By default, the responses are taken from the first value set of the item. You can modify the values and responses
using the function setvalueset.
Description of Format 2 in CAPI applications
This statement has further meaning in CAPI data entry applications. Note that when you create a CAPI data entry
application, the question text for each item is automatically shown during data entry, but NOT the responses box.
There are two ways to make the responses box appear (or disappear):
1. Use this statement in the application's logic. For example:
set attributes(MYDICT) assisted on;
set attributes(MYDICT) assisted off;
set attributes(REL, SEX, EDUC) assisted on;
To show or hide responses for all fields, press Ctrl+K or from the Options menu, select Show Responses (All Fields).
The operator can also move the response box around on the screen using the mouse.
Use the following forms of this command to get the desired behavior in your program logic for CAPI applications:
// show both questions and responses
set attributes(MYDICT) assisted on;
set attributes(MYDICT) assisted on (question, responses);
// show question text but not responses
set attributes(MYDICT) assisted on (question);
< no set attributes command >
// show responses but not question text
set attributes(MYDICT) assisted on (responses);
See also: Change Field Properties, Change Data Entry Options, Introduction to CAPI
Format 2:
set behavior([field-2, ..., field-N]) canenter(notappl|outofrange) off;
Description:
The set behavior canenter statement allows the entry of blanks (notappl) for numeric data items during data entry or
to bypass the system 'Out of Range' message during data entry. You may wish to enter blanks when answers are
missing from the form. You may wish to bypass the system 'Out of Range' message in order to code your own
message.
If no arguments are specified, then the set behavior statement affects all data items from the point where it is executed
onward. To limit its scope, it must be turned on and off at appropriate times. It is possible, however, to specify the
name of a field, group, record, form, or dictionary to narrow the scope of the behavior to only fields that fall within the
specified object.
In operator-controlled applications, notappl defined as a value in the value set for the item usually allows blank to be
accepted. In system-controlled applications the set behavior function must be used to allow blanks even if notappl is in
the value set.
The keywords confirm or noconfirm must be coded when on is used. Confirm means that a message box is
displayed asking if it OK to enter this value. Noconfirm means that no message box is displayed.
Example 1:
set behavior() canenter(notappl) on (noconfirm);
Example 2:
set behavior(AGE,AGE_FIRST_MARRIAGE) canenter(outofrange) on (noconfirm);
set behavior(RELIGION) canenter(notappl) on (noconfirm);
SetCapturePos Function
Feature Upgrade: Starting with CSPro 7.1, you should no longer use this function as it may soon be removed from
CSPro. To replicate the behavior of the function, you can use the setproperty function with the argument
"CapturePosX" or "CapturePosY".
Format
i = setcapturepos(symbol, x_coord, y_coord);
Description
The setcapturepos function modifies the capture position for the extended control associated with a field on a form. The
symbol_name may refer to a field, group, form, or an entire dictionary. The two numeric expressions x_coord and
y_coord refer to the x (horizontal) and y (vertical) positions of the top-left corner of the extended control window. This
position is relative to the form window, not the whole CSEntry window, with (0,0) referring to the top-left corner of the
form. If the position given for a field is greater than the size of the form, CSEntry will ignore the argument when displaying
the extended control window. This function is useful for CAPI applications in which a part of the screen is left blank
specifically for extended control windows. Calls to this function are ignored when executed on mobile devices.
Return Value
The function returns the number of items whose capture positions were successfully changed.
Example
Pa ge 515 of 684 Da ta Entry Sta tements a nd Func ons
setcapturepos(CAPI_DICT, 700, 20); // draw all windows on the right side of the form
SetCaptureType Function
Feature Upgrade: Starting with CSPro 7.1, you should no longer use this function as it may soon be removed from
CSPro. To replicate the behavior of the function, you can use the setproperty function with the argument
"CaptureType" or "CaptureDateFormat".
Format
i = setcapturetype(symbol_name, capture_type『 , date_format』 );
Description
The setcapturetype function modifies the capture type currently associated with a field on a form. The symbol_name
may refer to a field, group, form, or an entire dictionary. The numeric expression capture_type refers to a capture type
code, listed in the table below. If specifying a date, the format can be specified with the optional string expression
date_format.
Return Value
The function returns the number of items that were successfully changed to the specified capture type. An item's capture
type can be successfully changed if the current value set associated with the item supports the requested capture type.
For instance, if an item's value set contains ranges, the capture type cannot be changed to a radio button. The success
of changing a field to a date picker does not depend on the date format, so if the date format is not valid for the field the
capture type will be changed to the default date format for the item.
Example
setcapturetype(DATE_OF_BIRTH, 5, "YYYYMMDD");
errmsg("%d fields changed to radio buttons on the housing form",setcapturetype(HOUSING_FORM,1));
SetCaseLabel Function
Format
b = setcaselabel(dictionary_name, case_label);
Description
Return Value
The function returns 1 (true) if successful or 0 (false) if the function failed.
Example
PROC HEAD_HH_NAME
setcaselabel(CENSUS_DICT, maketext("Household %d headed by %s", HH_NUMBER,
strip(HEAD_HH_NAME)));
Description
CSPro data entry applications have two operating modes: operator- and system-controlled modes. Each mode has a
different style error message box. System-controlled mode uses a standard Windows dialog box whereas operator-
controlled mode uses a customized yellow box. The set errmsg statement allows the user to choose which kind of box
to use. The first argument, message_style, is one of the following:
default: Reset the box to the default style for the data entry mode.
operator: Use the operator-controlled box.
system: Use the system-controlled box.
For operator-controlled message boxes, the optional string expression close_message allows you to modify the text that
appears beneath the error message, which by default is "Press F8 to clear." Another optional argument, the numeric
expression close_keystroke, modifies the keystroke required to close the box.
Examples
set errmsg(system); // use the system-controlled style
set errmsg(operator); // use the operator-controlled style
set errmsg(default); // use the default style for the data entry mode
// defines the error message and defaults to F8 to close the box
set errmsg(operator,"Appuyez sur F8 pour fermer");
// 67 (C) is the key that will close the box
set errmsg(operator,"Appuyez sur C pour fermer",67);
// 67 (C) is the key that will close the box
// but the default error message text is used
set errmsg(operator,67);
Resetting Format
b = setfont(ErrMsg ‖ ValueSets ‖ NumberPad UserBar ‖ Notes ‖ All, default);
Description
In a data entry application, the setfont function allows you to modify the font that CSPro uses to display text in:
ErrMsg: The font that appears in boxes generated by the errmsg function.
ValueSets: The font that appears in extended controls (other than the number pad).
NumberPad: The font that is used by the number pad.
UserBar: The font that appears in buttons and text strings on the userbar.
Notes: The font that appears when a user edits a field note.
All: The font is changed for all of the four above entities.
The font is changed to the font indicated by the string expression font_name and the numeric expression font_size, or if
the resetting format is used, the font is restored to CSPro's default font selection. The user must ensure that the desired
font is installed on the machine that will run the data entry application. Optional bold and italics markers may be
indicated. Calls to this function are ignored when executed on mobile devices.
Return Value
The function returns 1 (true) if the font(s) were changed successfully, and 0 (false) otherwise.
Example
PROC AGE
if AGE > 95 then
minorWarning(maketext("Age (%d) is over 95, set to 95", AGE));
AGE = 95;
elseif not AGE in 12:95 then
minorError(maketext("Age (%d) is invalid for this survey, reenter", AGE));
reenter;
endif;
SetOperatorId Function
Format
b = setoperatorid(operator_id);
Description
The setoperatorid function sets the operator ID to the string expression operator_id for the current data entry session.
The operator ID is generally entered by the operator or passed as a parameter in the PFF file, but for some programs,
such as menu systems, it may be useful to set the operator ID in logic.
Return Value
The function returns 1 (true) if successful or 0 (false) if the function failed or was executed in batch mode.
Example
Show Function
Format
i = show(『 heading,』 group_name,item_list『 ,where condition』 『 ,title(text_list)』 );
Description
The show function displays items from a roster in the form of a menu that looks like a roster. The group_name specifies
which group contains the items to be displayed, one or more of which are specified in the item_list. This function is
similar to the show and accept functions, and is useful as a menu or simply as a way to show roster values in another
part of the questionnaire. An optional string expression, heading, defines the title of the grid. An optional where clause,
with a specified condition, allows you to only specify that only some occurrences of the roster should be displayed. If
you want to override the column headings, you can specify them with string expressions in text_title.
Return Value
The function returns the number of the item selected: 1 for the first item, 2 for the second item, etc. This is the number of
the item on the display, not in the roster. The value 0 is returned if the escape key (or back button) is pressed and none
of the options is chosen.
Example 1
if RELATIONSHIP = 2 and SEX = SEX(ptrHead) then
show("Sex of spouse is the same as the sex of head!",PERSON_REC,NAME,RELATIONSHIP,SEX,AGE,
title("Name","Rel","Sex","Age"));
endif;
Example 2
if ctrHead > 1 then // more than 1 head in the household
errmsg("More than 1 head of household (count=%d)",ctrHead);
numeric headToDelete = show(PERSON_REC,NAME,RELATIONSHIP,SEX,AGE,WHERE RELATIONSHIP = 1);
// ...
endif;
ShowArray Function
Format
i = showarray(『 heading,』 array_name『 ,row_count』 『 ,column_count』 『 ,title(text_list)』 );
Description
The showarray function, similar to the show and accept functions, displays items from an array in a grid. An optional
string expression, heading, defines the title of the grid, which contains the items in array_name. The function is useful
as a menu or simply as a way to show values that are relevant to data collection. If the optional numeric expressions
row_count and column_count are not specified, then the function will parse the array, determining the size of the menu
based on where the first blank alphanumeric element is found. If you want to override the column headings, you can
specify them in with string expressions in text_title.
Return Value
The function returns the number of the item selected: 1 for the first item, 2 for the second item, etc. The value 0 is
returned if the escape key (or back button) is pressed and none of the options is chosen.
Example 1
PROC GLOBAL
array string unitedNationsCountryCodes(5,3) =
"004", "Afghanistan", "AFG",
"248", "Åland Islands", "ALA",
"008", "Albania", "ALB",
"012", "Algeria", "DZA",
"016", "American Samoa", "ASM",
// ...
;
PROC EXAMPLE
numeric countrySelection = showarray("Select a Country",unitedNationsCountryCodes,
title("Numerical Code","Country or Area Name","ISO ALPHA-3 Code"));
Example 2
numeric countrySelection = showarray(unitedNationsCountryCodes,3,2);
ShowOcc Function
Format
b = showocc(roster_name ‖ form_name『 (occurrence_number)』 『 ,show_condition』 );
Description
The showocc function shows an occurrence of a roster or form in the mobile CSEntry case tree. It will also show the row
on a roster on desktop CSEntry. If no numeric occurrence_number is supplied, then the current occurrence for the
roster or form will be shown.
By default, all occurrences of a roster and form are shown by CSEntry, but occurrences may be hidden with the hideocc
function. Alternatively, an optional numeric argument show_condition, allows the showocc function to conditionally hide
occurrences, based on whether the condition is true. The occurrence will be hidden if show_condition is 0; the
occurrence will be shown for any other value.
Return Value
The function returns a logical value 1 (true) if successful and 0 (false) otherwise.
Example
PROC MYSURVEY_QUEST
preproc
// fill in and potentially hide occurrence labels for the household roster
do numeric ctr = 1 while ctr <= totocc(PERSON_ROSTER)
showocc(PERSON_ROSTER(ctr),KEEP_PERSON(ctr));
setocclabel(PERSON_ROSTER(ctr),strip(NAME(ctr)));
enddo;
Skip Statement
Formats
skip 『 to』 field_name;
skip 『 to』 next 『 multiply_occurring_field_name』 ;
skip;
The next keyword skips to the next occurrence of multiply_occurring_field_name. If the target field is on the same
repeating form or roster as the current field, control will move to the next occurrence of the target field. If not, control will
move to the first occurrence of the target field. Occurrence numbers cannot be used with the next keyword.
When using the next keyword without the optional target field, control passes to the next occurrence of the current
repeating form or roster, with the target field as the first field in that form or roster. This is a useful way to skip to the
beginning of the next occurrence.
If using skip without specifying a target field, then control passes to the next field in the application. This targetless skip
can only occur in the preproc of a field, roster, form, or user-defined function. CSPro will automatically figure out what
the target field should be, as it does with an ask statement.
The target field can be located in any record at the same level as the current field, but it cannot be located at a different
level. The field must be later on the path than the current field, meaning that it is a field that has not yet been entered. If
the field has already been entered, an error message will be displayed during data entry. If you do not know whether the
field is earlier in the data path, use the move statement.
When a skip statement is executed, the preproc of the target field will be executed but none of the statements between
the skip statement and the preproc of the target field will be executed. Skipped fields are assigned the special value of
notappl.
Note that the skip statement behaves differently from the advance statement, skipping past some number of fields,
rather than moving over the fields.
Example 1
PROC ATTENDED_SCHOOL_YN
if ATTENDED_SCHOOL_YN = 2 then
skip to REASON_NEVER_ATTEND;
endif;
Example 2
PROC CHILDREN_EVER_BORN
preproc
if SEX = 1 or AGE < 12 then
skip to next;
endif;
Example 3
PROC MARRIAGE_AGE
preproc
if MARITAL_STATUS = 1 then
skip;
endif;
See also: Advance Statement, Ask Statement, Move Statement, Reenter Statement
PROC USERBAR_FF
preproc
userbar(clear);
userbar(add text,"Enter two values and select an operation:");
leftOperatorRID = userbar(add field," ");
rightOperatorRID = userbar(add field," ");
userbar(add button,"Addition",performMathOperation(ADD_OP));
userbar(add button,"Subtraction",performMathOperation(SUBTRACT_OP));
userbar(add button,"Multiplication",performMathOperation(MULT_OP));
userbar(add button,"Division",performMathOperation(DIVIDE_OP));
userbar(add button,"Modulo",performMathOperation(MOD_OP));
userbar(add button,"Exponentiation",performMathOperation(EXP_OP));
userbar(add spacing,50);
resultsRID = userbar(add text," ");
userbar(show);
VisualValue Function
Format
d = visualvalue(field_name);
Description
The visualvalue function is used to access the contents of a data item before the data item has been entered. When
an item is on a form as a field, CSPro will give its value in logic as notappl until the item has been entered. If you know
that the field has been previously filled, perhaps because the operator is in modify mode or is resuming from a partial
save, you can use this function to get the value stored in the item that is on the form and represented as field_name.
This function is for use only with numeric fields. Refering to alphanumeric fields will always result in the evaluation of the
field's visual value.
Return Value
The function returns the numeric value of the field.
Example
PROC INTERVIEW_START_DATE
preproc
// do not overwrite the start date if modifying the case
if visualvalue(INTERVIEW_START_DATE) = notappl then
INTERVIEW_START_DATE = sysdate("YYYYMMDD");
endif;
Description
The endcase statement ends batch editing for the current questionnaire (case). All remaining procedures beyond where
the statement is executed will be skipped. The statement is similar to the skip case command, but in the latter
command the case is not saved to the output file. When using endcase the case will be saved to the output file.
Example
if HHTYPE = 2 then
endcase;
endif;
Export Statement
Format:
Export to file-name
[rec_name(rec-name | alpha-exp)]
[rec_type(rec-name | alpha-exp)]
[case_id ([item-list])]
rec-item-list
Description:
The export statement writes a record to an export file. Export statements can only be coded in level procedures.
In the to phase, the file-name is a name declared in the file statement in PROC GLOBAL.
The rec_name, rec_type, and case_id phrases can each be coded only once but can be coded in any order. They
all must be coded before the rec-item-list. The order in which rec_type and case_id are coded determines the order
of output of the record type and case ids in the exported record.
The rec_name phrase is optional and only used when data are exported in CSPro format. When coded the
rec_name phase is used to give a label and name to the record type to the CSPro data dictionary created by the
export statement. If a rec-name is coded, then the label and name from that record name in the input data dictionary
is used for the label and name of the record type created in the exported data dictionary. If an alpha-exp is coded,
then the label of the record type in the exported data dictionary is the result of the alphanumeric expression and the
name is derived from the label. If rec_name is not coded, the labels and record names in CSPro will be
RECORD001, RECORD002, etc.
The rec_type phrase is optional. When coded it is used to place a record type on the exported data record. If a rec-
name is coded, then the record type value from the record name in the input data dictionary is placed on the
exported data file. If an alpha-exp is coded, then the value of the expression is placed on the exported data file.
The case_id phrase is optional. When coded it is used to place case id items on the exported data record. If
case_id() is coded then ALL the case ids from the level in which the export statement is coded are placed on the
PROC CENSUS_2000_DICTIONARY_FF
See also: Set Behavior Export Statement, File Statement, SetFile Function, Export Data Tool
GetDeck Function
Format:
f = getdeck(array-name[,override-dim1,override-dim2,override-dim3]);
Description:
The getdeck function returns the value in the DeckArray hotdeck using the current values in the items identified by the
value sets used in the declaration of the DeckArray. The function automatically recodes the values and accesses the
proper cell in the hotdeck. If any of the dimensions of the DeckArray are not value set dimensions, you must specify
the numeric index when calling the function.
Return value:
The function returns the value in the hotdeck or DEFAULT in the case that the values supplied are not valid entries in
the value sets and thus could not be recoded to a proper cell in the hotdeck.
Example:
array education_HD_SexAge(SEX_VS,AGE_FOR_EDUCATION_HD_VS) save;
...
PROC EDUCATION
EDUCATION = getdeck(education_HD_SexAge); // use current values for sex and age
EDUCATION = getdeck(education_HD_SexAge,1); // override sex only
EDUCATION = getdeck(education_HD_SexAge,,28); // override age only
EDUCATION = getdeck(education_HD_SexAge,1,28); // override both value and age
PutDeck Function
Format:
Pa ge 529 of 684 Ba tch Edit Sta tements
f = putdeck(array-name,numeric-expression[,override-dim1,override-dim2,override-
dim3]);
Description:
The putdeck function updates the value in the DeckArray hotdeck using the current values in the items identified by
the value sets used in the declaration of the DeckArray. The function automatically recodes the values and accesses
the proper cell in the hotdeck, where it places the value of the numeric-expression. If any of the dimensions of the
DeckArray are not value set dimensions, you must specify the numeric index when calling the function.
If a (+) is specified after the array-name, the "leftover" rows for the hotdeck will also be updated with the value. See the
leftover rows page for more information.
Return value:
The function returns 1 if successful or DEFAULT in the case that the values supplied are not valid entries in the value
sets and thus could not be recoded to a proper cell in the hotdeck.
Example:
array education_HD_SexAge(SEX_VS,AGE_FOR_EDUCATION_HD_VS) save;
...
PROC EDUCATION
putdeck(education_HD_SexAge,EDUCATION); // use current values for sex and age
putdeck(education_HD_SexAge,EDUCATION,1); // override sex only
putdeck(education_HD_SexAge,EDUCATION,,28); // override age only
putdeck(education_HD_SexAge,EDUCATION,1,28); // override both value and age
Description:
The set behavior export statement is coded before the first export statement.
The model is required. It is one of the keywords SPSS, SAS, Stata, R, All, CSPro, TabDelim, CommaDelim, or
SemiColonDelim, indicating the type of file being exported.
Item-type is optional. It is one of the keywords ItemOnly, SubitemOnly, or ItemSubitem, indicating how subitems
and their parent item are handled when entire records are exported. If not coded, SubitemOnly is assumed.
Text-encoding is optional. It is either ANSI or Unicode, and specifies the text encoding of the exported data file and
any description or script files. If not coded, ANSI is assumed. The Unicode option will output UTF-8 files.
Decimal-mark is optional. If specified with the keyword CommaDecimal, decimal marks will use commas instead of
periods in the exported data file.
See also: Export Statement
SetOutput Function
Format:
b = setoutput(alpha-exp);
Description:
The setoutput function redirects the output cases of a batch application to the data file specified in the alpha-exp. All
Description
The skip case statement ends batch edit processing of the current case and skips to the next case in the input file. If
an output file is specified, the skipped case is not saved in the output file.
Example
// remove any vacant households
if totocc(PERSON_REC) = 0 then
skip case;
endif;
Description
The abs function returns the absolute value of the numeric expression value.
Return Value
The function returns the absolute value. If the value of the numeric expression is a special value, the function returns the
special value.
Example
if abs(X - target) < abs(Y - target) then
// ...
endif;
CMCode Function
Format
i = cmcode(month, year);
Description
The cmcode function returns the century month code (CMC) of the given date using the month and year numeric
expressions. The CMC is the number of months since January 1900. (The CMC for January 1900 is 1.) It is calculated by
multiplying the number of years between the argument year and 1900 by twelve, then adding the value of argument
month.
The function returns the value 9999 if the month is less than one or greater than 12, or if either the month or year are
equal to a special value. The function accepts either 2- or 4-digit years. If a 2-digit year is used, the function assumes
that the year is in the 20th century (i.e., 19xx). Four-digit years can be used for years in the 20th or 21st century.
Return Value
The function returns the CMC of the date.
Example 1
XMONTH = 6;
XYEAR = 82;
DATE = cmcode(XMONTH, XYEAR);
The value of DATE for the given arguments (June 1982) would be ( 82 * 12 ) + 6 = 990.
See also: DateAdd Function, DateDiff Function, DateValid Function, SysDate Function
CountNonSpecial Function
Format
i = countnonspecial(item_name ‖ record_name ‖ array_name ‖ numeric_expression『 ,...』 );
Description
The function countnonspecial counts the number of non-special values within the passed arguments. Special
arguments include missing, refused, notappl, and default. The function can receive multiple numeric arguments,
including items, records, arrays, and numeric expressions. This function can greatly simplify some programming tasks;
for example, the following two lines of code are the same:
if countnonspecial(RELATIONSHIP,SEX,AGE) = 3 then
if not special(RELATIONSHIP) and not special(SEX) and not special(AGE) then
If an array is passed, the function will count the non-special values among all the values in the array. For a multiply
occurring item or record, to count the non-special values for all the items or records instead of just the current
occurrence, include (*) after the item or record name.
Example 1
PROC GLOBAL
numeric tempVal = notappl;
array tempArray(2,3) = 1 2 3 notappl missing default;
PROC EXAMPLE
countnonspecial(5,tempVal,default,28 + 31,3 / 0); // returns 2
tempVal = 0.123456789;
countnonspecial(5,tempVal,default,28 + 31,3 / 0); // returns 3
countnonspecial(5,tempVal,default,28 + 31,3 / 0,tempArray); // returns 6
tempArray(2,1) = 0;
countnonspecial(5,tempVal,default,28 + 31,3 / 0,tempArray); // returns 7
Example 2
Example 3
numeric numDefinedValues = countnonspecial(SEX(*),AGE(*));
numeric numPossibleValues = 2 * totocc(POPULATION);
errmsg("%d% of sex and age values are missing",100 * ( 1 - ( numDefinedValues /
numPossibleValues ) ));
Exp Function
Format
d = exp(numeric_expression);
Description
The exp function raises the value of e (2.7182818...) to the power given by numeric_expression. This value e is called
Napier's constant or Euler's number and is the basis of natural logarithms.
Return Value
The function returns a decimal number. If the value of numeric_expression is a special value, the function returns that
value.
Example
X = exp(Y);
High Function
Format
d = high(number1『 ,number2...numberN』 );
Description
The high function returns the maximum (highest) value in a group of numbers represented by numeric expressions
number1 ... numberN. The function ignores special values.
Return Value
The function returns the highest value, or default in the case that no valid values were passed.
Example
Inc Function
Format
i = inc(numeric_item『 ,increment_value』 );
Description
The inc function increments numeric_item, which is either a dictionary item or a numeric variable. If the optional
numeric expression increment_value is present, then the value of that expression is added to numeric_item. If no
expression is present, 1 is added to numeric_item. The increment value can be negative or nonnegative. inc(X) is
essentially shorthand for X = X + 1;
Return Value
The function returns the sum of the numeric item and the increment value.
Example
X = 5;
inc(X);
// X is 6 after X = X + 1;
inc(X,4);
// X is 10 after X = X + 4;
inc(X,-5);
// X is 5 after X = X + (-5);
X = 5 + inc(X,inc(X));
// X is 17 after X = X + 1; X = X + X; X = 5 + X;
Int Function
Format
i = int(numeric_expression);
Description
The int function returns the integer portion of the decimal value numeric_expression.
Return Value
The function returns an integer value. If the value of numeric_expression is a special value, the function returns that
value.
Example
X = int(5 / 3); // X will be 1
Log Function
Format
d = log(numeric_expression);
Description
The log function calculates the base-10 logarithm of numeric_expression. The number numeric_expression should be
a positive value.
Return Value
The function returns a decimal number logarithm. If the value of the number is a special value, the function returns the
special value given. If the value of the number is negative, the function returns default.
Example
numeric incomeLog = log(INCOME);
Low Function
Format
d = low(number1『 ,number2...numberN』 );
Description
The low function returns the minimum (lowest) value in a group of numbers represented by numeric expressions
number1 ... numberN. The function ignores special values.
Return Value
The function returns the lowest value, or default in the case that no valid values were passed.
Example
errmsg("Lowest value is %f",low(50,-123.45,1982.0605,20)); // displays -123.45
errmsg("Highest value is %f",high(50,-123.45,1982.0605,20)); // displays 1982.0605
Random Function
Format
i = random(min_value, max_value);
Description
The random function returns a uniformly distributed random number between the number expressions min_value and
Pa ge 536 of 684 Numeric Func ons
max_value inclusive.
By default the sequence of random numbers generated by repeated calls to random will always be the same. You can
use the seed function to generate a different sequence.
Return Value
The function returns an integer random value. The function will return a value of default if min_value is greater than
max_value or if either value is equal to one of the special values.
Example
NUMBER = random(1, 100);
See also: RandomIn Function, ValueSet.Randomize Function, Seed Function, UUID Function
RandomIn Function
Format
d = randomin(in_list);
Description
The randomin function returns a random number from a grouping of numeric values expressed as an in list. You can use
the seed function to initialize the random number generation. A non-integer can appear as part of the in list, but if it is
part of a range, both the low and high values of the range will be converted to integers and the function will return a
uniformly distributed random integer in that range.
Return Value
The function returns a random value from the values of the in list. The function will return default if there were no
applicable values to construct a group of valid numbers from which to pick a random number. If a value appears more
than once in the in list, it will have a higher probability of being selected by the function.
Examples
errmsg("Random tribe code: %d", randomin(TRIBE_VS1));
errmsg("Non-continuous random number: %d", randomin(-100:-50, 50:100, 999));
errmsg("After many calls, 1.23 will be selected 75% of the time: %d", randomin(1.23, 1.23, 1.23,
8));
errmsg("Random month: %d", randomin(1:12)); // same as random(1, 12)
Round Function
Format
i = round(numeric_expression);
Description
Return Value
The function returns the rounded value. If the value of the numeric expression is a special value, the function returns the
special value.
Example
// round a price to the nearest nickel (five cents)
numeric roundedPrice = round(PRICE * 20) / 20;
Seed Function
Format
b = seed(numeric_expression);
Description
The seed function is used to determine the first value generated by the random function. If you want to reproduce a series
of random numbers, then for best results, the numeric_expression should be set to a prime number, such as 1009.
However, if you want different random numbers for each run of your program, the systime and timestamp functions
return good seed values.
Versions of CSPro 6.0 and greater will return different random numbers, given the same seed value, than earlier versions
of CSPro. If you expect a certain sequence to the random numbers and want to match a previously recorded sequence,
then you will want to use an old version of CSPro to run your application.
Return Value
The function returns a logical value of 1 (true) if the seeding is successful, and 0 (false) otherwise.
Example
seed(systime());
numeric randomAge = random(1,99);
Sqrt Function
Format
d = sqrt(numeric_expression);
Description
The sqrt function returns the square root of numeric_expression. The number numeric_expression should be a
positive value.
Return Value
Example
numeric incomeSqrt = sqrt(INCOME);
Format:
set behavior() specialvalues(zero) on | off;
Description:
The set behavior specialvalues statement allows special values to be treated as zero (0) values during arithmetic
operations, including addition, subtraction, multiplication, division, and modulo. By default this behavior is disabled.
The set behavior statement affects all numeric data items from the point where it is executed onward. To limit its
scope, it must be turned on and off at appropriate times.
Example:
PROC GLOBAL
numeric var1,var2,var3;
numeric result;
PROC SUMMATION
var1 = 5;
var2 = 10;
var3 = default;
set behavior() specialvalues(zero) on;
result = var1 + var2 + var3; { result will be 15 }
set behavior() specialvalues(zero) off;
result = var1 + var2 + var3; { result will be DEFAULT }
Description
The compare function compares the two strings expressions string1 and string2 character by character to determine the
alphabetical (collating sequence) order of the strings. If string1 and string2 are of different lengths, the function will pad
the shorter string with blanks to carry out the comparison.
Return Value
The function returns an integer value of:
Example
if compare(thisCaseKey,previousCaseKey) = -1 then
// ...
endif;
Direct string comparisons can also be made. For example, the following code is permissible:
Concat Function
Format
s = concat(string1, string2『 , ..., stringN』 );
Description
The concat function concatenates the values of two or more string expressions (string1 + string2 + ... + stringN). The
strings can be alphanumeric items, text strings, or functions that return strings. You can also use the + operator to
concatenate strings.
Return Value
The function returns the concatenated string.
Example
Pa ge 540 of 684 String Func ons
FIRST_NAME = "John";
LAST_NAME = "Henry";
string full_name = concat(FIRST_NAME, " ", LAST_NAME); // full_name is: John Henry
The + operator is a convenient way to build strings and can be used along with other functions that return alphanumeric
values.
Example
string message;
do numeric counter = 1 while counter <= 13 by 2
message = message + edit("99 ",counter);
enddo;
// message is now: "01 03 05 07 09 11 13"
Edit Function
Format:
s = edit(edit-pattern,numeric-expression);
Description:
The edit function converts a number to a character string defined by the given "edit pattern". The "edit pattern" is a
string containing "Z"s or "9"s (i.e., "9999" or "ZZ9.99"). Both "9" and "Z" represent a digit.
9 display a digit
Z display a digit, but if it is a leading zero, display a blank
. display the decimal character
, display the thousands separator character
Any other character will be displayed as itself.
Return value:
The function returns a string derived from the "numeric-expression" argument.
Example 1:
X = 87;
A1 = edit("ZZZ9",X); yields A1 = " 87"
A2 = edit("9999",X); yields A2 = "0087"
A3 = edit("Z999",X); yields A3 = " 087"
Example 2:
Y = 0;
Example 3:
A = edit("99:99:99",sysdate());
Example 4:
A = edit("99/99/99",sysdate("DDMMYY"));
Example 5:
A = edit("ZZZ,ZZZ,ZZ9",INCOME);
GetBuffer Function
Format
s = getbuffer(item_name);
Description
The getbuffer function returns a string containing a data item's contents. The item_name can refer to either a numeric
or alphanumeric item.
The function is especially useful when a numeric data item in a data file contains a non-numeric value, such as "*", "-", or
"a". You cannot test the contents of the numeric data item for alphanumeric values because CSPro stores default as
the value of any numeric data item which contains non-numeric values. Therefore, to find out what non-numeric value(s)
exist in a data item, you would use this function to return its contents in the form of a string.
Return Value
The function returns a string containing the item's contents.
Example
if special(AGE) then
errmsg("Person's age is invalid, AGE = %s",getbuffer(AGE));
endif;
Length Function
Format
i = length(string_expression);
Description
The function length calculates the length of an alphanumeric string_expression. If the string is a dictionary item, the
value returned is the length of the item. If it the string is the result of a function, the value returned is the length of the
string returned by the function.
Return Value
The function returns the length of the string as an integer value. If using an alpha variable or a dictionary item, you may
Example
alpha (30) NAME = "John Henry";
length(NAME); // returns 30
length(strip(NAME)); // returns 10
Format
i = length(list_name);
i = length(array_name『 , dimension』 );
Examples
array string usmca_countries(3) = "Canada", "Mexico", "United States";
length(usmca_countries); // returns 3;
// ...
array AgeSexDeckArray(AGE_VS, SEX_VS);
do numeric age_counter = 1 while age_counter <= length(AgeSexDeckArray, 1)
do numeric sex_counter = 1 while sex_counter <= length(AgeSexDeckArray, 2)
// ...
enddo;
enddo;
MakeText Function
Format:
s = maketext(string-exp[,p1[,p2[,...,pn]]]);
Example 2::
Value = %d 23456
23456
%10d 23456
%-10d 23456
%010d 0000023456
%+10d +23456
%+010d +000023456
%f 23456.000000
Value = %f 12.567
12.567
%10.3f 12.567
%-10.3f 12.567
%10.2f 12.57
%10.5f 12.56700
%010.3f 000012.567
%+10.3f +12.567
%+010.3f +00012.567
%d 12
Value = %s abcdef
"abcdef"
%10s abcdef
%-10s abcdef
%10.3s abc
%- abc
10.3s
Description
Several functions take an argument that consists of a message and then an optional number of arguments that are
dynamically inserted into the text to create a final message. The message is either a string expression or a numeric
message number that contains the message text.
The message can contain optional arguments, argument1 to argumentN, that will be inserted into the message. Each
argument, denoted in the message text with a percent sign, is sequentially inserted into the message. Arguments can
be numeric or string expressions, but the type of the argument must match the type of the receiving field in the message
text. There are seven formatters:
Formatter Result
Pa ge 544 of 684 String Func ons
"%d" Inserts a number that is displayed as an integer.
"%f" Inserts a number that is displayed as a decimal value (with six decimal places shown by default).
"%s" Inserts a text string.
"%v" Inserts a variable formatted intelligently (see below).
"%l" Inserts a variable's value label (see below).
"%p" Inserts the name of the current procedure, which can be useful for debugging. No argument should be
supplied if using this option.
"%c" Inserts the first character of a text string, or no characters if the string is blank.
The "%l" formatter looks up the value set label associated with the variable's current value. If found, it displays that label
(in the same way that the getvaluelabel function does). If no value set label is associated with the value, then the "%l"
formatter behaves as if it were the "%v" formatter.
If the number is positive, as in "%+5d", the text is right justified to the size of the field (e.g., " 9").
If the number is negative, as in "%-5d", the text is left justified to the size of the field (e.g., "9 ").
For numbers, if a leading zero proceeds the number, as in "%05d", the text is right justified to the size of the field
but instead of being padded with spaces, it is padded with zeros (e.g., "00009").
For numbers, if the number is preceded by a plus sign, as in "%+d", the sign of the number is always displayed
(e.g., "+9").
Numbers are never truncated. Text strings are not truncated unless used as in "%0.5s", where the second part,
".5", indicates the maximum number of characters.
Example
See also: ErrMsg Function, FileWrite Function, File.Write Function, LogText Function, MakeText Function, Trace
Function, Warning Function, Write Function
Pos Function
Format
i = pos(substring, source_string);
Description
The pos function searches for a grouping of characters, represented in a substring, within a source_string. The function
returns the beginning position of the first occurrence of the substring. Both arguments are string expressions, and they
are case sensitive, meaning that "children" is recognized as different from "CHILDREN".
If using an alpha string or dictionary item, be aware that searching for a space character can lead to the function finding
one at the end of your string. You may want to strip alphanumeric variables before searching for spaces.
Return Value
The function returns the position of the first occurrence of the substring, or if the substring is not found, the function
returns 0.
Examples
See also: PosChar Function, StartsWith Function, Replace Function, IsChecked Function
PosChar Function
Format
i = poschar(pattern_string, source_string);
Description
The poschar function searches for a collection of characters, represented in a pattern_string, within a source_string.
The function returns the beginning position of the first occurrence of the pattern string in the source string. Both
arguments are string expressions, and they are case sensitive, meaning that "c" is recognized as different from "C".
Return Value
The function returns the position of the first occurrence of the pattern string, or if no characters from the pattern string are
found, the function returns 0.
Examples
VALUE = poschar("L", "CHILDREN"); // VALUE will be 4; this is where the
// pattern string (the letter "L")
// occurs in the source string ("CHILDREN")
VALUE = poschar("LCN", "CHILDREN"); // VALUE will be 1; of the characters in the
// pattern string, "C" is the first character
// encountered in the source string, and it is
// found at position 1
See also: Pos Function, StartsWith Function, Replace Function, IsChecked Function
RegExMatch Function
Format
i = regexmatch(target_string,regex_string);
Description
The regexmatch function tests whether the target_string matches the regex_string.
Return Value
The function returns an integer value of:
Example
// Matches format xxx-xxx-xxxx
if regexmatch(TELEPHONE_NUMBER, "^([0-9]{3}-){2}[0-9]{4}$") then
// ...
endif;
Replace Function
Format
s = replace(source_text, old_text, new_text);
Description
The replace function looks at source_text and replaces one or more instances of the substring old_text with the value
in new_text. All arguments are string expressions and the replacement is case sensitive, meaning that "c" is
recognized as different from "C".
Return Value
The function returns a new string with the replaced text.
Example 1
string example = "Robert Smith's son's name is Robert, Jr.";
example = replace(example, "Robert", "Bob");
// result: Bob Smith's son's name is Bob, Jr.
Example 2
StartsWith Function
Format
b = startswith(prefix, source);
Description
The startswith function determines whether the string expression source begins with the string expression prefix. It is
the same as checking if:
pos(prefix, source) = 1
In addition to standalone use as a function, startswith can be used when specifying dictionary access parameters.
Return Value
The function returns a logical value of 1 (true) if the source string starts with the prefix and 0 (false) otherwise.
Examples
if startswith("9999", MENU_GEOCODE_SELECTION) then
errmsg("Are you sure that you want to continue in training mode?")
select("Yes", continue, "No", reenter);
endif;
Strip Function
Format
s = strip(string_expression);
Description
The strip function removes trailing blanks from the string_expression. The result of the function is often used as an
argument to other functions such as the length and concat functions.
Pa ge 549 of 684 String Func ons
You will generally not have to use the strip function when working with variable-length strings since they do not have
spaces automatically added to the end of the text to match a fixed width.
Return Value
The function returns a string with no trailing blanks.
Example
PROC GLOBAL
alpha (30) firstName, lastName, fullName;
PROC EXAMPLE
firstName = "John";
lastName = "Henry";
fullName = concat(strip(firstName), " ", strip(lastName));
numeric nameLength = length(strip(fullName));
// fullName is "John Henry "
// nameLength is 10
ToLower Function
Format
s = tolower(string_expression);
Description
The tolower function scans the given string_expression and converts any uppercase letters to lowercase letters.
Return Value
The function returns a string with all uppercase letters converted to lowercase letters.
Example
// each function call will return "hello james!"
X = tolower("hello james!");
Y = tolower("Hello JaMeS!");
Z = tolower("HELLO JAMES!");
ToNumber Function
Format
d = tonumber(string_expression);
Description
Return Value
The function returns a decimal number. If the string begins with a non-numeric character (other than a blank), the function
returns default.
Example
numeric districtParameter = tonumber(sysparm("DISTRICT"));
ToUpper Function
Format
s = toupper(string_expression);
Description
The toupper function scans the given string_expression and converts any lowercase letters to uppercase letters.
Return Value
The function returns a string with all lowercase letters converted to uppercase letters.
Example
// each function call will return "HELLO JAMES!"
X = toupper("hello james!");
Y = toupper("Hello JaMeS!");
Z = toupper("HELLO JAMES!");
Description
The average function returns the average of multiple_item, an item that occurs multiple times. During data entry, the
result of the average calculation depends on where the statement is located. If the function is executed prior to the form
or roster containing the item, it returns default. If it is executed within the form or roster containing the item, it returns
the average up to the current occurrence number. If it is executed after the form or roster containing the item, it returns
the average for all occurrences of the item.
During batch editing, the function returns the average value for all occurrences of the item, regardless of the statement's
placement in the program.
If a where condition is included, the function returns the average of the occurrences for which condition is true.
If the value of an occurrence of the item is a special value, the occurrence will not be included in the calculation. If none
of the occurrences have values other than special values, default is returned.
Return Value
The function returns the decimal value of the average.
Examples
AVG_INCOME = average(INCOME);
AVG_FEMALE_INCOME = average(INCOME where SEX = 2);
Count Function
Format:
i = count(multiple-item [where condition]);
Description:
The count function returns the number of occurrences for a repeating form or roster. During data entry, the occurrence
value is updated after the postproc of the first field within a repeating form or roster is executed. If the count function is
executed prior to the form or roster, it returns 0. If it is executed from a field within the form or roster, it returns the
current occurrence number. If it is executed after the form or roster, it returns the total number of occurrences in the
form or roster.
During batch editing, count always returns the total number of occurrences in the multiply-repeating item/record.
If a where condition is included, the function returns the number of occurrences for which the condition is true. If the
where condition is not included, the count function and the noccurs function return the same result.
Return value:
The function returns an integer count value.
See also: Noccurs Function, Soccurs Function, Totocc Function, Curocc Function, Seek Function, Has Operator
CurOcc Function
Format:
i = curocc([group]);
Description:
The curocc function returns the current occurrence number for a roster, form, or record.
During data entry, you may determine the current occurrence of a roster or form. If the form does not repeat, curocc
will return 1 (a roster must always repeat). The current occurrence can be determined by calling the curocc function
from any field contained within the roster or form. If it is executed prior to the roster or repeating form it names, it
returns 0. If it is invoked after entry of the roster or form has completed, it returns the total number of occurrences
keyed.
During batch editing, you may determine the current occurrence of a record or repeating item. If the curocc function is
used in a procedure not associated with an item on a record then curocc will return the total number of occurrences
found. If the curocc function is used in a procedure associated with an item on the record, it will return the sequence
number of the record in the case. The curocc of a repeating item will be its sequence number within the group.
Return value:
The function returns the occurrence number as an integer.
Examples 1:
PROC RELATION
if curocc(PERSON_REC) = 1 then
if (RELATION <> 1) then
errmsg("First person must be head of household.");
endif;
endif;
See also: Maxocc Function, Totocc Function, Noccurs Function, Soccurs Function, Count Function , If Statement,
Errmsg Function
Delete Function
Format
b = delete(group_name『 (occurrence)』 );
b = delete(group_name, first_occurrence)『 , last_occurrence』 );
Description
The delete function removes incomplete, or otherwise unneeded, records or item occurrences from the current case. It
can be used to remove singly- or multiply-occurring records, although if the record is non-repeating (such as the housing
record in a typical census application), then it must be defined as "Required=No" in the dictionary. This function is
primarily intended for batch applications. It should be used with extreme caution in data entry applications because of
possible conflicts between the operator's actions and the program logic.
The first syntax deletes a single occurrence of the record. When deleting a singly-occurring record, it is not necessary to
specify the occurrence. The second syntax deletes one or more occurrences of a record, starting with the
last_occurrence and deleting up to the first_occurrence. This allows for the mass deletion of record occurrences.
In this example blank person records are deleted from the case. Records following any deleted record are "shifted up" to
cover the vacated area. For example, if you delete the second of four records, the third record shifts to the second
position and the fourth record shifts to the third position.
It is best to delete the records starting with the last record and moving toward the first. Use a subscript that starts at the
last occurrence then is decremented (decreased by 1). In this way you will not need to worry about the records that are
shifting positions.
GetOccLabel Function
Format
s = getocclabel(『 item_name ‖ group_name』 );
Description
The getocclabel function returns the occurrence label for a repeating item or group (i.e., the row label of a roster). If no
item_name or group_name is given, the function returns the occurrence label of the current field, as calculated as
follows:
s = getocclabel($(curocc($)));
In addition to uses in logic, the function can be used as a fill in question text.
Return Value
The function returns a string containing the occurrence label. If an occurrence label for the item or group does not exist,
the function returns a blank string.
Example
Insert Function
Format
b = insert(group_name『 (occurrence)』 );
b = insert(group_name, first_occurrence)『 , last_occurrence』 );
Description
The insert function inserts missing or otherwise needed data records or item occurrences in the current case. It is
primarily intended for use in batch applications. It should be used with extreme caution in data entry applications
because of possible conflicts between the operator's actions and the program logic.
The first syntax inserts a single occurrence of the record. When inserting a singly-occurring record, it is not necessary to
specify the occurrence. The second syntax inserts one or more occurrences of a record, starting with the
first_occurrence and inserting up to the last_occurrence. This allows for the mass inserts of record occurrences.
Return Value
The function returns a logical value of 1 (true) if successful and 0 (false) otherwise.
It makes no difference if the population record has been defined in the dictionary as required or not. What is important is
that it was defined as a multiply-occurring record.
if totocc(HOUSING_REC) = 0 then
insert(HOUSING_REC); // note the absence of a subscript
endif;
To accomplish this, the housing record must be set to "Required=No" in the dictionary. You cannot use this function for
a singly-occurring record when the record is required.
Description:
The max function returns the maximum value of an item that occurs multiple times. During batch editing, if the values
of the items are not changed, the result of the maximum calculation is the same, no matter where the function is
located.
During data entry, the result of the maximum calculation depends on where the statement is located. If the max
function is executed prior to the form or roster containing the item, it returns DEFAULT. If it is executed within the form
or roster containing the item, it returns the maximum value up to the current occurrence number. If it is executed after
the form or roster containing the item, it returns the maximum value for all occurrences of the item.
During batch editing, max always returns the maximum value for all occurrences of the item.
If a where condition is included, the function returns the maximum value of the occurrences for which the condition is
true.
If the value of an occurrence of the item is a special value (DEFAULT, MISSING, REFUSED, or NOTAPPL), the
occurrence will not be included in the calculation. If none of the occurrences have values other than special values,
DEFAULT is returned.
Return value:
The function returns a decimal value.
Examples:
MAX_INCOME = max(INCOME);
MAX_FEMALE_INCOME = max(INCOME where SEX = 2);
MaxOcc Function
Format:
i = maxocc([group]);
Description:
The maxocc function returns the maximum number of multiply occurring records or the maximum number of multiply-
occurring items defined for a group in the dictionary.
This value remains the same throughout the application run.
Return value:
The function returns an integer value of the maximum number of occurrences.
Example 1:
PROC HOUSING
errmsg("Maximum number of persons is %d", maxocc(PERSON));
Example 2:
PROC PERSON
errmsg("Maximum number of persons is %d", maxocc());
See also: Curocc Function, Totocc Function, Soccurs Function, Noccurs Function
Description:
The min function returns the minimum value of an item that occurs multiple times. During batch editing, if the values of
the items are not changed, the result of the minimum calculation is the same, no matter where the function is located.
During data entry, the result of the minimum calculation depends on where the statement is located. If the min
function is executed prior to the form or roster containing the item, it returns DEFAULT. If it is executed within the form
or roster containing the item, it returns the minimum value up to the current occurrence number. If it is executed after
the form or roster containing the item, it returns the minimum value for all occurrences of the item.
During batch editing, min always returns the minimum value for all occurrences of the item.
If a where condition is included, the function returns the minimum value of the occurrences for which the condition is
true.
If the value of an occurrence of the item is a special value (DEFAULT, MISSING, REFUSED, or NOTAPPL) the
occurrence will not be included in the calculation. If none of the occurrences have values other than special values,
DEFAULT is returned.
Return value:
The function return a decimal value.
Examples:
MIN_INCOME = min(INCOME);
MIN_MALE_INCOME = min(INCOME where SEX = 1);
NOccurs Function
Format:
i = noccurs(group);
Description:
The noccurs function returns the number of occurrences of a roster, form, or record. It is equivalent to the count
function without the where phrase.
During data entry, you may determine the current occurrence of a roster or form. The occurrence value is updated after
the first entry into the first field. If the noccurs function is executed prior to the roster or form it specifies then it returns
0. If it is executed from a field within the roster or form, it returns the current occurrence number after the first field is
on the path. For example, its value in the PREPROC of the first occurrence of the first item entered in a form or roster
is zero (0), i.e., before entry. After entry its value will always be one (1). This is true for each occurrence keyed, the
noccurs function is not incremented until the cursor is in or has passed through the first field on the roster or form. If it
is executed after the form or roster, it returns the total number of occurrences in the form or roster. If the form does not
repeat, noccurs will return 1 (a roster must always repeat). When used in Data entry noccurs and soccurs functions
are the same.
During batch editing, noccurs always returns the total number of occurrences in the group.
Return value:
The function returns the number of occurrences as an integer value.
Example:
TOTAL_PERSONS = noccurs(PERSON);
Seek Function
Format:
i = seek(multiple-item condition[,numeric-expr]);
Description:
The seek function returns the occurrence number of the first item in a multiply occurring item that satisfies a certain
condition. If numeric-expr is included, the function starts searching the multiply occurring record for a true condition
starting at occurrence numeric-expr. If a @ symbol precedes the numeric-expr, the function searches for the nth
occurrence of the condition.
Return value:
The function returns the occurrence number of an item that meets the condition or 0 if no such item is found.
Example 1:
numeric femaleIndex = seek(SEX = 2);
while femaleIndex do
write("Person #%d is a female with name '%s'",femaleIndex,NAME(femaleIndex));
femaleIndex = seek(SEX = 2,femaleIndex + 1);
enddo;
Example 2:
numeric secondSpouse = seek(RELATIONSHIP = 2,@2);
See also: Count Function, Has Operator, Seekmax Function, Seekmin Function
SeekMax Function
Format:
i = seekmax(multiple-item condition[,numeric-expr]);
Description:
The seekmax function returns the occurrence number of the item with the greatest value that satisfies a certain
condition in a multiply occurring item. If numeric-expr is included, the function starts searching the multiply occurring
record for a true condition starting at occurrence numeric-expr.
Return value:
The function returns the occurrence number of an item that meets the condition or 0 if no such item is found.
Example
// seekmax
numeric eldest_index = seekmax(AGE);
numeric eldest_aged_12to64_index = seekmax(AGE in 12:64);
// seekmin
numeric youngest_aged_65plus_index = seekmin(AGE in 65:95);
See also: Count Function, Has Operator, Seek Function, SeekMin Function, Max Function
SeekMin Function
Format:
i = seekmin(multiple-item condition[,numeric-expr]);
Example
// seekmax
numeric eldest_index = seekmax(AGE);
numeric eldest_aged_12to64_index = seekmax(AGE in 12:64);
// seekmin
numeric youngest_aged_65plus_index = seekmin(AGE in 65:95);
See also: Count Function, Has Operator, Seek Function, SeekMax Function, Min Function
SetOccLabel Function
Format
b = setocclabel(『 item_name ‖ group_name,』 occurrence_label);
Description
The setocclabel function sets the occurrence label for a repeating item or group (i.e., the row label of a roster). This is
especially useful when designing applications for mobile devices, as it allows you to specify the label that is displayed in
the navigational case tree. If no item_name or group_name is given, the function sets the occurrence label of the
current field. The occurrence label is set to the string expression occurrence_label.
Return Value
The function returns a logical value of 1 (true) if the label was set successfully and 0 (false) if the occurrence for the item
or group item does not exist.
Example 1
PROC PERSON_NAME
setocclabel(PERSON_ROSTER,strip(PERSON_NAME));
Example 2
PROC MYSURVEY_QUEST
preproc
// fill in occurrence labels for the household roster
do numeric ctr = 1 while ctr <= totocc(PERSON_ROSTER)
setocclabel(PERSON_ROSTER(ctr),strip(NAME(ctr)));
enddo;
SOccurs Function
Pa ge 559 of 684 Mul ple Occurrence Func ons
Format:
i = soccurs(record-name);
Description:
The soccurs function returns the total number of occurrences of a record.
During data entry, you may determine the current occurrence of a record. The occurrence value is updated after the
first entry into the first field. If the soccurs function is executed prior to the roster or form it specifies then it returns 0. If
it is executed from a field within the roster or form, it returns the current occurrence number after the first field is on the
path. For example, its value in the PREPROC of the first occurrence of the first item entered in a form or roster is zero
(0), i.e., before entry. After entry its value will always be one (1). This is true for each occurrence keyed, the soccurs
function is not incremented until the cursor is in or has passed through the first field on the record. If it is executed
after the form or roster, it returns the total number of occurrences in the form or roster. If the form does not repeat,
noccurs will return 1 (a roster must always repeat). When used in Data entry noccurs and soccurs functions are the
same.
During batch editing, soccurs always returns the total number of record occurrences found.
Return value:
The function returns the number of occurrences as an integer value.
Example:
NUM_HH_MEMBERS = soccurs(PERSON_REC);
See also: Totocc Function, Curocc Function, Noccurs Function, Count Function
Sort Function
Format
b = sort(group_name using 『 -』 item_name1 『 ,『 -』 item_name2, ...』 『 where where_condition』 );
Description
The sort function sorts occurrences of records or items based on the value of an item or multiple items. It orders the
multiple records or items in the specified group in ascending sequence using the specified data item as the sort key. The
sort key item must be contained within the record or item sorted. If a negative sign is included before the item name, the
sort will be in descending order. If a where_condition is included, the function sorts only the occurrences for which the
condition is true. When multiple sort keys are provided, the second, third, ... key is only evaluated when the values of the
previous keys are identical.
Sort is primarily intended for use in batch applications. It should be used with extreme caution in data entry applications
because of possible conflicts between the operator's actions and the program logic.
Return Value
The function returns a logical value 1 (true) if successful and 0 (false) otherwise.
Examples
// sort the people using the line number
sort(PERSON_EDT using LINE_NUM);
// sort the children by age from oldest to youngest
sort(PERSON_EDT using -AGE where RELATIONSHIP = 3);
// sort the people first by relationship code (ascending) and then by age (descending)
sort(PERSON_EDT using RELATIONSHIP, -AGE);
Sum Function
Format:
d = sum(multiple-item [where condition]);
Description:
The sum function returns the sum of an item that occurs multiple times. If a where condition is included, the function
returns the sum of the occurrences for which the condition is true.
During data entry, the result of the sum calculation depends on where the statement is located. If the sum function is
executed prior to the form or roster containing the item, it returns DEFAULT. If it is executed within the form or roster
containing the item, it returns the sum up to the current occurrence number. If it is executed after the form or roster
containing the item, it returns the sum for all occurrences of the item.
During batch editing, sum always returns the sum for all occurrences of the item.
If the value of an occurrence of the item is a special value (DEFAULT, MISSING, REFUSED, or NOTAPPL) the
occurrence will not be included in the calculation. If none of the occurrences have values other than special values,
DEFAULT is returned.
Return value:
The function returns a decimal value of the sum.
Example:
TOTAL_INCOME = sum(INCOME);
TOTAL_FEMALE_INCOME = sum(INCOME where SEX = 2);
Swap Function
Format:
b = swap(group-name,record-number,record-number);
Description:
The swap function reorders the sequence of occurrences of records or items. It is useful for reorganizing the position
(occurrence number) of items in a roster, for instance if you want to ensure that the head of household is the first
person on the roster.
Swap is primarily intended for use in batch applications. It should be used with extreme caution in data entry
applications because of possible conflicts between the operator's actions and the program logic.
Return value:
The function returns a logical value 1 (true) if successful and 0 (false) otherwise.
Example:
swap(PERSON,1,ptrHead);
TotOcc Function
Format:
i = totocc([group]);
Description:
Example 2:
PROC HOUSING
if totocc() > 1 then
errmsg("More than 1 housing record");
endif;
See also: Curocc Function, Maxocc Function, Count Function, Soccurs Function, Noccurs Function
Description
The compress function compresses one or more files and saves the compressed files in a single .zip file. That file can be
decompressed with the decompress function or with other popular tools. Compressing data, in addition to saving size, is
a convenient way to bundle several files into a single file.
The argument zip_file_name is a string expression containing the file name of the output .zip file. If an existing .zip file
exists with that name, it will be overwritten. The list of input files can come from a string list, input_file_list, that could
be returned from a dirlist function call. Alternatively, the list of input files can come from a string expression
input_file_name. If using a string expression, you can use the wildcard characters "*" and "?" to specify a group of files.
Return Value
The function returns the number of files (but not folders) added to the compressed .zip file. If there was an error creating
the compressed file, the function returns default.
Example 1
compress("CollectedData.zip", "*.csdb");
Example 2
list string imageFilesListing;
dirlist(imageFilesListing, pathname(InputFile), "*.jpg", recursive);
compress("Images.zip", imageFilesListing);
Decompress Function
Format
i = decompress(zip_file_name『 ,directory_name』 );
Description
The decompress function decompresses the compressed files contained in a .zip file. Compressed .zip files can be
created using the compress function or with other popular tools.
The argument zip_file_name is a string expression containing the file name of the .zip file. An optional string expression
directory_name indicates the folder to which the files should be decompressed. If no directory name is specified, the
files are decompressed to the current application directory.
Example
decompress("Images.zip",pathname(InputFile) + "Images");
Diagnostics Function
Format
s = diagnostics(『 property』 『 , argument, ...』 );
Description
The diagnostics function is used to get information about the version of CSPro you are using, as well as other
functionality. It is used mostly for troubleshooting.
The function takes a string expression property and then optional string arguments that determine what it returns. If
called without a property, the function will return all of the no-argument diagnostic properties. The properties are:
Return Value
The function returns a string containing the diagnostic information. If the property is invalid, then the function returns a
blank string.
Example
PROC MATERNITY_QUEST
preproc
if diagnostics("version") <> "7.5" then
errmsg("This application has not been tested with this version of CSPro. Things may
behave differently than expected.");
endif;
Encode Function
Format
s = encode(『 encoding_type, 』 text);
The optional argument encoding_type specifies the format for encoding. The text argument is a string expression that
specifies the text to encode. If an encoding type is provided with no text, then the default encoding type for the
application is modified. (An application starts with the default encoding type set to HTML.) The encoding types are:
Return Value
The function returns a string with the appropriate characters encoded. If the default encoding type is changed, the
function returns a blank string.
Example
// write to a HTML file
file html_file;
html_file.write("<p>Enumerator name: %s</p>", encode(strip(ENUMERATOR_NAME)));
string house_image_filename = pathconcat("../House Images", key(HOUSING_DICT) + ".jpg");
html_file.write("<p><img src='file:///%s' alt='House' /></p>", encode(URI,
house_image_filename));
ErrMsg Function
Format 1:
[b =] errmsg(alpha-exp[,p1[,p2[,...,pn]]])
[select(caption-1, field-name-1, ..., caption-n, field-name-n)]
[denom=var] [case|summary]);
Format 2:
[b =] errmsg(msg-num[,p1[,p2[,...,pn]]])
[select(caption-1, field-name-1, ..., caption-n, field-name-n)]
[denom=var] [case|summary];
Description:
The errmsg function displays a message on the data entry screen (when used in a Data Entry application) or writes a
message to the batch edit report (when used in a Batch Edit application). If messages are defined via the message
number msg-num, then those messages will be stored in a message file [.mgf].
Pa ge 565 of 684 Genera l Func ons
msg-num can be a number or numeric expression
Note to ISSA users: Use the errmsg function with the case keyword to replace the "display" function.
Each argument (e.g., "p1") is sequentially inserted into the error message. Arguments can be numeric or
alphanumeric expressions, but the type of the argument must match the type of the receiving field in the message
text. The maximum number of arguments in an errmsg function is 20.
In the message text
%[n]d = Insert a number and display it as an integer
%[n.d]f = Insert a number and display it as a decimal value
%[n.d]s = Insert a text string
"n" is the size of the field and "d" is the number of decimal places to show for a number.
Numbers are never truncated. Text strings are truncated only if ".d" is used.
If "n" is positive, the insert is right justified in the size of the field. If "n" is negative, the insert is left justified in the size
of the field. If "n" is a positive number with a leading zero, the insert is right justified in the size of the field and zero
filled to the left.
When inserting a number, if "n" is preceded by a +, the sign of the number is always displayed.
Examples:
Value = 23456 %d 23456
%10d 23456
%-10d 23456
%010d 0000023456
%+10d +23456
%+010d +000023456
%f 23456.000000
The denom keyword allows you to specify a denominator, so that you can show percentages in the summary portion
of the output listing. This is very useful for showing edit failure rates. In Example 2 below, the output listing will show
the number of times there was more than one head of household divided by the number of households processed
during the run. Note that it is the responsibility of the application designer to write logic to put the proper values into
the denominator variable.
The case and summary keywords give you some control over the output listing. By default, the output listing shows
you messages case by case, and also shows you a summary of the number of times the message was triggered (with
an optional denominator, described above). You can limit the output listing to only case-by-case reporting, or only
summary reporting by using these keywords.
Example 2:
errmsg("More than 1 head of household") denom = PERSON_COUNT summary;
Example 3:
errmsg("Head of household is %d years old. Age must be >= 12", AGE)
select("Go to RELATIONSHIP", RELATIONSHIP,"Go To AGE", AGE)
denom = PERSON_COUNT;
Format 2 Example:
OK = errmsg (1,"June",30,31);
Note the errmsg call could have also been invoked as follows:
i = 1;
OK = errmsg (i,"June",30,31);
Description
The execsystem function, on a desktop, starts another Windows application or process. There is a related version for
applications running on mobile devices.
The string expression action is the name of the application or process to be started. Command line parameters may be
included in this expression. If folder or file names contain blanks, then quotation marks (") must surround the names. In
this case CSPro text strings can be surrounded by apostrophes (').
An optional set of three flags controls the way that the application is opened, and includes the following options:
Window Focus focus The application is opened and immediately has focus (is "active"). This is the default
option.
nofocus The application is opened without focus.
Return Value
The function returns a logical value of 1 (true) if the new application is started successfully and 0 (false) otherwise.
Examples
// opens the Windows calculator
execsystem("calc.exe");
// opens documentation in Text Viewer
execsystem(maketext('"%sTextView.exe" "C:\Survey\Helps.txt"', pathname(CSPro)), maximized,
wait);
// open a website in Internet Explorer
execsystem(maketext('"%sInternet Explorer\iexplore.exe" "https://www.csprousers.org/forum"',
pathname(ProgramFiles32)));
See also: SystemApp Object, ExecSystem Function (Mobile), ExecPFF Function, View Function
Description
The execsystem function, on mobile devices (Android), starts another application or executes a task. There is a related
version for applications running on desktop CSEntry.
If stop is coded, the application will immediately close after launching the application, in the same way that would occur
if using the stop function with an argument of 1.
The string expression action must begin with one of the following commands:
Command Description
app Open another application on the device. (You can get more control using SystemApp instead.)
browse View a website in the Internet browser.
call Place a phone call.
camera Take a photo in .jpg or .png format.
gps View a latitude/longitude point using an installed mapping engine (e.g., Google Maps).
html View a website from within CSEntry (equivalent to using the view function).
signature Bring up a screen where someone can sign the mobile device, an image of which can be saved in .jpg
or .png format.
sms Send a text message.
Return Value
The function returns a logical value of 1 (true) if the new application or task is started successfully and 0 (false)
otherwise.
Examples
execsystem("app:com.google.android.gm"); // open Gmail
execsystem("browse:https://www.census.gov");
execsystem("call:+13017631451"); // call the CSPro support number
execsystem("camera:" + pathconcat(Application, "photo.jpg"));
execsystem("camera:" + pathconcat(Application, "photo.png"));
execsystem("gps:38.84839,-76.931098");
execsystem("gps:38.84839,-76.931098,CSPro Team at the U.S. Census Bureau");
execsystem("html:https://www.census.gov");
execsystem("html:file:///" + pathconcat(Application, "local_webpage.html"));
execsystem("signature:" + pathconcat(Application, "saved-signature.jpg"));
execsystem("signature:" + pathconcat(Application, "saved-signature.png"));
execsystem("sms:+13017631451");
execsystem("sms:+13017631451,Hello, CSPro Team!");
execsystem("view:" + pathconcat(Application, "picture.jpg"));
execsystem("view:" + pathconcat(Application, "audio.mp3"));
execsystem("view:" + pathconcat(Application, "movie.3gp"));
See also: SystemApp Object, ExecSystem Function (Desktop), ExecPFF Function, Multimedia Features, View
Function
ExecPFF Function
Format
b = execpff(pff_filename ‖ pff_name『 , flags』 );
Description
The execpff function starts another CSPro application. This function is useful when designing menu programs or when
conducting surveys that use multiple machines on which the path of the CSPro executables may differ. The function is
very similar to execsystem, but instead of passing the name of an application or task, you supply a string expression,
pff_filename, containing the name of a CSPro application's Program Information File (.pff), or pff_name, the name of a
pff object. If using a modified but unsaved pff object, it will be saved to a temporary file and then executed.
An optional set of flags, described in the execsystem helps, allow you to control how the CSPro application is opened.
On mobile devices, multiple data entry applications cannot be run simultaneously, so the function simply stores the file
name of the PFF. When the mobile application returns to the Entry Applications screen, whether via user interaction or
via the stop function, the passed PFF will be launched. Using stop as a flag will cause the PFF to be launched
immediately.
If the PFF file that launched the application that calls this function specified an OnExit parameter, the pff_filename
specified by this function will override the OnExit parameter.
Example
PROC MENU
if MENU = 1 then
execpff("Household.pff", stop);
elseif MENU = 2 then
execpff("Agriculture.pff", stop);
endif;
See also: Pff.Exec Function, ExecSystem Function (Desktop), ExecSystem Function (Mobile)
GetProperty Function
Format
s = getproperty(『 symbol, 』 property_name);
Description
The getproperty function returns a string containing the current value associated with the property specified by the
string expression property_name. If an optional dictionary symbol, which can be either an item or a field, is specified,
then the property must apply to an item or field. If no symbol is specified, then the property must be an application
property.
Properties List
The following properties apply to the application:
Name Values
DataType Alpha, Numeric
Decimal (integer)
DecimalChar Yes, No
Len (integer)
ZeroFill Yes, No
Example
PROC APPLICATION_FF
preproc
if getproperty("OperatorID") = "No" then
setoperatorid(getusername());
endif;
GetLabel Function
Format
s = getlabel(dictionary_symbol 『 , value』 );
Description
The getlabel function returns the label of a dictionary_symbol or the text associated with a particular value of the
symbol as defined in a value set. If the value argument is not specified, then the dictionary symbol's label is returned.
The symbol can be the name of a dictionary, level, record, item, or value set.
The value argument can only be used if the dictionary symbol is an item or a value set. The value is either a numeric or
string expression, based on the type of the item. If the argument is used, the label associated with the specified value is
returned. If the symbol is an item name, then the value labels from the current value set are returned. If the symbol is a
value set, then the value labels from that value set are returned. If no label is associated with the value, then an empty
string is returned. This behavior can also be achieved with the getvaluelabel function.
When using the value argument, a value's code can be returned rather than the value label by using the optional by
label (as opposed to the default by code behavior).
Return Value
The function returns a string containing the label.
Example
PROC EXAMPLE
write("%s", getlabel(SEX)); // Sex
write("Crop Type = %s", getlabel(CROP_VS2, 23)); // Crop Type = Maize
PROC RELATIONSHIP
write("%s = %s", getlabel($), getlabel($, $)); // Relationship = Head,
// Relationship = Child, etc.
PROC BY_EXAMPLE
write("%s", getlabel(SEX by code, 1)); // Male
write("%s", getlabel(SEX by label, "Male")); // 1
Return Value
The getlanguage function returns a string with the name of the language currently being used by the application.
Example
if getlanguage() = "EN" then
errmsg("Hello");
elseif getlanguage() = "FR" then
errmsg("Bonjour");
endif;
See also: SetLanguage Function, OnChangeLanguage Global Function, Multiple Language Applications
GetSymbol Function
Format
s = getsymbol();
Description
The getsymbol function returns the name of the current procedure being executed.
An optional syntax, getsymbol(savepartial), returns the field name and occurrence information (for multiply-occurring
items) of the location of the last partial save.
Return Value
The function returns a string with the procedure's name.
Example
function OnStop()
if getsymbol() = "ACTION" then
stop(1);
else
reenter ACTION;
endif;
end;
GetValueLabel Function
Pa ge 573 of 684 Genera l Func ons
Format
s = getvaluelabel(item_name);
Description
The getvaluelabel function returns the value label associated with the current value of item_name. The label is
determined in the following order, with the first valid label returned:
1. The label for the value in the item's current value set.
2. The label for the value in the item's primary value set.
3. The value itself, formatted for displaying.
s = getlabel(ITEM_NAME, ITEM_NAME);
In addition to uses in logic, the function can be used as a fill in question text.
Return Value
The function returns a string containing the label.
Example
PROC GRADE_ATTENDING
if GRADE_ATTENDING > 3 and AGE < 15 then
errmsg("Someone aged under 15 cannot attend %s.", getvaluelabel(GRADE_ATTENDING));
reenter;
endif;
// using the "%l" message formatter, the above error message can also written as:
errmsg("Someone aged under 15 cannot attend %l.", GRADE_ATTENDING);
Hash Function
Format
s = hash(value 『 , length『 , salt』 』 );
Description
The hash function generates a hash value for the string or numeric expression value. This hash value is a fixed-length
representation of the value and can be used for purposes such as storing passwords without storing the actual plaintext
password. The hash value of the same text string will always be the same, but it is nontrivial (impossible for CSPro
purposes) to work backwards to calculate the original text from the hash value.
The hash value is calculated using a SHA-256 key derivation algorithm. This results in a 32-byte value, which is returned
as a hexadecimal string with each byte represented as two characters, thus resulting in a string of a length 64.
Optionally, you can specify a numeric expression length, which specifies the length of the hash value in bytes. The
string returned will always be twice the value of length. The maximum value for length is 500.
You can also specify a string expression, salt, which is an additional input in generating the hash value. You must store
Pa ge 574 of 684 Genera l Func ons
this salt value somewhere to be able to use this hash function to perform any checks. More information on hash values,
key derivation algorithms, and salt values is readily available online.
Return Value
The function returns a hash value represented as a hexadecimal string.
Example
string user_password = prompt("Enter your password", password);
if hash(user_password) <> USER_PASSWORD_HASH then
errmsg("Invalid password. You cannot access this system.");
stop(1);
endif;
InValueSet Function
Format
b = invalueset(item_name『 , value_set_name』 );
Description
The invalueset function determines whether an item's current value is within the permissible ranges of a value set. In
addition to specifying the item_name, an optional argument, value_set_name, allows you to specify in which value set
to check if the item's value is within the permissible ranges. If no value set is specified, the item's current value set is
used.
Return Value
The function returns a logical value of 1 (true) if the item's value is within the value set's ranges and 0 (false) otherwise. If
the item has no value set, the function return 1 (true).
Example 1
if not invalueset(P08_MARITAL_STATUS) then
errmsg("Marital status is out of range. Value is %d", P08_MARITAL_STATUS);
endif;
Example 2
if D02_SEX = 1 and not invalueset(D05_CAUSE_DEATH, D05_CAUSE_DEATH_MALE_VS) then
errmsg("A man cannot die of %l", D05_CAUSE_DEATH);
reenter;
endif;
IsChecked Function
Format
b = ischecked(code, check_box_field_name);
The codes of CSPro check boxes are placed at uniformly spaced offsets based on the size of the code. For example, if
the check box field has a length of 20 and each code has a length of 2, then the function checks positions 1-2 for the
code, then positions 3-4, and so on until positions 19-20.
Return Value
The function returns a logical value of 1 (true) if the code was found (at the correct offset) in the check box field and 0
(false) otherwise.
Example
PROC LANGUAGES_SPOKEN_OTHER
preproc
ask if ischecked("Z", LANGUAGES_SPOKEN);
LoadSetting Function
Format
s = loadsetting(attribute 『 ,default_setting』 );
Description
The loadsetting function, using the string expression attribute, retrieves a setting from an attribute-value pair, a setting
that has been previously saved using the savesetting function. An optional string or numeric expression
default_setting can be provided and will be returned if no value has been saved for the given attribute.
The file from which settings are retrieved can be modified by altering the CommonStore attribute of a PFF file. If an
attribute does not exist in the specified file but does exist in the global settings database (shared by all CSPro
applications), then that global attribute-value pair is used.
Return Value
The function returns a string containing the value associated with the attribute. If no value is associated with the attribute,
then the function returns a blank string unless a default setting has been provided, in which case it returns the default
setting.
Example
PROC MY_APPLICATION_FF
preproc
// set the language at the start of the program, defaulting to English
setlanguage(loadsetting("Language", "EN"));
See also: SaveSetting Function, User and Configuration Settings, CSEntry Settings Modification
LogText Function
Format
b = logtext(message『 , argument1, ..., argumentN』 );
Description
The logtext function writes a user-defined message to the paradata log.
The message is either a string expression or a numeric message number that contains the text that is written to the file.
If the text contains any message formatters, the optional arguments argument1 to argumentN will be inserted into the
text.
Return Value
The function returns a logical value of 1 (true) if successful and 0 (false) otherwise. If no paradata log file is open, then the
function returns 0.
Example
PROC RELATIONSHIP
if RELATIONSHIP = 1 then
logtext("Head of household registered as %s",strip(NAME));
endif;
See also: Paradata, Message Formatting Options, Trace Function, Write Function
MaxValue Function
Format
d = maxvalue(item_name ‖ value_set_name);
Description
The maxvalue function returns the maximum valid value for an item or value set. If a numeric item_name is provided, the
function looks at the current value set for the item, or if no value set exists, looks at the item itself (returning a maximum
value based on the item's number of digits). If a numeric value_set_name is provided, the function returns the maximum
Pa ge 577 of 684 Genera l Func ons
range or discrete value in the value set.
Return Value
The function returns the maximum valid value for an item or value set.
Example
PROC EDUCATION_CURRENT
if EDUCATION_CURRENT = 1 and AGE > maxvalue(EDUCATION_CURRENT_PRESCHOOL_VS) then
errmsg("Someone aged %d cannot be currently attending preschool.", AGE);
reenter;
endif;
MinValue Function
Format
d = minvalue(item_name ‖ value_set_name);
Description
The minvalue function returns the minimum valid value for an item or value set. If a numeric item_name is provided, the
function looks at the current value set for the item, or if no value set exists, looks at the item itself (returning a minimum
value based on the item's number of digits). If a numeric value_set_name is provided, the function returns the minimum
range or discrete value in the value set.
Return Value
The function returns the minimum valid value for an item or value set.
Example
PROC AGE
if RELATIONSHIP = 1 and AGE < minvalue(AGE_HEAD_VS) then
errmsg("The head of household must be at least %d-years-old.", minvalue(AGE_HEAD_VS));
reenter;
endif;
Paradata Function
Format
d = paradata(action_type『 , argument1, ..., argumentN』 );
Description
The paradata function performs an action related to the paradata log. The first argument, action_type, must be one of
the following:
It is not recommended that you open or close paradata logs via logic, as the log will not contain all of the initializing
features that exist when specifying the paradata log when the application begins (and via the PFF file).
Return Value
For the open, close, and flush actions, the function returns a logical value of 1 (true) if successful and 0 (false)
otherwise. For the concat action, the function returns the number of paradata logs concatenated, or default upon error.
Look at the documentation for the sqlquery function to see the return values for the query action.
Example - Concat
list string logs_list;
dirlist(logs_list, "C:\Survey\Paradata", "*.cslog");
// add the paradata logs into the currently open log
paradata(concat, filename(paradata), filename(paradata), logs_list);
Example - Query
string message_events_query = maketext(
"SELECT COUNT(*) "
"FROM message_event "
"JOIN event ON message_event.id = event.id "
"JOIN case_instance ON event.case_instance = case_instance.id "
"JOIN case_info ON case_instance.case_info = case_info.id "
"WHERE case_info.uuid = '%s';",
uuid(DICT_NAME));
numeric message_events_for_this_case = paradata(query, message_events_query);
PathConcat Function
Feature Upgrade: Starting with CSPro 7.5, you are encouraged to use the path.concat function.
Format
s = pathconcat(path1『 , ..., pathN』 );
Description
The pathconcat function concatenates the values of multiple string expressions (path1 + ... + pathN) into one string
representing a file path. The first argument can be either a string expression or one of the arguments used in the
Pa ge 579 of 684 Genera l Func ons
pathname function. The combined paths are returned as an absolute, not relative, path.
Return Value
The function returns the concatenated path string.
Example
string image_filename = pathconcat(Application, "../Images", maketext("%v%v.jpg", CLUSTER,
HHNO));
PathName Function
Format
s = pathname(path_type ‖ dictionary_name ‖ file_handler);
Description
The following path_type arguments return the folder name associated with a special directory on your device:
path_type Description
Application The folder where the application file (.ent or .bch) is located.
CSPro The folder where the CSPro executable files are located.
InputFile The folder where the main data file (associated with the primary dictionary) is located.
Temp The operating system's temporary directory.
path_type Description
Documents The current user's Documents folder.
Desktop The current user's Desktop folder.
ProgramFiles32 The folder containing installed 32-bit applications.
ProgramFiles64 The folder containing installed 64-bit applications. If the computer has only a 32-bit processor,
this returns the same value as ProgramFiles32.
Windows The folder contains the Windows operating system.
path_type Description
CSEntry The folder from which CSPro applications are run.
CSEntryExternal On devices with removable storage (e.g., an SD card), the folder on removable storage to
which CSPro applications can write.
If supplying a dictionary_name, the function returns the folder name of the dictionary file (.dcf) itself, not of the data file
associated with the dictionary. On mobile devices, the function returns the folder name of the .pen file because dictionary
files (.dcf) are not used directly when running .pen applications.
If providing a file_handler (declared in a file statement in PROC GLOBAL), the function returns the folder name
containing the text file associated with the file handler.
Example
setfile(log_file, pathname(Desktop) + "Census Daily Report.txt", create);
Report Function
Format
b = report(template_file_name『 ,output_file_name』 );
Description
The report function generates a templated report. The function reads the template specified in the string expression
template_file_name and analyzes the template for any queries that need to be executed. After running the queries, and
passing to the template any data created using the setreportdata function, a report is created based off the template.
The optional string expression output_file_name specifies the name for the created report. If this argument is not
specified, the report will be saved to a temporary file and it will be displayed in a web browser. These temporary reports
are deleted when the application closes.
Return Value
The function returns a logical value of 1 (true) if the report was generated correctly, and 0 (false) otherwise. Any errors
encountered when creating the report are displayed, which will help you diagnose any problems with the template.
Example
// create and view the report
report("interviewer_report_template.html");
// alternatively, save the report
report("interviewer_report_template.html","interviewer_report." + getoperatorid() + ".html");
SaveSetting Function
Format
b = savesetting(attribute, value);
Description
The savesetting function saves a setting as an attribute-value pair using the string expression attribute and the string
or numeric expression value. You can save multiple settings using different attribute strings and can retrieve the settings
using the loadsetting function. These settings are stored locally on the device and can be accessed by other programs
using the attribute string. Using settings is a way to share data between different CSPro applications on the same
device.
Return Value
The function returns 1 (true) if successful or 0 (false) if the function failed.
Example
PROC INTERVIEWER_NAME
preproc
string InterviewerNameKey = "InterviewerName";
INTERVIEWER_NAME = loadsetting(InterviewerNameKey);
postproc
if INTERVIEWER_NAME = "" then
errmsg("You must enter the interviewer's name.");
reenter;
endif;
savesetting(InterviewerNameKey, INTERVIEWER_NAME);
See also: LoadSetting Function, User and Configuration Settings, CSEntry Settings Modification
SetLanguage Function
Format
b = setlanguage(language_name);
Description
The setlanguage function changes the language being used by an application. Pass a string expression,
language_name, with the name, not the label, of the language to which you want to change.
Return Value
The function returns 1 (true) if the language exists and the language was successfully changed. Otherwise it returns 0
(false).
Example
PROC INTERVIEW_LANGUAGE
if INTERVIEW_LANGUAGE = 1 then
setlanguage("EN");
elseif INTERVIEW_LANGUAGE = 2 then
setlanguage("FR");
else
setlanguage("ZH");
endif;
See also: GetLanguage Function, OnChangeLanguage Global Function, Multiple Language Applications
Description
The setproperty function modifies the current property, specified by the string expression property_name, associated
with an application or a field. If an optional dictionary symbol is specified, then the property must apply to a field. If the
symbol is not a field but is intead a block, group, form, or dictionary, then the function will try to apply the property to all
fields belonging to that symbol. If no symbol is specified, then the property must be an application property. The property
will be modified to the value given in the string expression property_value.
Properties List
The following properties apply to the application:
Name Values
DataType Alpha, Numeric
Decimal (integer)
DecimalChar Yes, No
Len (integer)
ZeroFill Yes, No
Return Value
The function returns the number of items whose property was successfully changed. If either the property name or value
is invalid, then the function returns default.
Example
SetReportData Function
Format
b = setreportdata(『 attribute,』 data_source);
Description
The setreportdata function stores in memory a piece of data that will later be used by the report function to generate
a templated report. The string expression attribute specifies the key, or tag, that uniquely identifies the piece of data.
The attribute is case sensitive. In the report template, you will use this attribute to refer to this data.
data_source Description
value A string expression containing some text.
record_name The contents of a record, including multiply occurring records. In a data entry application, only
information up to the number of entered occurrences will be stored.
list_name The contents of a list.
array_name The contents of a one- or multiple-dimensional array.
sqlquery The result set of a query executed by the sqlquery function. Do not specify that function's optional
result_set argument, as the result set is stored in memory when called by setreportdata.
It is not necessary to specify the attribute when working with a record_name, list_name, or array_name. If no
attribute is specified, then it will default to the name of the record, list, or array.
Return Value
The function returns a logical value of 1 (true) if the data was stored successfully, and 0 (false) otherwise (for example if
the sqlquery function encountered a problem).
Example - SqlQuery
SetValueSet Function
Format
b = setvalueset(item_name, value_set_name);
Description
The setvalueset function allows you to dynamically change an item's current value set. An item's current value set is
used to determine whether or not entered values are out of range. The current value set is also used to provide the value
choices displayed in extended controls. The changing of a value set is not permanent; it remains in effect only during the
processing of the current case or until the next call to setvalueset.
The item_name is the name of the item in the data dictionary whose value set is to be changed. The argument is usually
an item name but, preceded by the @ symbol, it can also be a string expression that evaluates to the name of an item;
for example:
setvalueset(@getsymbol(), MY_VALUESET_VS1);
The argument value_set_name is the name of an existing value set in the dictionary for the specified item. The function
will replace the values of the current value set for the item with the values from the value set specified by
value_set_name, which is either the name of a value set or a string expression that evaluates to a value set name. The
value_set_name can also be the name of a dynamic value set.
The setvalueset function can also create dynamic value sets from array arguments.
Return Value
When using a value set specified in the dictionary, the function returns a logical value of 1 (true) if the value set was
changed successfully and 0 (false) otherwise (for example, if the value set name is invalid). With dynamic value sets, the
function returns the number of codes in the newly created value set.
Example 1
PROC D05_CAUSE_DEATH
onfocus
if D02_SEX = 1 then
setvalueset(SEX, D05_CAUSE_DEATH_MALE_VS);
else
setvalueset(SEX, D05_CAUSE_DEATH_FEMALE_VS);
endif;
Example 2
Format
i = setvalueset(item_name, codes_array, labels_array『 , images_array』 );
Description
The setvalueset function allows you to dynamically change an item's current value set. An item's current value set is
used to determine whether or not entered values are out of range. The current value set is also used to provide the value
choices displayed in extended controls. The changing of a value set is not permanent; it remains in effect only during the
processing of the current case or until the next call to setvalueset.
The item_name is the name of the item in the data dictionary whose value set is to be changed. The second argument,
codes_array, is an array of codes to use in the new value set and the third argument, labels_array, is an array of labels
that correspond to the values in the codes array (the nth element of the labels array is the label for the nth element in the
codes array). If the item is numeric, codes_array must be a numeric array. If the item is alphanumeric, codes_array
must be a string array. The labels_array must always be a string array. The optional fourth argument, images_array, is
a string array pointing to image filenames to be used as value set images.
When working with the codes_array, to indicate the end of a numeric array, assign the special value notappl to the
position after the last valid code. To indicate the end of a string array, assign a blank value. This indicates when to stop
loading values from the array.
In versions of CSPro earlier than 6.0, the value set labels and codes array needed to start at index position 0. Zero-based
arrays are still permitted, but if a one-based array is passed to the function (and the zeroth element has never been
assigned to), then CSPro will start processing the dynamic value set at array position 1.
Return Value
The function returns the number of codes in the newly created value set.
Example
SetValueSets Function
Format
i = setvaluesets(value_sets_search_query);
Description
The setvaluesets function allows you to dynamically change the value sets associated with multiple items. Unlike the
setvalueset function, which only operates on one item, the setvaluesets function traverses every item in the primary
dictionary and searches each item's value sets to see if they have a name that contains the text found in the string
expression value_sets_search_query. If the text is found, the first value set with the name containing the expression is
the value set used for the item. Otherwise the currently used value set is maintained.
Before multiple language dictionary labels were added in CSPro 6.2, this function was useful for changing the language of
value sets in a multi-language application.
Return Value
The function returns the number of items whose value sets were changed.
Example
function ResetLanguage(language)
if language = 1 then // English
setvaluesets("_ENG");
elseif language = 2 then // Russian
setvaluesets("_RUS");
else // Tajik
setvaluesets("_TAJ");
endif;
end;
Special Function
Format
b = special(numeric_expression);
Description
The special function determines whether the value of numeric_expression is one of the four special values: missing,
refused, notappl, or default.
Return Value
The function returns a logical value of 1 (true) if the variable is a special value and 0 (false) otherwise.
Example
if special(CHILDREN_EVER_BORN) then
CHILDREN_EVER_BORN = 0;
endif;
SqlQuery Function
Format
d = sqlquery(sqlite_database『 , result_set』 , sql_query);
Description
The sqlquery function executes a query on a SQLite database and returns a result set in a variety of formats. This
function is intended for advanced users who are familiar with writing SQL expressions. While many files used by CSPro
applications are text files, there are some files that are stored as SQLite databases, including CSPro DB data files and
paradata logs. You can query these files with this function.
sqlite_database Description
paradata The query will be executed on the currently open paradata log. This is the same as:
paradata(query,...).
dictionary_name The data source pointed to by dictionary_name must be of type CSPro DB, Encrypted CSPro DB, or
text. The query will be executed on the file associated with this data source. If working with text
files, you can query the file's index.
file_name The string expression file_name gives the file name of a SQLite database. The database will be
opened, the query executed, and then the database will be closed. This allows you to work with
databases maintained outside of CSPro.
The optional second argument, result_set, indicates the destination for the result set generated following the execution of
the query. If you do not provide a result set for your query, the query must be a scalar query (one that returns a single
result such as "SELECT COUNT(*) FROM ..."). The result set must be one of the following:
Using a working storage record for the results is advantageous because the results are stored in their proper type.
Because lists and arrays are all of one type (numeric or string), some results may be converted to a invalid type (for
example, a string value may be stored in a numeric array).
The final argument, sql_query, is a string expression containing the SQL query.
Return Value
If executing a scalar query, the function returns the queried value. Otherwise, the function returns the number of rows
stored in the result set. If there was an error executing the query, then the function returns default.
string SessionQuery =
"SELECT operatorid_info.operatorid AS OPERATOR_NAME, event_start.time AS START_TIMESTAMP,
event_end.time AS END_TIMESTAMP "
"FROM session_event session_event_start "
"JOIN event event_start ON session_event_start.id = event_start.id "
"JOIN session_event session_event_end "
"JOIN event event_end ON session_event_end.id = event_end.id AND
event_start.session_instance = event_end.session_instance "
"JOIN session_instance ON event_start.session_instance = session_instance.id "
"JOIN session_info ON session_instance.session_info = session_info.id "
"JOIN operatorid_info ON session_info.operatorid_info = operatorid_info.id "
"WHERE session_event_start.action = 1 AND session_event_end.action = 0 "
"ORDER BY event_start.time;";
sqlquery(paradata, WS_ENTRY_SESSIONS_REC, SessionQuery);
do numeric ctr = 1 while ctr <= count(WS_ENTRY_SESSIONS_REC)
errmsg("Operator '%s' worked on %s at %s for %0.2f minutes",
strip(OPERATOR_NAME(ctr)),
edit("9999-99-99", sysdate("YYYYMMDD", START_TIMESTAMP(ctr))),
edit("99:99", systime("HHMM", END_TIMESTAMP(ctr))),
( END_TIMESTAMP(ctr) - START_TIMESTAMP(ctr) ) / 60);
enddo;
PROC GLOBAL
function sql DisplayEntrySession(string operatorName, numeric startTime, numeric endTime)
errmsg("Operator '%s' worked on %s at %s for %0.2f minutes",
operatorName,
edit("9999-99-99", sysdate("YYYYMMDD",startTime)),
edit("99:99", systime("HHMM",endTime)),
( endTime - startTime ) / 60);
end;
PROC EXAMPLE
string SessionQuery =
"SELECT DisplayEntrySession(operatorid_info.operatorid, event_start.time,
event_end.time) "
"FROM session_event session_event_start "
"JOIN event event_start ON session_event_start.id = event_start.id "
"JOIN session_event session_event_end "
"JOIN event event_end ON session_event_end.id = event_end.id AND
event_start.session_instance = event_end.session_instance "
"JOIN session_instance ON event_start.session_instance = session_instance.id "
"JOIN session_info ON session_instance.session_info = session_info.id "
"JOIN operatorid_info ON session_info.operatorid_info = operatorid_info.id "
"WHERE session_event_start.action = 1 AND session_event_end.action = 0 "
"ORDER BY event_start.time;";
sqlquery(paradata, SessionQuery);
Stop Function
Format
b = stop(『 stop_code』 );
Description
The stop function prematurely ends a CSPro application. If the function is used in a data entry application, the optional
numeric expression stop_code determines whether data entry is stopped just for the current case or whether the entire
data entry application is closed. If stop_code is not specified or is 0, entry of the current case is stopped (the same a
pressing the stop button), but CSEntry remains active. If stop_code is 1, entry of the current case is stopped and
CSEntry terminates.
In a data entry application, if the function is executed in the postproc of the first (highest) level then the data for the case
is saved. Otherwise, any data entered for the current case is lost. If you want to avoid losing data, you can call
savepartial just before the stop function.
If the function is used in a batch edit application, the stop_code has no effect. The run is always terminated. If an output
file was specified in the batch run, neither the current case nor subsequent cases will be saved to the output file.
Return Value
The function returns a logical value of 0 (false) if the program cannot be stopped (for example, if the function is called
from an external form).
SysParm Function
Format
s = sysparm(『 parameter_name』 );
Description
The sysparm function returns the value of a parameter provided in a data entry or batch edit PFF file. The function returns
the parameter as a left-justified string. If no parameter was given in the PFF file, the function returns a blank string.
If a string expression parameter_name is provided, the function looks to see if there is a parameter with the given name
specified in the [Parameters] section of the PFF file. If such a parameter exists, it is returned as a string. Using this
functionality, you can pass several parameters to your programs.
On Windows, it is possible to check for the presence of a command line argument using this function. If
parameter_name is not in the PFF file, the function will check if that string is specified in the command line and if so,
will return a non-blank string.
Return Value
The function returns a string containing the parameter.
Example PFF
[Parameters]
Parameter=Otter
State=20
County=5
Example Logic
errmsg("%s",sysparm()); // displays Otter
STATE = tonumber(sysparm("State")); // sets STATE to 20
COUNTY = tonumber(sysparm("County")); // sets COUNTY to 5
Tr Function
Format
s = tr(string_expression ‖ message_number);
Return Value
The function returns a string with the translated message. If there is no available translation, then the original message is
returned.
Example
FR("Hello") Bonjour
See also: Message File (.mgf), Multiple Language Applications, SetLanguage Function
Trace Function
Programmers use tracing to obtain low-level information about how an application runs, with the information often used for
debugging purposes to understand why a program does not execute as expected. CSPro offers limited tracing
functionality for two objectives: logging user-generated information and tracking executed statements. Tracing messages
can be displayed in a window or saved to a file.
It is possible, by calling the trace function twice, to send messages to both a window and a file. If the filename does not
contain a dictionary, the file will be placed in the application folder. On mobile devices, it is only possible to trace to a file
and any trace statements output to a window will be ignored.
The set trace statement indicates to CSPro that logic statements should or should not be outputted but the
statements will only be outputted if tracing is activated, thus the trace function and set trace statements must be
used together.
Example
trace("There is no trace window open so this message is discarded");
// opens the trace file and clears previous contents
trace(on,"trace.txt",clear);
trace("This message appears in the file");
trace("Complex strings can be outputted using errmsg-style formatting; e.g., e = %0.3f",exp(1));
// closes the trace file
trace(off);
// opens the trace file and now messages will be appended to the end of the file
trace(on,"trace.txt");
set trace;
numeric value = 10;
if value > 10 then
errmsg("A");
elseif value < 10 then
errmsg("B");
else
errmsg("C");
endif;
errmsg("This statement will appear on the trace window");
set trace(off);
errmsg("This statement will not appear on the trace window");
Trace Output
As the following trace results show, the output for conditional statements (e.g., if) and loops (e.g., do) is limited. Trace
results show the line numbers to the left of the executed statements.
UUID Function
Format
s = uuid(『 dictionary_name』 );
Description
This uuid function returns a human-readable string containing a universally unique identifier (UUID), also referred to as a
globally unique identifier (GUID). This is a 36-character string that will be formatted in groups of 8, 4, 4, 4, and 12
characters, separated by hyphens. For example:
e459e2b6-1fec-4d71-86c1-078cc2fe7433
This is a string representation of a 128-bit number that can essentially be considered globally unique. The probability of
generating the same UUID twice is near zero. You will receive a different string every time you call the function.
If an optional dictionary_name is specified, the function returns the UUID (internal key) associated with the currently
loaded case. This may be useful when working with paradata queries.
Return Value
The function returns a string containing the human-readable representation of the identifier.
Example
PROC UNIQUE_ID
preproc
if UNIQUE_ID = "" then
UNIQUE_ID = uuid();
endif;
View Function
b = view(filename ‖ website_address』 );
Return Value
The function returns a logical value of 1 (true) if the file exists and the external application could be launched successfully
and 0 (false) otherwise.
Examples
view("https://www.census.gov/data/software/cspro.html");
view("../Old Images/carn03.jpg");
Warning Function
Format
i = warning(message『 , argument1, ..., argumentN』 );
Description
The warning function is similar to the errmsg function and the arguments and return value are described in the help
documentation for that function. The function only has special behavior in a data entry application. If used in a batch or
tabulation application, it behaves identically to errmsg.
The function differs from errmsg in that it can be used for "soft check" error messages. These messages will not be
displayed to the operator when advancing in the case (such as when resuming from a partial save). This can also be
useful for information that you want to display to the operator only when they have just filled a field, rather than when
moving from one section to another section.
If using the select option, then one of the field arguments must be continue or you must specify the default_option.
When advancing past this function, a message will not display and then the default_option or continue will be
evaluated.
Example
PROC RELATIONSHIP
if RELATIONSHIP = 1 then
warning("Please confirm that %s is the head of household.", strip(NAME))
select("Correct", continue,
"Incorrect: Change Relationship", RELATIONSHIP);
endif;
Description
The adjlba function adjusts the lower bound of the century month code (CMC) of an event based on an age. It is
generally used to calculate the minimum date of birth based on the date of an interview.
The lower_cmc1 and upper_cmc1 arguments are numeric expressions generally representing the date of birth CMC
range. The lower_cmc2 argument is a numeric expression of the CMC of an event, generally the date of an interview.
The upper_cmc2 argument is not used (but is used by adjuba). The age argument is a numeric expression expressing
the age of the person in complete years.
Return Value
The function returns the adjusted lower bound of the CMC of an event based on the age. If the adjusted lower bound is
greater than upper_cmc1, the function returns the error code of -1.
Example
t = adjlba(ldob,udob,di,di,HL6);
See also: AdjUBA Function, AdjLBI Function, AdjUBI Function, CMCode Function, SetLB Function, SetUB Function
AdjLBI Function
Format
i = adjlbi(lower_cmc1,upper_cmc1,lower_cmc2,upper_cmc2,interval);
Description
The adjlbi function adjusts the lower bound of the century month code (CMC) of an event based on an interval. It is
generally used to calculate the minimum next date of birth based on a date of birth and a minimum number of months
allowed between births.
The lower_cmc1 argument is a numeric expression generally representing the date of birth CMC of a child. The
upper_cmc1 argument is not used (but is used by adjubi). The lower_cmc2 and upper_cmc2 arguments are numeric
expressions generally representing the date of birth CMC range of the child that was born after the child represented in
lower_cmc1. The interval argument is a numeric expression expressing the minimum number of months between
events, which is generally the minimum number of months allowed to pass between a woman's successive childbirths.
Return Value
The function returns the adjusted lower bound of the CMC of an event based on the number of months represented by the
interval. If the adjusted lower bound is greater than upper_cmc2, the function returns the error code of -1. It the adjusted
lower bound is otherwise valid, the function returns either the adjusted lower bound or lower_cmc2, whichever is greater.
See also: AdjUBI Function, AdjLBA Function, AdjUBA Function, CMCode Function, SetLB Function, SetUB Function
AdjUBA Function
Format
i = adjuba(lower_cmc1,upper_cmc1,lower_cmc2,upper_cmc2,age);
Description
The adjuba function adjusts the upper bound of the century month code (CMC) of an event based on an age. It is
generally used to calculate the maximum date of birth based on the date of an interview.
The lower_cmc1 and upper_cmc1 arguments are numeric expressions generally representing the date of birth CMC
range. The lower_cmc2 argument is not used (but is used by adjlba). The upper_cmc2 argument is a numeric
expression of the CMC of an event, generally the date of an interview. The age argument is a numeric expression
expressing the age of the person in complete years.
Return Value
The function returns the adjusted upper bound of the CMC of an event based on the age. If the adjusted upper bound is
less than upper_cmc1, the function returns the error code of -1.
Example
t = adjuba(ldob,udob,di,di,HL6);
See also: AdjLBA Function, AdjLBI Function, AdjUBI Function, CMCode Function, SetLB Function, SetUB Function
AdjUBI Function
Format
i = adjubi(lower_cmc1,upper_cmc1,lower_cmc2,upper_cmc2,interval);
Description
The adjubi function adjusts the upper bound of the century month code (CMC) of an event based on an interval. It is
generally used to calculate the maximum previous date of birth based on a date of birth and a minimum number of
months allowed between births.
The lower_cmc1 and upper_cmc1 arguments are numeric expressions generally representing the date of birth CMC
range of the child that was born prior to the child represented in upper_cmc2. The lower_cmc2 argument is not used
(but is used by adjlbi). The upper_cmc2 argument is a numeric expression generally representing the date of birth CMC
of a child. The interval argument is a numeric expression expressing the minimum number of months between events,
which is generally the minimum number of months allowed to pass between a woman's successive childbirths.
Return Value
The function returns the adjusted upper bound of the CMC of an event based on the number of months represented by
the interval. If the adjusted upper bound is less than lower_cmc1, the function returns the error code of -1. It the adjusted
Example
t = adjubi(ldc(i),udc(i),ldc(j),udc(j),7);
See also: AdjLBI Function, AdjLBA Function, AdjUBA Function, CMCode Function, SetLB Function, SetUB Function
DateAdd Function
Format
i = dateadd(start_date, period 『 ,period_type』 );
Description
The dateadd function calculates a new date by adding a period to a starting date. The numeric expression start_date
must be in YYYYMMDD (year/month/day) format. If no year is present, then the current or previous year is assumed in
order satisfy the condition that the date is not in the future (based on the computer's system clock). The numeric
expression period is the amount of time that is added to the starting date, and defaults to days. Alternatively, an
optional string expression, period_type, modifies the period, and can be one of the following values:
period_type Description
"d" days (default)
"w" weeks
"m" months
"y" years
Return Value
The function returns a date calculated by adding the period to the starting date. If the starting date cannot be processed,
the function returns default.
Examples
dateadd(20121225, 7); // returns 20130101
dateadd(20120228, 1); // returns 20120229
dateadd(20130228, 1); // returns 20130301
dateadd(20040820, 3, "w"); // returns 20040910
dateadd(20040820, 3, "m"); // returns 20041120
dateadd(20040820, 3, "y"); // returns 20070820
dateadd(20001010, -3, "m"); // returns 20000710
DateDiff Function
Format
i = datediff(start_date, end_date 『 ,diff_type』 );
Description
Pa ge 601 of 684 Da te a nd Time Func ons
The datediff function returns the difference between two dates as a number, defaulting to the number of days between
the dates. This function acts similarly to Microsoft Excel's datedif function. The numeric expressions start_date and
end_date must be in YYYYMMDD (year/month/day) format. If no year is present then the current or next year is
assumed in order to satisfy the condition that the start date is earlier than the end date. If years are present, it is
possible for the start date to be later than the end date, in which case the function returns a negative difference. An
optional string expression, diff_type, indicates the type of date difference requested, and can be one of the following
values:
diff_type Description
"d" days between the start and end dates (default)
"w" weeks between the start and end dates
"m" months between the start and end dates
"y" years between the start and end dates
"md" days between the start and end dates ignoring both the years and months of the dates
"ym" months between the start and end dates ignoring the years of the dates
"yd" days between the start and end dates ignoring the years of the dates
Return Value
The function returns the requested difference in dates, defaulting to the number of days between the dates. If the dates
cannot be processed, the function returns default. The function returns values equal to Excel's function for most
arguments, but due to processing differences involving leap years, sometimes CSPro's function will return a value
different from Excel's.
Example 1
datediff(19790404, 19820605); // returns 1158
datediff(19790404, 19820605, "d"); // returns 1158
datediff(19790404, 19820605, "w"); // returns 165
datediff(19790404, 19820605, "m"); // returns 38
datediff(19790404, 19820605, "y"); // returns 3
datediff(19790404, 19820605, "md"); // returns 1
datediff(19790404, 19820605, "ym"); // returns 2
datediff(19790404, 19820605, "yd"); // returns 62
Example 2
numeric date1 = 20090120;
numeric date2 = 20121106;
string outputText = "The difference between January 20, 2009 and November 6, 2012 is %d years,
%d months, and %d days";
errmsg(outputText, datediff(date1, date2, "y"), datediff(date1, date2, "ym"), datediff(date1,
date2, "md"));
// returns ... 3 years, 9 months, and 17 days
DateValid Function
Format
b = datevalid(date);
Description
Return Value
The function returns 1 (true) if the date is valid and 0 (false) otherwise.
Example
datevalid(20120229); // returns 1
datevalid(20130229); // returns 0
PublishDate Function
Format
i = publishdate();
Description
The publishdate function returns the date and time that the code was compiled. Generally, this is not particularly
meaningful because the code is compiled right before it is executed. The exception to this is when you create compiled
binary data entry applications. These files, with the extension .pen, are used to run applications on mobile devices. In the
case of a .pen file, the value returned by the function will be the date and time that the .pen file was created.
Return Value
The function returns a number in the form YYYYMMddHHmmss , where YYYY is the year, MM is the month, dd is the
day, HH is the hour, mm is the minute, and ss are the seconds.
Example
numeric publishDay = int(publishdate() / 1000000); // ignore the time
if datediff(publishDay,sysdate("YYYYMMDD")) > 20 then
errmsg("This program expires after 20 days. Sync to get a new program.");
stop(1);
endif;
See also: DateAdd Function, DateDiff Function, SysDate Function, SysTime Function
SetLB Function
Format
i = setlb(month, year, minimum_cmc);
Description
The setlb function returns the lower bound of the century month code (CMC) of an event. The arguments month, year,
and minimum_cmc are all numeric expressions.
If the month and year are both valid, the function returns the CMC of that date, as if calling:
If the month is invalid but the year is valid, the function returns the CMC of January in the valid year, as if calling:
i = cmcode(1, year);
If both the month and year are invalid, the function returns the lower bound represented as minimum_cmc.
Return Value
The function returns the lower bound of the CMC of an event.
Example
if CM12Y < 9997 then
ldoblc = setlb(CM12M, CM12Y, 0);
udoblc = setub(CM12M, CM12Y, 9999);
endif;
See also: SetUB Function, CMCode Function, AdjLBA Function, AdjLBI Function, AdjUBA Function, AdjUBI Function
SetUB Function
Format
i = setub(month, year, maximum_cmc);
Description
The setlb function returns the upper bound of the century month code (CMC) of an event. The arguments month, year,
and maximum_cmc are all numeric expressions.
If the month and year are both valid, the function returns the CMC of that date, as if calling:
i = cmcode(month, year);
If the month is invalid but the year is valid, the function returns the CMC of December in the valid year, as if calling:
i = cmcode(12, year);
If both the month and year are invalid, the function returns the upper bound represented as maximum_cmc.
Return Value
The function returns the upper bound of the CMC of an event.
Example
if CM12Y < 9997 then
ldoblc = setlb(CM12M, CM12Y, 0);
udoblc = setub(CM12M, CM12Y, 9999);
endif;
See also: SetLB Function, CMCode Function, AdjLBA Function, AdjLBI Function, AdjUBA Function, AdjUBI Function
SysDate Function
Pa ge 604 of 684 Da te a nd Time Func ons
Format
i = sysdate(『 date_format『 ,timestamp_value』 』 );
Description
The sysdate function returns the current system date as an integer. The optional string expression date_format is
composed of a combination of DD (days), MM (months), and/or YY or YYYY (years). YY returns the current year in two
digits, while YYYY returns it in four digits. The strings DD, MM and YY or YYYY can be put together in any order to
make up a customized format. If no date format is specified, the function returns the date in the format YYMMDD.
Passing an optional numeric expression, timestamp_value, means that instead of using the system date, the function
will use the date, in the local machine's time zone, of a date/time value that came from timestamp function.
The current date can be returned as a string using the edit function as follows:
edit("9999-99-99",sysdate("YYYYDDMM"));
Return Value
The function returns the system date as an integer. If the date format is invalid, the function returns 0.
Example
// if the current date is December 17, 1999...
x = sysdate("DDMMYYYY"); // returns 17121999
x = sysdate("MMYYYY"); // returns 121999
x = sysdate("DD"); // returns 17
x = sysdate(); // returns 991217
// if the current date is March 8, 2000...
x = sysdate("DDMMYYYY"); // returns 8032000
x = sysdate("MMYYYY"); // returns 32000
x = sysdate("MMYY"); // returns 300
x = sysdate("DD"); // returns 8
x = sysdate(); // returns 308
See also: DateAdd Function, DateDiff Function, DateValid Function, PublishDate Function, SysTime Function,
Timestamp Function
SysTime Function
Format
i = systime(『 time_format『 ,timestamp_value』 』 );
Description
The systime function returns the current system time as a six-digit integer in the form HHMMSS, where HH is the hour,
MM are the minutes, and SS are the seconds.
Optionally, a string expression time_format can be supplied, using a combination of HH, MM, and SS, to extract only
part of the current system time.
Passing an optional numeric expression, timestamp_value, means that instead of using the system time, the function
will use the time, in the local machine's time zone, of a date/time value that came from timestamp function.
The current time can be returned as a string using the edit function as follows:
Return Value
The function returns the system time as an integer.
Example
SURVEY_START_HOUR = systime("HH");
SURVEY_START_MINUTE = systime("MM");
Timestamp Function
Format
d = timestamp();
Description
The timestamp function returns the UNIX time, which is, in Coordinated Universal Time (UTC), the number of seconds
that have elapsed since January 1, 1970. This value is used by many computer programs and is thus useful for storing
times as well as calculating timespans. Because the time is in UTC, it eliminates the need to worry about time zones
when working with times.
Return Value
The function returns the UNIX time as a decimal value. The integer part of this value is the number of elapsed seconds,
while the decimal part contains the milliseconds.
Example
PROC GLOBAL
numeric surveyStartTime;
PROC CROPS_QUEST
preproc
surveyStartTime = timestamp();
postproc
SURVEY_DURATION = timestamp() - surveyStartTime;
See also: TimeString Function, SysTime Function, SysDate Function, FileTime Function
TimeString Function
Format
s = timestring(『 timestamp_format』 『 ,timestamp_value』 );
Description
The optional string expression timestamp_format contains date and time formatters. You can view a list of these
formatters on a webpage about the C function strftime. If no formatter is supplied, the function returns an expression
based on the machine's local settings (using the formatter "%c").
The optional numeric expression timestamp_value allows you to use a date/time value that came from timestamp
function. If not supplied, the function uses the current date/time.
If querying a SQL database via the sqlquery function, you can call this function with the name cspro_timestring.
Return Value
The function returns the custom formatted date and time string.
Example
numeric csproFirstReleaseTimestamp = 957528000;
// displays a string similar to: Fri May 5 08:00:00 2000
errmsg("%s",timestring(csproFirstReleaseTimestamp));
// displays a string similar to: Friday, May 5, 2000
errmsg("%s",timestring("%A, %B %e, %Y",csproFirstReleaseTimestamp));
Description
The clear function initializes the memory values of data items defined for the external or working storage dictionary
dictionary_name. Numeric items are initialized to notappl (blank) for external dictionaries and 0 for working storage
dictionaries. Alphanumeric items are initialized to blank. The function will also set the number of occurrences of records
and items to 0.
Return Value
The function returns a logical value 1 (true) if successful and 0 (false) otherwise.
Example
clear(OTHERS_DICT);
enter OTHERS_FF;
Close Function
Format
b = close(dictionary_name ‖ file_handler);
Description
The close function closes a file, whether opened as a dictionary or via a file statement. The function closes either a
dictionary named dictionary_name or a text file associated with file_handler.
If closing a dictionary, the dictionary will be set to a none data source and will remain set to that source until the previous
file is reopened using the open function or if a new file is associated with the dictionary using the setfile function.
Return Value
The function returns a logical value of 1 (true) if successful and 0 (false) otherwise.
Example
CountCases Function
Format
i = countcases(dictionary_name 『 where condition』 );
Description
The countcases function counts the number of cases in a file referenced by dictionary_name. The dictionary can be
either the main input dictionary or an external dictionary. An optional logical condition, referencing items in a dictionary,
can be supplied to count the number of cases in the file that meet a certain criterion. When using this condition, the
dictionary must be an external dictionary.
It is possible, by specifying dictionary access parameters after the dictionary name, to limit the cases that the
countcases function processes.
Return Value
The function returns the number of cases in the file, or if a condition is given, the number of cases that meet that
condition. If there is an error processing the file, the function returns default.
Example
PROC NEW_INTERVIEWER_NAME
if countcases(STAFF_DICT where STAFF_NAME = NEW_INTERVIEWER_NAME) >= 1 then
errmsg("You must give the interviewer a unique name.");
reenter;
endif;
DelCase Function
Format
b = delcase(dictionary_name『 , id1, ..., idN』 );
Description
The delcase function marks a case for deletion in an external file.
The dictionary_name must be supplied and refers to an external dictionary associated with your application.
The optional list of ID variables, id1 ... idN, specifies the list of variables that will identify the case to delete from the
external file. Generally, each of the variables in the list is defined in a dictionary. The combined length of the variables in
the list must equal the length of the case IDs defined for the external dictionary. The function concatenates the variables
Pa ge 609 of 684 Externa l File Func ons
in the ID list to form a string. Alternatively, you can pass, as a string, the full key of the case to delete. The function then
deletes from the external file the case whose case identifier matches the string constructed from the list.
If no ID list is provided, the current values of the ID items for the external dictionary are used.
Return Value
The function returns a logical value of 1 (true) if the case existed and was successfully marked for deletion and 0 (false)
otherwise.
Example
delcase(ASSIGNMENT_DICT, CLUSTER, HH);
DirCreate Function
Format
b = dircreate(directory_name);
Description
The dircreate function creates a new directory whose name comes from the string expression directory_name. If
necessary, parent directories will be added to support the creation of the directory.
Return Value
The function returns a logical value of 1 (true) if successful and 0 (false) otherwise. If the directory already exists, the
function returns 1.
Example
string backupDirectoryName = pathname(InputFile) + "Backups";
dircreate(backupDirectoryName);
DirDelete Function
Format
i = dirdelete(directory_name);
Description
The dirdelete function deletes an empty directory or group of empty directories. The argument directory_name is a
string expression giving a directory name.
You can use the wildcard characters "*" and "?" to specify a group of directories to delete.
Return Value
Example
// delete directory "Backups" from the application folder
dirdelete(pathname(Application) + "Backups");
// delete all directories that begin with "Backup" from the application folder
dirdelete(pathname(Application) + "Backup*");
See also: DirCreate Function, DirExist Function, DirList Function, FileDelete Function
DirExist Function
Format
b = direxist(directory_name);
Description
The direxist function determines whether a directory exists. directory_name is a string expression.
Return Value
The function returns a logical value of 1 (true) if the directory exists and 0 (false) otherwise.
Example
if direxist("C:\Backups") then
// process backups
endif;
DirList Function
Format
b = dirlist(directory_listing,directory_name『 ,wildcard_filter』 『 ,recursive』 );
Description
The dirlist function fills directory_listing, a string list, with the paths of files and directories located in a directory
specified by the string expression directory_name. A wildcard filter can be passed as an optional third string
expression, wildcard_filter. An optional fourth argument, recursive, instructs the function to create a listing of the
specified directory as well as any subdirectories. All file and directory names will be fully-qualified; i.e., they will be
absolute, not relative, path names.
Return Value
The function returns a logical value of 1 (true) if successful and 0 (false) if the directory does not exist or cannot be
accessed.
Example
FileConcat Function
Format
b = fileconcat(『 dictionary_name,』 output_file_name, file_name1『 , ..., file_nameN』 );
Description
The fileconcat function concatenates a number of files into a single file. The files to concatenate can be either
individual file names or wildcard file specifications. The function supports two different methods of concatenation:
Concatenate cases: Reads cases from each input file and writes them to the output file, skipping invalid and
duplicate cases. This mode requires a data dictionary that describes the format of the input and output data files
so that cases may be validated and duplicates may be identified. Concatenate cases mode may be used with all
types of CSPro data files.
Concatenate file contents: Simply appends one file to the end of another regardless of the file contents. This
mode will not work for CSPro data files that are not text files (such as CSPro DB files).
To peform a case concatenation, you must specify a dictionary_name that specifies the layout of the data files being
concatenated. When the dictionary is specified, the files are combined by reading each case from each of the input files
and writing them to the output file. In this mode, invalid cases and duplicate cases are automatically removed from the
output file.
To perform content concatenation, do not specify a dictionary. In this case, the input files are appended together without
examining their contents. This will be faster but will not eliminate duplicates or invalid cases.
The output_file_name is a string expression giving the name of the output file that will contain the concatenated data.
The arguments file_name1 to file_nameN are either string expressions or string lists that contain the names of specific
files or a wildcard specification of a group of files.
Return Value
The function returns a logical value of 1 (true) if successful and 0 (false) otherwise.
Example 1
fileconcat("DMV.txt", "District of Columbia.txt", "Maryland.txt", "Virginia.txt");
Example 2
fileconcat(CENSUS_DICT, "14combined.csdb", "../Data/14*.csdb");
FileCopy Function
Pa ge 612 of 684 Externa l File Func ons
Format
i = filecopy(file_name, output_file_name);
Description
The filecopy function copies one file to another file or copies a group of files to a folder. The file_name is a string
expression or string list giving the source file name(s). The output_file_name is a string expression giving the
destination file or folder name. The arguments can also be file handlers declared in file statements.
You can use the wildcard characters "*" and "?" to specify a group of files to copy as the file_name. If using wildcard
characters, the output_file_name must be the name of a folder.
Return Value
The function returns the number of files copied. If there was an error copying files, the function returns default.
Example 1
filecopy("Virginia.csdb", "Virginia_Backup.csdb");
Example 2
string backup_folder_name = pathconcat(Application, maketext("Backup%d", timestamp()));
dircreate(backup_folder_name);
filecopy(pathconcat(Application, "*.csdb"), backup_folder_name);
FileCreate Function
Format
b = filecreate(file_name);
Description
The filecreate function creates a new file named after the string expression (or file handler) file_name. If the file
already exists, it is truncated to length three (the bytes in the file will consist only of the UTF-8 byte order mark).
Return Value
The function returns a logical value of 1 (true) if successful and 0 (false) otherwise.
Example
string report_filename = pathconcat(Application, "EA Completion Rate.html");
filecreate(report_filename);
FileDelete Function
Format
Pa ge 613 of 684 Externa l File Func ons
i = filedelete(file_name);
Description
The filedelete function deletes a file or group of files. The argument file_name is either a string expression or string
list giving a file name or names, or is a file handler declared in a file statement.
You can use the wildcard characters "*" and "?" to specify a group of files to delete.
Return Value
The function returns the number of files deleted. If there was an error deleting files, the function returns default.
Example
// delete file "Household.csdb" from the backups folder
filedelete(pathconcat(Application, "Backups/Household.csdb"));
// delete all data from the backups folder
filedelete(pathconcat(Application, "Backups/*.csdb"));
FileEmpty Function
Format
b = fileempty(file_name);
Description
The fileempty function determines whether or not a file is empty and has no content. This function is useful for
checking the status of files without needing to know the text encoding. An empty ANSI file is 0 bytes but an empty UTF-
8 file is 3 bytes. The argument file_name is either a string expression giving a file name or is a file handler declared in a
file statement in PROC GLOBAL.
Return Value
The function returns a logical value of 1 (true) if the file exists and is empty and 0 (false) if the file is not empty. If the file
does not exist, the function returns default.
Example
if fileempty(customReportFile) then
// print the header
filewrite(customReportFile,"SECONDS,KEYSTROKES");
endif;
FileExist Function
Format
b = fileexist(file_name);
You can use the wildcard characters "*" and "?" in the file_name. In that case the function will determine whether any
file matching the specification exists.
Prior to CSPro 6.2, the function determined whether either a file or a directory existed. The direxist function must now
be used to check for the existence of a directory.
Return Value
The function returns a logical value of 1 (true) if the file exists and 0 (false) otherwise.
Example
if fileexist(householdPffFilename) then
filedelete(householdPffFilename);
endif;
FileName Function
Format
s = filename(dictionary_name ‖ file_handler ‖ pff_name ‖ audio_name);
Description
The filename function returns the fully qualified name of a file. If supplying a dictionary_name, the function returns the
name of the data file associated with a data dictionary. If providing a file_handler (declared in a file statement), the
function returns the name of the text file associated with the file handler.
If providing a pff_name (declared in a pff statement), the function returns the name of an existing PFF file, or the name
of a temporarily created PFF file. If a PFF file was loaded (via pff.load) and not modified, the name of that PFF file is
returned. If the PFF has been modified and not saved, then calling filename will save the PFF to a temporary file and the
name of that file is returned.
If providing an audio_name (declared in an audio statement), the function returns the name of the audio file associated
with the audio object.
The function can also take paradata as its argument, in which case it returns the file name of the currently open
paradata log.
Return Value
The function returns a string containing the folder and file name.
Example
filewrite(report_file, "Report file: %s", filename(report_file));
filewrite(report_file, "Census file: %s", filename(CENSUS_DICT));
See also: Path Namespace, PathName Function, FileExist Function, FileSize Function
Description
The fileread function reads one or more lines of text from the file associated with file_handler. After the read the file
pointer is positioned to the next line in the file. This function reads lines sequentially.
If a string_variable is provided as a argument, a single line of text is read and placed in the variable, which is either a
temporary string variable or an alphanumeric dictionary item. If the dictionary item is longer than the line of text, blanks
will be added at the end. If the item is shorter, the line of text will be truncated.
Alternatively, a string list can be used as an argument. In this case, all remaining lines in the file are read and stored in
string_list.
Return Value
The function returns a logical value of 1 (true) if successful and 0 (false) otherwise.
Example
PROC GLOBAL
file OperatorIdOverrideFile;
PROC APPLICATION_FF
preproc
string operatorid_override;
if fileread(OperatorIdOverrideFile, operatorid_override) then
setoperatorid(operatorid_override);
endif;
FileRename Function
Format
b = filerename(old_file_name, new_file_name);
Description
The filerename function changes the name of a file or group of files on the disk. The argument old_file_name is a
string expression or string list giving the existing file name(s). The argument new_file_name is a string expression giving
the new file name or destination folder name. The arguments can also be file handlers declared in a file statement.
You can use the wildcard characters "*" and "?" to specify a group of files to rename as the old_file_name. If using
wildcard characters, the new_file_name must be the name of a folder.
Return Value
Example
filerename("data.csdb", maketext("old_data%d.csdb",timestamp()));
FileSize Function
Format
i = filesize(file_name);
Description
The filesize function returns the size of a file in bytes. The argument file_name is either a string expression giving a
file name or is a file handler declared in a file statement in PROC GLOBAL.
Return Value
If the file exists, the function returns the size of the file in bytes. If the file does not exist, the function returns default.
Example 1
if filesize("report.txt") > 0 then
errmsg("The report already exists!");
endif;
Example 2
PROC GLOBAL
file reportFile;
PROC EXAMPLE
numeric reportSize = filesize(reportFile);
FileTime Function
Format
d = filetime(file_name);
Description
The filetime function returns the last modified date and time of a file or a directory. The argument file_name is either a
string expression giving a file or directory name or is a file handler declared in a file statement in PROC GLOBAL.
Return Value
If the file or directory exists, the function returns a timestamp value indicating the last modified date and time. If the file or
Example
list string photos_list;
dirlist(photos_list, "../Photos", "*.jpg");
do numeric counter = 1 while counter <= photos_list.length()
// only sync photos that weren't previously synced
if filetime(photos_list(counter)) > last_sync_timestamp then
syncfile(PUT, photos_list(counter), "/Photos/");
endif;
enddo;
FileWrite Function
Format
b = filewrite(file_handler, message『 , argument1, ..., argumentN』 );
Description
The filewrite function writes one or more lines of text to the file associated with file_handler.
The message is either a string expression or a numeric message number that contains the text that is written to the file.
If the text contains any message formatters, the optional arguments argument1 to argumentN will be inserted into the
text. There are some additional options for file output:
If you want to break a line of text into two lines, place \n where you want the line divided.
If you want a text line to begin on a new page, place \f at the beginning of the text string.
To output \n or \f as text instead of a new line or a new page, use a double backslash (e.g., \\n).
Alternatively, the message can be a string list. If a list is provided, then each string contained in the list is written to the
file, allowing for the output of multiple lines of text with one function call.
Return Value
The function returns a logical value of 1 (true) if successful and 0 (false) otherwise.
Example
PROC GLOBAL
file CsvFile;
PROC EXAMPLE
filewrite(CsvFile, "Name,Sex,Age");
do numeric counter = 1 while counter <= totocc(PERSON_REC)
filewrite(CsvFile, "%s,%d,%d", encode(CSV, NAME(counter)), SEX(counter), AGE(counter));
enddo;
See also: FileRead Function, File.Write Function, Write Function, Message Formatting Options, Encode Function
Description
The find function determines the existence of a case in a data file that matches a specified condition. The function
searches the index of a file and determines whether any case matches the specified condition. The position in the file is
not changed, unlike the similar locate function, and no case is loaded into memory.
The dictionary_name refers to either an external dictionary or the main input file of an entry or batch application.
The relational_operator is one of the following: =, <, <=, >, >=, or startswith.
The string expression key_prefix specifies the condition to use when searching for cases. Cases with a key that begins
with or equals the key_prefix are considered eligible cases.
An additional option, using uuid instead of a relational operator, determines the existence of a case based on the case's
UUID, not the case's key. This may be useful when looking up duplicate or deleted cases.
Return Value
The function returns a logical value of 1 (true) if a case is found and 0 (false) otherwise.
Example
PROC OCCUPATION
if not find(OCCUPATION_CODES_DICT, =, maketext("%04d", OCCUPATION)) then
errmsg("The occupation %d is not valid.", OCCUPATION);
reenter;
endif;
Key Function
Format
s = key(dictionary_name);
Description
The key function returns a string containing the key of the case currently loaded in the file referenced by
dictionary_name. A dictionary key is a single string comprised of the IDs for a case.
Return Value
The function returns a string containing the key. If there has been no previous activity on the file and no key has been
established, the key function returns a blank string.
Example
KeyList Function
Format
i = keylist(dictionary_name『 , key_listing』 );
Description
The keylist function fills key_listing, an optional string list, with the keys in the file referenced by dictionary_name.
The dictionary can be either the main input dictionary or an external dictionary. A dictionary key is a single string
comprised of the IDs for a case.
It is possible, by specifying dictionary access parameters after the dictionary name, to limit the keys that the keylist
function returns.
Return Value
The function returns the number of cases in file.
Example 1
PROC ACTION
if ACTION in 1,3,6 and keylist(CENSUS_DICT) = 0 then // actions that require cases
errmsg("No cases have been entered yet.");
reenter;
endif;
Example 2
PROC HOUSEHOLD_NUMBER
list string enteredKeys;
keylist(CENSUS_DICT, enteredKeys);
do numeric keyCtr = 1 while keyCtr <= enteredKeys.length()
if HOUSEHOLD_NUMBER = tonumber(enteredKeys(keyCtr)[9:2]) then
errmsg("The household number %d has already been entered.", HOUSEHOLD_NUMBER);
reenter;
endif;
enddo;
LoadCase Function
Format
b = loadcase(dictionary_name『 , id1, ..., idN』 );
The dictionary_name must be supplied and refers to an external dictionary associated with your application.
The optional list of ID variables, id1 ... idN, specifies the list of variables that will identify the case to load from the
external file. This process is similar to matching files on the basis of key variables in statistical and database software
packages. Generally, each of the variables in the list is defined in a dictionary. The combined length of the variables in
the list must equal the length of the case IDs defined for the external dictionary. The function concatenates the variables
in the ID list to form a string. Alternatively, you can pass, as a string, the full key of the case to load. The function then
loads from the external file the case whose case identifier matches the string constructed from the list.
If no ID list is provided, the next logical case in the external file will be loaded. The determination of what is "next" is
based on the dictionary's access parameters.
(In versions of CSPro prior to 7.0, calling this function would automatically open an external file. The function no longer
does this, so if you call close on an external dictionary, you must open it manually before calling this function.)
Return Value
The function returns a logical value of 1 (true) if the case was loaded successfully and 0 (false) otherwise.
Example
if loadcase(SAMPLE_DICT, CLUSTER, HH) then
WEIGHT = SAMPLE_WEIGHT;
endif;
Locate Function
Format
b = locate(dictionary_name, relational_operator, key_prefix);
Description
The locate function finds, but does not load, a case in a data file that matches a specified condition. The function
searches the index of a file and finds the first case that matches the specified condition. The case pointer is positioned
to the case's location, but the case is not loaded into memory. To load the case into memory, use the retrieve function
or the loadcase function without arguments.
The dictionary_name refers to either an external dictionary or the main input file of a batch application.
The relational_operator is one of the following: =, <, <=, >, >=, or startswith.
The string expression key_prefix specifies the condition to use when searching for cases. Cases with a key that begins
with or equals the key_prefix are considered eligible cases. If the relational operators are < or <=, then the file is
positioned at the case with the largest key which satisfies the condition. If the relational operators are > or >=, then the
file is positioned at the case with the smallest key which satisfies the condition. If startswith is used, the file is
positioned at the first case whose key starts with the key_prefix.
An additional option, using uuid instead of a relational operator, determines the existence of a case based on the case's
UUID, not the case's key. This may be useful when looking up duplicate or deleted cases.
Example
if locate(SURVEY_DICT, >=, "55") then
do while loadcase(SURVEY_DICT) and PROVINCE = 55
// process all cases in province 55
enddo;
endif;
See also: Find Function, Set First Statement, Set Last Statement, Retrieve Function, LoadCase Function
NMembers Function
Format
i = nmembers(dictionary_name 『 mark_type』 );
Description
The nmembers function returns the number of cases selected by an operator during a selcase function call.
The dictionary_name must be supplied and refers to an external dictionary associated with your application. An optional
argument, mark_type, indicates what kinds of cases to process, and can be one of the following values:
mark_type Description
marked Process only the cases selected by the operator during the selcase function call.
unmarked Process the cases not selected by the operator.
all Process all of the cases that could have been selected by the operator.
Return Value
The function returns the number of cases that are described by the mark_type query.
Example
while 1 do
selcase(OCCUPATION_DICT,"Plantation") multiple;
if nmembers(OCCUPATION_DICT) < 3 then
errmsg("You must select at least three options.");
else
break;
endif;
enddo;
Description:
The open function opens and keeps open an external file or file declared in a file statement.
Under most circumstances, neither an open nor a close function is necessary to manipulate the file. In data entry
applications, by default, the file is opened when it is operated on with a file function, such as loadcase, writecase,
readfile, or writefile and closed immediately afterward. In batch applications, by default, the file is opened at the
beginning of the run and closed at the end.
If you want to control the opening and closing of the file, you can use the open and close functions to do this. If you
code an open function anywhere is the application logic, then you must control all opening and closing of the file
The ext-dict-name or file-name must be supplied. Ext-dict-name is the name of a data dictionary defining an external
data file. File-name is the name of a file declared in a file statement.
The keywords update, append or create are optional. If no keyword is coded, the file is opened in update mode.
When an ext-dict-name is used, if update or append is used, the file is opened, its contents are not changed, and
the file is ready to update cases. If create is coded, the file is opened, all previous records are removed and the file is
ready to add cases.
When a file-name is used, if update is used, the file is opened and you are positioned at the beginning of the file. If
append is coded, the file is opened, its contents are not changed, and you are positioned at the end of the file. If
create is coded, the file is opened, all previous records are removed and you are positioned at the beginning of the
file.
Return value:
The function returns a logical value of 1 (true) if file is opened and 0 (false) otherwise.
Example 1:
OK = open(LOOKUP);
Example 2:
OK = open(REPORT, append);
Retrieve Function
Format
b = retrieve(dictionary_name);
Description
The retrieve function reads a case into memory from the current position of an external file. It is intended for use only
after a successful execution of the locate function. The dictionary_name must refer to an external dictionary.
Return Value
The function returns a logical value of 1 (true) if a case is retrieved and 0 (false) otherwise.
Example
Description
The set access statement is used to control the way that CSPro iterates through a data file. By default, CSPro
processes all non-deleted cases in file order for the main input file of a batch application or in ascending key order (e.g.,
A -> B -> C) for all other files. With this statement, you can change the default behavior to process cases in a different
order or to control what cases are processed.
The dictionary_name refers to either an external dictionary or the main input file of a batch application. If the dictionary
access parameters are modified while in the middle of iterating through the data file, the next case loaded will be based
on the location of the currently loaded case. For example, if you are running a batch application on cases 1 to 5, and
then after processing case 3 you change the order to descending, the next cases loaded will be 2 and 1.
Order Ascending Iterate from low to high value (key or file position) (default)
Descending Iterate from high to low value (key or file position)
An additional parameter that can be used with these functions is startswith, functionality that allows for partial iteration
of only cases with keys that start with the specified key prefix.
Example 1
Example 2
forcase CENSUS_DICT(CaseStatus.Partial) do
interview_vs.add("Continue interview: " + getcaselabel(CENSUS_DICT), key(CENSUS_DICT));
endfor;
Example 3
// only process assignments in the area where the supervisor is working
forcase GEOCODES_DICT(startswith, strip(MENU_GEOCODE_SELECTION)) do
// ...
endfor;
See also: Set First Statement, Set Last Statement, Locate Function
SetFile Function
Format
b = setfile(dictionary_name ‖ file_handler, file_name『 , update ‖ append ‖ create』 );
Description
The setfile function assigns a data file to the dictionary associated with dictionary_name or associates the file
file_handler to a text file on the disk. The file_name argument is a string expression containing the file name of the file
to be associated with the dictionary or file. With dictionaries, the file_name can also be a connection string.
Using the keyword update, append or create is optional. If no keyword is used, the file is opened in update mode.
Dictionary behavior: If update or append is used, the data file's contents are not changed and the file is ready to
update cases. If create is used, all previous cases are removed and the file is ready to add cases.
File behavior: If update is used, you are positioned at the beginning of the file. If append is used, the file's contents are
not changed and you are positioned at the end of the file. If create is used, all previous content in the file is removed and
you are positioned at the beginning of the file.
For both dictionaries and files, if create or append is used and the file does not already exist, a new empty file will be
created. If update is used and the file does not already exist, the function will fail and return 0.
Return Value
The function returns a logical value of 1 (true) if the new file is successfully assigned and 0 (false) otherwise.
Dictionary Example
if REGION = 1 then
setfile(GEOCODES_DICT, "Eastern.csdb");
else
setfile(GEOCODES_DICT, "Western.csdb");
endif;
See also: File Statement, FileName Function, SetOutput Function, File.Open Function
Description
The set first statement is used, like the locate function, to find a specific case in a data file. The case pointer is
positioned to the first case. Based on the dictionary access parameters, the case is either the first case by indexed
(key) order or by sequential (file) order. To load the case into memory, use the retrieve function or the loadcase
function without arguments.
The dictionary_name refers to either an external dictionary or the main input file of a batch application. Using this
statement with the main input file of a batch application allows you to pass through the same data file more than once in
a single run of a batch application.
Example
PROC MENU
set first(SURVEY_DICT);
do while loadcase(SURVEY_DICT)
// process the cases in ascending key order
enddo;
See also: Set Last Statement, Locate Function, Set Access Statement
Description
The set last statement is used, like the locate function, to find a specific case in a data file. The case pointer is
positioned to the last case. Based on the dictionary access parameters, the case is either the last case by indexed
(key) order or by sequential (file) order. To load the case into memory, use the retrieve function or the loadcase
function without arguments.
The dictionary_name refers to either an external dictionary or the main input file of a batch application.
Example
See also: Set First Statement, Locate Function, Set Access Statement
Write Function
Format:
[b =] write(alpha-exp[,p1[,p2[,...,pn]]]);
Description:
The write function writes text to a write file that can be used as a report. Each argument (e.g., "p1") is sequentially
inserted into the write string. Arguments can be numeric or alphanumeric expressions, but the type of the argument
must match the type of the receiving field in the message text. If no write file is specified at run time, the write file lines
are placed in the default data entry or batch error report.
In the string expression
%[n]d = Insert a number and display it as an integer
%[n.d]f = Insert a number and display it as a decimal value
%[n.d]s = Insert a text string
"n" is the size of the field and "d" is the number of decimal places to show for a number.
Numbers are never truncated. Text strings are truncated only if ".d" is used.
If "n" is positive, the insert is right-justified in the size of the field. If "n" is negative, the insert is left-justified in the size
of the field. If "n" is a positive number with a leading zero, the insert is right-justified in the size of the field and zero-
filled to the left. When inserting a number, if "n" is preceded by a "+", the sign of the number is always displayed.
Examples:
Value = %d 23456
23456
%10d 23456
%-10d 23456
%010d 0000023456
%+10d +23456
%+010d +000023456
%f 23456.000000
Value = %f 12.567
12.567
%10.3f 12.567
%-10.3f 12.567
%10.2f 12.57
%10.5f 12.56700
%010.3f 000012.567
%+10.3f +12.567
%+010.3f +00012.567
%d 12
Value = %s abcdef
Pa ge 627 of 684 Externa l File Func ons
Value = %s abcdef
"abcdef"
%10s abcdef
%-10s abcdef
%10.3s abc
%- abc
10.3s
Return Value:
The function returns a logical value 1 (true) if successful and 0 (false) otherwise.
Example:
write("Sex = %d", SEX);
WriteCase Function
Format
b = writecase(dictionary_name);
Description
The writecase function writes a case from memory to an external data file. It can be used to update existing cases or to
write new ones.
The dictionary_name must be supplied and refers to an external dictionary associated with your application.
After a case is written to an external file, the dictionary variables for that case remain in memory. If the application does
not assign new values to all variables in the external dictionary before the next writecase function is executed, then the
values from the previous case will be written to the external data file. Use the clear function to reset the values of these
variables.
No Index Mode
In a batch application it is possible to have the function output cases without updating the file's index. This allows the
external data file to have duplicate cases, i.e., cases sharing the same set of IDs. This may be useful if the batch
application is a tool to reformat large sets of data, a situation in which maintaining the file's index is very time
consuming. Since no check is done for duplicates you should be certain that duplicates cannot be generated, or that you
want duplicates in the data file. You cannot use other external file functions like loadcase or retrieve if using no index
mode. To use this special mode, in the function arguments write (noindex) after the dictionary name.
b = writecase(dictionary_name(noindex));
Return Value
The function returns a logical value of 1 (true) if the write is successful and 0 (false) otherwise.
Example
WORK_RECORD_ID = HOUSEHOLD_ID;
LAST_WORK_TIME = timestamp();
writecase(WORK_RECORD_DICT);
Description
The syncconnect function opens a connection to a server to start a synchronization session. The function has four forms
based on the type of connection, with each form described on a separate page:
CSWeb
Bluetooth
Dropbox
FTP
LocalDropbox
LocalFiles
After a successful connection, call syncdata and syncfile to synchronize data and files between the device and the
server. When finished transferring data, call syncdisconnect to close the connection.
Return Value
The function returns a logical value of 1 (true) if the connection was successful and 0 (false) otherwise.
See also: Synchronization Overview, SyncDisconnect Function, SyncServer Function, SyncData Function, SyncFile
Function, SyncParadata Function
Format
b = syncconnect(CSWeb,csweb_url『 ,username『 ,password』 』 );
Description
The syncconnect function opens a connection to a CSWeb server to synchronize data and files over the Internet. The
first argument must be the keyword CSWeb. The second argument, csweb_url, is a string expression of the full URL of
the CSWeb server. The third and fourth arguments, username and password are optional string arguments of the
credentials used to login to the server. If the username and password are not specified, the user will be prompted to enter
them the first time the synchronization is run. After the credentials have been entered the first time, they are saved in
secure storage on the device and do not need to be entered again. You can use the CSPro Settings dialog to clear any
cached credentials.
For more information about CSWeb servers, see the Synchronization Overview and the CSWeb help documentation.
Pa ge 629 of 684 Synchroniza on Func ons
Example
string cswebServerUrl = "https://www.myserver.org/api";
// connect to the CSWeb server
if syncconnect(CSWeb,cswebServerUrl) then
// send the latest cases to the CSWeb server
syncdata(PUT,SURVEY_DICT);
syncdisconnect();
endif;
See also: Synchronization Overview, SyncConnect Function, SyncConnect Function (Bluetooth), SyncConnect Function
(Dropbox), SyncConnect Function (FTP), SyncConnect Function (LocalDropbox), SyncConnect Function (LocalFiles)
Format
b = syncconnect(Bluetooth『 ,server_device_name』 );
Description
The syncconnect function opens a connection, via Bluetooth, to another device (a computer, laptop, tablet, or phone)
running a local server. The first argument must be the keyword Bluetooth. The second argument, server_device_name,
is an optional string expression of the name of the device to which you want to connect. For Android devices, the device
name can be found in Bluetooth settings on the device running the server. For Windows devices, this is the PC name
that can be found in Windows settings. The getbluetoothname function can be used on the other device to find the
device name. If no server device name is specified, a list of nearby devices is displayed and the operator may choose
which one to connect to.
In order for the two devices to connect successfully, the device running the server must call the function syncserver at
roughly the same time as the call to syncconnect is made by the client device.
When connecting over Bluetooth, no Internet connection is required, however the two devices must be in close physical
proximity. This is useful when transferring data in areas where no Internet connection is available.
Example
string server_name = "supervisor21";
// connect to the supervisor's device using Bluetooth
if syncconnect(Bluetooth, server_name) then
// send the latest cases to the supervisor's device
syncdata(PUT, SURVEY_DICT);
syncdisconnect();
endif;
See also: SyncConnect Function, SyncConnect Function (CSWeb), SyncConnect Function (Dropbox), SyncConnect
Function (FTP), SyncConnect Function (LocalDropbox), SyncConnect Function (LocalFiles), SyncMessage Function,
GetBluetoothName Function
Format
b = syncconnect(Dropbox);
Description
The syncconnect function opens a connection to Dropbox to synchronize data and files over the Internet. The first and
only argument must be the keyword Dropbox. This requires an account with the online file sharing service Dropbox. It
does NOT require that the Dropbox client software be installed on the device. The first time the connection is made, the
user will see a prompt to enter the Dropbox username and password and these settings will be saved in secure storage
for future synchronizations. You can use the CSPro Settings dialog to clear any cached settings.
Example
// connect to Dropbox
if syncconnect(Dropbox) then
// send the latest cases to Dropbox
syncdata(PUT,SURVEY_DICT);
syncdisconnect();
endif;
See also: Synchronization Overview, SyncConnect Function, SyncConnect Function (CSWeb), SyncConnect Function
(Bluetooth), SyncConnect Function (FTP), SyncConnect Function (LocalDropbox), SyncConnect Function (LocalFiles)
Format
b = syncconnect(LocalDropbox);
Description
The syncconnect function opens a connection to Dropbox to synchronize data and files using the Dropbox folder on the
local computer. The first and only argument must be the keyword LocalDropbox. This requires that the Dropbox client
software be installed on the computer. Rather than synchronize files over the Internet this connection relies on the
Dropbox client software to do file synchronization. CSPro simply copies files and data to/from the Dropbox folder. This is
mainly useful when downloading large quantities of data which would take a long time to do using the online Dropbox
connection. The LocalDropbox connection is only supported on Windows.
Example
See also: Synchronization Overview, SyncConnect Function, SyncConnect Function (Dropbox), SyncConnect Function
(LocalFiles), SyncConnect Function (CSWeb), SyncConnect Function (Bluetooth), SyncConnect Function (FTP)
Format
b = syncconnect(LocalFiles, local_path);
Description
The syncconnect function opens a connection to synchronize data and files using the a folder on the local computer.
The first argument must be the keyword LocalFiles and the second argument, the string expression local_path, must
be the path to a local folder on the computer. Rather than synchronize files over the Internet this connection assumes
that some other service, such as an FTP server, will handle file transfers. CSPro simply copies files and data to/from the
local folder. This is mainly useful when downloading large quantities of data which would take a long time to do using a
network connection.
Example
// connect to local folder
if syncconnect(LocalFiles, "C:/FTPDirectory/") then
// download the latest cases
syncdata(GET, SURVEY_DICT);
syncdisconnect();
endif;
See also: Synchronization Overview, SyncConnect Function, SyncConnect Function (CSWeb), SyncConnect Function
(Bluetooth), SyncConnect Function (FTP)
Format
b = syncconnect(FTP,ftp_url『 ,username『 ,password』 』 );
Description
CSPro synchronization can work with FTP servers using encrypted transmission via TLS/SSL. There are two modes that
CSPro supports for encrypted FTP: explicit and implicit. To use explicit TLS/SSL, provide a URL for the server that starts
with "ftpes://". To use implicit TLS/SSL, provide a URL for the server that starts with "ftps://". For servers that do
not support TLS/SSL, use a URL that starts with "ftp://". FTP over SSH (SFTP) is not supported.
You can optionally specify a port number by adding it to the end of the URL, separated by a colon. For example, the
URL "ftp://myserver.com:27" tells CSPro to connect to port 27. If no port is specified, the standard FTP port will be
used (port 21 for FTP or port 990 for FTPS).
Example
string ftpServerUrl = "ftpes://myserver.org";
// connect to the FTP server
if syncconnect(FTP,ftpServerUrl) then
// send the latest cases to the FTP server
syncdata(PUT,SURVEY_DICT);
syncdisconnect();
endif;
See also: Synchronization Overview, SyncConnect Function, SyncConnect Function (CSWeb), SyncConnect Function
(Bluetooth), SyncConnect Function (Dropbox), SyncConnect Function (LocalDropbox), SyncConnect Function
(LocalFiles)
SyncServer Function
Format
b = syncserver(Bluetooth『 ,file_root_path』 );
Description
The syncserver function runs a local server that allows peer-to-peer synchronization between two devices via Bluetooth,
without needing an Internet connection. The function waits for a connection from another device (a client) made when the
client calls syncconnect to start a Bluetooth synchronization. Once the two devices are connected, the client device
calls syncdata and syncfile to copy data and/or files to and from the server. Finally, the client device calls
syncdisconnect to end the session.
The server in a peer-to-peer synchronization is passive. It does not call syncdata or syncfile. It simply responds to
requests initiated from the client. The Bluetooth server runs until either the client successfully connects and disconnects
or the operator cancels the synchronization.
The syncserver function displays a dialog to the operator indicating that it is waiting for a client device to connect.
During this time, no other logic on the server is executed. Once a connection is made, the dialog displays the progress
of the synchronization. When the client disconnects by calling syncdisconnect, the dialog is removed and syncserver
returns. If no connection is made, the operator can use the cancel button on the dialog to cause syncserver to return
and continue executing any logic that follows. The server only allows connections while the syncserver function is
running, so the server must call syncserver before the client device calls syncconnect.
The first argument must be the keyword Bluetooth. The second argument, file_root_path, is an optional string
When connecting over Bluetooth, no Internet connection is required, but the two devices must be in close physical
proximity.
Return Value
The function returns a logical value of 1 (true) if the synchronization was successful and 0 (false) otherwise.
Example 1
// start a peer-to-peer Bluetooth server with the application directory as the file root path
syncserver(Bluetooth);
Example 2
// start a peer-to-peer Bluetooth server with the application parent directory as the file root
path
syncserver(Bluetooth, "..");
See also: Synchronization Overview, SyncConnect Function, SyncData Function, SyncDisconnect Function, SyncFile
Function, SyncMessage Function
SyncDisconnect Function
Format
b = syncdisconnect();
Description
The syncdisconnect function ends a synchronization session previously started by calling syncconnect.
Return Value
The function returns a logical value of 1 (true) if successful and 0 (false) otherwise.
Example
if syncconnect(Dropbox) then
// perform synchronization actions
syncdisconnect();
endif;
See also: Synchronization Overview, SyncConnect Function, SyncData Function, SyncFile Function, SyncParadata
Function, SyncServer Function
Description
The syncdata function transfers cases in a CSPro data file between a device and a synchronization server. Before calling
syncdata, you must first connect to the server by calling syncconnect.
The function can upload cases from the local device (client) to the server as well as download cases from the server. The
direction argument determines which of these operations is performed. It must be one of the following values:
GET: Download any cases that were modified on the server since the last sync and update or add them to the
local data file.
PUT: Upload to the server any cases that were modified in the local data file since the last sync.
BOTH: Sync cases in the local data file with the server in both directions (i.e., perform both a GET and a PUT).
The dictionary_name argument must be the name of a data dictionary corresponding to the data file to synchronize.
This can be either the dictionary for the main data file or an external dictionary. CSPro uses the data file that is currently
associated with this dictionary, either because it was specified when the application was started or via a call to setfile
from application logic.
For synchronization with a CSWeb server, the dictionary must first be uploaded to the server. (See the CSWeb help
documentation for more information.)
For peer-to-peer synchronization, the data is written to the file associated with the dictionary of the same name on the
device running the server. This means that both devices must have this dictionary added to the currently running CSPro
application, either as the main dictionary or as an external dictionary.
By providing an optional string expression for the sync_universe argument, you can limit the cases that are transferred.
The universe is matched against the ID items of each of the cases. Only cases whose ID items concatenated together
match the universe will be transferred. For example, if the universe is "123" then cases with IDs "1234" and "1235"
would be synced but a case with IDs "2345" would not.
The syncdata function keeps track of which cases are transferred each time the client and server are synchronized and
uses this information to only transfer cases that have been modified since the last synchronization. This significantly
reduces the amount of data transferred and therefore reduces bandwidth and the cost of air time. It also reduces the
chance that two interviewers overwrite each other's work by both syncing to the same data file on the server. As long as
the two interviewers do not modify the same case at the same time, they may both synchronize to the same server
without overwriting each other's data.
The syncdata function is only supported on data files in CSPro DB or Encrypted CSPro DB formats. Data files in the
legacy text format may not be synchronized using syncdata but instead can be transferred using the syncfile function.
That function always transfers the entire file, however, increasing the amount of data transferred and allowing for one
interviewer to overwrite the changes of another.
Return Value
The function returns a logical value of 1 (true) if the transfer was successful and 0 (false) otherwise.
Example 1
Example 2
if syncconnect(Bluetooth) then
// sync data for only the province and district assigned to the interviewer
string assignmentUniverse = maketext("%v%v", ASSIGNED_PROVINCE, ASSIGNED_DISTRICT);
syncdata(BOTH, CENSUS_DICT, assignmentUniverse);
syncdisconnect();
endif;
See also: Synchronization Overview, SyncConnect Function, SyncFile Function, SyncParadata Function,
SyncDisconnect Function, SyncServer Function
SyncFile Function
Format
b = syncfile(direction, from_path『 , to_path』 );
Description
The syncfile function transfers files between a device and a synchronization server. Files may be of any type:
application files, images, text files, etc. However, for transferring CSPro data files it is more efficient to use the function
syncdata. Before calling syncfile, you must first connect to a server by calling syncconnect.
The syncfile function uploads or downloads one or more files from the server. Unlike syncdata, this sends or retrieves
an entire file, not individual cases. It is therefore useful for transferring files that are not data files such as images or new
versions of an application.
The function can upload files from the local device (client) to the server as well as download files from the server. The
direction argument determines which of these operations is performed. It must be one of the following values:
GET: Download files from the server and save them on the client.
PUT: Upload files from the client to the server.
The from_path argument is a string expression that specifies the path of the file to transfer. If the direction is GET then
this is the path of the file on the server to download. If the direction is PUT then this is the full path to the file name on the
client to upload.
The optional to_path argument is a string expression that specifies the destination path for the file. If the direction is
GET then this is the full path of the destination file name on the client device. If the direction is PUT then this is the path
of the destination file or folder on the server. If to_path is not specified, then the file will have the same name as the file in
the from_path and will be saved to the application directory on the client (for GET) or the server root directory (for PUT).
The from_path argument may contain the wildcard characters "*" and "?" to specify a group of files to transfer. In this
case, the to_path argument should be the name of a folder to copy all files that match the wildcard pattern into.
Return Value
The function returns a logical value of 1 (true) if the transfer was successful and 0 (false) otherwise.
Example
if syncconnect(CSWeb, "https://www.myserver.org/api") then
// upload all images in images directory to the images directory on the server
syncfile(PUT, "Images/*.jpg", "/Images/");
syncdisconnect();
endif;
See also: Synchronization Overview, SyncConnect Function, SyncData Function, SyncParadata Function, SyncApp
Function, SyncDisconnect Function, SyncServer Function
SyncMessage Function
Format
// Bluetooth client
s = syncmessage(message_name『 , message_value』 );
// Bluetooth server
function string OnSyncMessage(string message_name, string message_value)
exit message_response;
end;
Description
The syncmessage function sends a string message from one device to another device that is acting as a Bluetooth
server. The string expression message_name identifies the message, and an optional string expression message_value
defines a value associated with the message. The client device sends this message to a Bluetooth server.
The application running on the Bluetooth server must have a user-defined function with the name OnSyncMessage that
has two string parameters and returns a string. While the syncserver function is running, the OnSyncMessage function is
called anytime a message is received and the return value of that function is returned to the Bluetooth client.
Return Value
The function returns the message returned by the OnSyncMessage function running on the Bluetooth server. If no
message is returned, a blank string is returned.
Example
// on the server
PROC GLOBAL
function string OnSyncMessage(string message_name, string message_value)
// store information about the sync in the paradata log
if message_name = "STAFF_CODE" then
logtext("Syncing with %s at %s", message_value, timestring());
// return the training mode
elseif message_name = "TRAINING_MODE" then
exit maketext("%d", TRAINING_MODE);
endif;
end;
SyncParadata Function
Format
b = syncparadata(direction);
Description
The syncparadata function transfers paradata between a device and a synchronization server. Before calling
syncparadata, you must first connect to the server by calling syncconnect.
The function can upload paradata from the local device (client) to the server as well as download paradata from the
server. The direction argument determines which of these operations is performed. It must be one of the following
values:
GET: Download any paradata that was sent to the server since the last sync and add it to the currently open
paradata log.
PUT: Upload to the server any paradata that was collected since the last sync.
BOTH: Sync paradata in the currently open paradata log with the server in both directions (i.e., perform both a
GET and a PUT).
The syncparadata function keeps track of what paradata has been transferred each time the client and server are
synchronized and uses this information to only transfer paradata that has been collected since the last synchronization.
This significantly reduces the amount of data transferred and therefore reduces bandwidth and the cost of air time.
Paradata logs can be very large so be careful about adding paradata synchronizations to your applications. If you are
interested in syncing paradata, a general suggestion is to use BOTH when syncing between devices using Bluetooth,
and to use PUT to a CSWeb, Dropbox, or FTP server.
Return Value
Example
if syncconnect(Bluetooth) then
// other sync actions...
syncparadata(BOTH);
syncdisconnect();
endif;
See also: Synchronization Overview, SyncData Function, SyncFile Function, Paradata, Paradata Function
SyncApp Function
Format
b = syncapp();
Description
The syncapp function updates application files, such as a .pen and .pff, from a new version of the application package on
a server. This function is used to make updates, such as bug fixes and other modifications, to an application already
deployed on mobile devices in the field.
The function checks for an updated version of the application on the server that the program is currently connected to
from a previous call to syncconnect. If the version of the application on the server is newer than the currently running
version, then the new package is downloaded from the server and the application on the device is updated.
This function only works on Android. It is not supported on Windows. It may be used with CSWeb, Dropbox and FTP
servers as well as with peer to peer synchronization via Bluetooth.
Using syncapp is typically only for advanced scenarios, such as when you want to update an application over Bluetooth
or when you want to initiate an update from within a menu program. In most cases it is simpler to use Update Installed
Applications from the menu in the Entry Applications screen of CSEntry as it does the same update as syncapp without
requiring any custom logic.
Before calling syncapp, you must first connect to a server by calling syncconnect.
To use the syncapp function, you must first upload the application package to a server using the Deploy Application tool.
The syncapp function looks for a package on the server with the same package name as the currently running
application. If no package matching the current package name has been deployed to the server, the function will display
an error. Note that the application package is not necessarily the same as the name of the .pen file. The package name
is the name that was entered in the Deploy Application tool when the package was deployed to the server. It is important
therefore to always use the same package name whenever deploying updates to an application.
The syncapp function only downloads the application if the version of the application on the server is more recent than
the version on the mobile device. This is accomplished by storing the date and time the package was uploaded to the
server by the Deploy Application tool. This date and time is stored in the application package and is used to determine if
the package on the server is newer than the package on the mobile device.
Furthermore, if you are using a CSWeb server or Bluetooth, in order to keep data usage to a minimum, only files that are
different in the version of the package on the server from those in the version on the mobile device are downloaded. This
way if you have large files in your application package that do not change as part of the update, they will not slow down
the update. This optimization is not done for Dropbox or FTP servers.
Files that are marked Only on first install in the deployment package that already exist on the mobile device will be
ignored by syncapp and are not updated even if a newer version is included in the updated package.
Be careful when including data files in an application package that is used with syncapp since any data files that are in
use by the current application will be locked which will result in errors when syncapp attempts to overwrite them. Either
close any files that may be updated prior to calling syncapp or mark the data files as Only on first install in the Deploy
Applications tool and use syncdata to update them instead.
Note that if as part of the application update the currently running .pen file is updated to a newer version, syncapp will
restart the application in order to run the updated version. This means that any logic that follows the call to syncapp will
not be executed. To avoid problems, make sure that the call to syncapp is the last call in the procedure. For example, to
combine the application update with calls to syncdata to update data files, place the calls to syncdata before the call to
syncapp.
Return Value
The function returns a logical value of 1 (true) if the update was successful and 0 (false) otherwise.
Example
if syncconnect(CSWeb,"https://www.myserver.org/api") then
// get the latest versions of the application files from the server
syncapp();
syncdisconnect();
endif;
See also: Synchronization Overview, SyncConnect Function, SyncData Function, SyncDisconnect Function,
SyncServer Function
GetBluetoothName Function
Format
s = getbluetoothname();
Description
The getbluetoothname function returns the name of the device as broadcast to other Bluetooth devices. When other
Bluetooth enabled devices scan for nearby devices to connect to, this is the name that they will see.
When using Bluetooth for synchronization between two devices the Bluetooth name of the device is what is shown in the
screen asking the user which device to connect to. In addition the Bluetooth name of one device may be passed as an
argument to the syncconnect function on the other device in order to connect directly to that device bypassing the
screen asking the user which device to connect to.
Because it is possible to change the Bluetooth device name in the settings on Android and in the Device Manager on
Windows, the Bluetooth name can be used as a way to name devices for management and audit purposes. The device
can be assigned a unique identifer or serial number which is used as the Bluetooth device name. Using
getbluetoothname, this identifier can be written to an item in the data file for audit purposes, supplied as a username for
synchronization or used as a key in lookup file as part of a survey management system.
Return Value
Pa ge 640 of 684 Synchroniza on Func ons
The function returns a string containing the name of the device.
Example
PROC DEVICE_NAME
preproc
DEVICE_NAME = getbluetoothname();
noinput;
See also: GetDeviceID Function, GetOperatorId Function, GetOS Function, GetUserName Function, SyncConnect
Function (Bluetooth)
Basic Functionality
This example will demonstrate how to create a simple report using Mustache, the default templating engine. The basic
report template is a standard HTML file to which you add two components. First, you add a div block with the id
cspro_report. This is the placeholder block where the template engine will place the results of its templating.
<div id="cspro_report"></div>
Second, you specify the text that you want to pass to the templating engine. Using the script element, you specify the
type, application/vnd.cspro.report-template, as well as the id, cspro_report_template. In between the script tags,
you specify the HTML. Using Mustache, replacement parameters are defined in between double brackets {{ ... }}, and #
and / are used to start and end processing on elements of an array.
When CSPro passes this template to Mustache, the templating engine will look to replace {{title}}, and then for each
occurrence (array element) of PERSON_REC, it will replace {{NAME}}, {{SEX}}, and {{AGE}}. Once it has done this, it
will replace the cspro_report block with the dynamic contents.
Within CSPro, we must pass these parameters to the reporting system. Assuming that our template is saved to a file
named simple_report.html, then our code might be as follows:
setreportdata("title","Demographic Report");
setreportdata(PERSON_REC);
report("simple_report.html");
The calls to setreportdata set the data for the title and copy the information on the person record to an array that can
be accessed in the template. The call to report specifies the name of the template. CSPro will read the template, pass
in the title and person record data, and display the report in a web browser.
The first component is the data itself. Via calls to setreportdata, CSPro creates a JSON object and embeds it as a
script with the id cspro_report_data. If your template has such an id defined, it will be overwritten; otherwise, it will be
inserted in the output. If you want to define your own templating engine, it must be able to work with this JSON object.
Report Template
As described above, the report template contains the HTML text that will be passed to the templating engine. That, along
with the JSON data, is what turns the template into a dynamic report. The template has a few parameters:
<script type="application/vnd.cspro.report-template"
id="..."
data-template-engines="..."
data-queries="...">
<!-- HTML report -->
</script>
The id uniquely identifies the script block. There are no restrictions on its name, but if you want to use the basic report
functionality, then the id must be cspro_report_template and there must be a div block with the id cspro_report.
The optional data-template-engines specifies the names of the templating engines that are required to create the report.
Separate multiple names with whitespace. If not specified, and if the id is cspro_report_template, Mustache will be
used, as that engine comes installed with CSPro. If you specify a templating engine that CSPro cannot locate, the report
generation will result in an error.
The optional data-queries specifies the names of any queries whose execution is required to create the report. Separate
multiple names with whitespace. CSPro will check if data with the query name has been set via setreportdata. If not, it
will search for a query with the proper name, and if found, execute it. If no suitable query can be found, the report
generation will result in an error.
Query
Any queries defined in your template will be executed prior to CSPro generating the report. Queries will be passed
through the setreportdata/sqlquery functions with the result set stored under the attribute defined by data-query-
name. The query has a few parameters:
<script type="application/vnd.cspro.query"
data-query-name="..."
data-description="..."
data-source="...">
<!-- SQL query -->
</script>
The data-query-name is the name that CSPro uses to refer to the query. You do not need to specify an id, as CSPro
uses this name instead. This is the name that corresponds to entries in a report template's data-queries.
The data-description is optional text that may be used to give a more complete description of the query.
The data-source identifies the source of the query. The reporting system must know on which database to execute the
query. The options are either paradata, to execute a query on the currently open paradata log, or a dictionary name, in
which case the query is executed on the currently open CSPro data source associated with that dictionary.
Template Engine
<script type="application/vnd.cspro.template-engine"
data-template-engine-name="..."
data-script-type="..."
data-include-src="..."
data-include-type="...">
<!-- script code -->
</script>
The data-template-engine-name is the name that CSPro uses to refer to the template engine. You do not need to specify
an id, as CSPro uses this name instead. This is the name that corresponds to entries in a report template's data-
template-engines.
The optional data-script-type specifies the type of code located in the script block. If not specified, it defaults to
application/javascript.
The optional data-include-src identifies the file, generally a JavaScript file, that contains the source for the templating
engine.
The optional data-include-type specifies the type of code found in the file identified by data-include-src. If not specified, it
defaults to application/javascript.
CSPro will include in the HTML report the script block of any templating engine necessary to generate the report, as well
as links to any (JavaScript) files required to run the template.
CSPro first reads in any report templates and determines what queries and template engines are required to create the
report. It checks any data set via setreportdata, and if that data has an attribute with the same name as a query, then
it is assumed that that data overrides the query and the query required by the template is assumed to be handled and is
ignored. CSPro then looks in the report template for the definition of any queries and template engines. If they are found,
then they are processed.
However, if CSPro cannot find the definition of a query or template engine in the report template, it looks in two places for
these definitions, which can be included in report script definition files, which have the extension .csrs. These files
consist of script blocks defining queries and template engines. First, CSPro looks in the directory where the report is
being created and loads any .csrs files in that directory. If CSPro cannot find the definition of a query or template engine
in those files, it then loads the .csrs files that come with default CSPro installation. If it cannot find them in either of
these locations, the report generation will result in an error.
To see an example of how this works, create a basic report, using the Mustache engine, and the output of the report will
use the engine as it is defined in "C:\Program Files (x86)\CSPro 7.5\Reports\CSPro Reports.csrs." You could override
the default Mustache behavior by adding a template engine with the name "mustache" to your report template or to a
.csrs file in the directory where your report will be created.
Below are some examples of how to use Mustache with data from CSPro:
Example - Records
// each record occurrence is output as an array, with the items
// for each occurrence of the record output within that array
setreportdata(PERSON_REC);
{{#PERSON_REC}}
<p><b>{{NAME}}</b> is aged {{AGE}}.</p>
{{/PERSON_REC}}
Example - SqlQuery
string DeviceQuery =
"SELECT device_model, COUNT(*) AS device_count "
"FROM device_info "
"GROUP BY device_model;";
setreportdata("device_info",sqlquery(paradata,DeviceQuery));
{{#device_info}}
<p>{{device_count}} interviewers used {{device_model}}.</p>
{{/device_info}}
Pentium processor
512MB of RAM
SVGA monitor
Mouse
100MB of free hard drive space
Microsoft Windows Vista, 7, 8, or 10
Installing CSPro
The CSPro installer has the file name cspro75.exe. To install CSPro:
After CSPro has been installed on your computer, you will have the option to run the program and/or view the release
notes.
You will find a CSPro 7.5 icon on your desktop that you can double-click on when you want to run CSPro in the future.
Due to internal changes within CSPro 7.5, once files have been loaded in CSPro 7.5, you may no longer be able to load
them in previous versions of CSPro.
Uninstalling CSPro
There are two ways to uninstall CSPro. The uninstaller will remove all registry entries and CSPro system files. It will not
remove any applications or other files that you have created.
You can uninstall the program using the Windows Control Panel:
1. Using the Windows search functionality (Windows Key+S), type Add or remove programs.
2. Select CSPro from the list of programs.
3. Follow the prompts to uninstall the program.
1. Use Windows Explorer to browse to the CSPro application folder, which will likely be: C:\Program Files
(x86)\CSPro 7.5.
2. Run the program uninstall.exe.
3. Follow the prompts to uninstall the program.
If the application is on a network drive, multiple users can access it at the same time. Normally, only one person has
access to an application on a local hard drive. The data file for an application can also be placed on a network drive or
the operator's local hard drive, though two or more data entry operators cannot access the same data (or lookup) file at
the same time.
Another way to distribute a data entry application is to compile it into a single combined file that contains all elements of
the program. This file cannot be edited and provides security during the data entry operation. For this approach, read
about binary data entry applications.
For information about data entry applications in general, see the Data Entry User's Guide.
See also: Data Entry Keys, Batch Edit Keys, Tabulation Keys
Shortcut Description
Key
Ctrl + K Compile the currently selected item's logic.
Ctrl + Space Display autocomplete suggestions for a partially entered word.
Ctrl + 4 Insert the name of the procedure or function that is currently being edited. 4 is the key for this item
($) on many keyboards.
Ctrl + G Go to a specific line number.
Ctrl + F Find the specified text.
F3 Find the next occurrence of the specified text.
Shift + F3 Find the previous occurrence of the specified text.
Ctrl + H Replace the specified text with different text.
Ctrl + A Select all text.
Ctrl + Shift + L View the PROC GLOBAL logic.
Ctrl + / Comment or uncomment the current line or block of logic.
Ctrl + D Duplicate the current line or block of logic.
Ctrl + L Delete the current line.
Ctrl + Del Delete the current word.
Ctrl + Shift + F Copy the selected logic in a format for using on the CSPro Users Forum.
Alt + F12 Display the reference window with information on the selected word.
Ctrl + Alt + Display the reference window with information on the selected word.
Click
Pa ge 649 of 684 Appendix B - Keys Summa ry
F12 Go to the declaration of the selected word.
Ctrl + Click Go to the declaration of the selected word.
Ctrl + Plus (+) Increase the text font size.
Ctrl + Minus Decrease the text font size.
(-)
See also: Data Dictionary Keys, Batch Edit Keys, Tabulation Keys
Shortcut Description
Key
Ctrl + K Compile the currently selected item's logic.
See also: Data Dictionary Keys, Data Entry Keys, Tabulation Keys
Tabulation Keys
Shortcuts Common Throughout CSPro
Shortcut Description
Key
Ctrl + K Compile the currently selected item's logic.
Ctrl + Space Display autocomplete suggestions for a partially entered word.
Ctrl + 4 Insert the name of the procedure or function that is currently being edited. 4 is the key for this item
($) on many keyboards.
Ctrl + G Go to a specific line number.
Ctrl + F Find the specified text.
F3 Find the next occurrence of the specified text.
Shift + F3 Find the previous occurrence of the specified text.
Ctrl + H Replace the specified text with different text.
Ctrl + A Select all text.
Ctrl + Shift + L View the PROC GLOBAL logic.
Ctrl + / Comment or uncomment the current line or block of logic.
Ctrl + D Duplicate the current line or block of logic.
Ctrl + L Delete the current line.
Ctrl + Del Delete the current word.
Ctrl + Shift + F Copy the selected logic in a format for using on the CSPro Users Forum.
Pa ge 652 of 684 Appendix B - Keys Summa ry
Alt + F12 Display the reference window with information on the selected word.
Ctrl + Alt + Display the reference window with information on the selected word.
Click
F12 Go to the declaration of the selected word.
Ctrl + Click Go to the declaration of the selected word.
Ctrl + Plus (+) Increase the text font size.
Ctrl + Minus Decrease the text font size.
(-)
See also: Data Dictionary Keys, Data Entry Keys, Batch Edit Keys
File
New Create a new application.
Open Open an existing application.
Tools
Data Viewer View data files.
Text Viewer View text or data files.
Table Viewer View CSPro tables.
Table Retrieval Retrieve tables from a data set.
Tabulate Frequencies Tabulate frequency distributions for file contents.
Sort Data Sort cases or records.
Export Data Export data in various formats.
Reformat Data Reformat data using two dictionaries.
Compare Data Compare contents of two similar data files.
Concatenate Data Join text or data files one after the other.
Table Retrieval Setup Setup table retrieval database.
Pack Application Pack entire CSPro application into a ZIP file.
Index Data Identifies duplicate cases in a data file.
Excel to CSPro Convert data from Excel to CSPro.
PFF Editor Edit a PFF file in an interactive environment.
Production Runner Automate the running of multiple CSPro applications.
Operator Statistics Viewer View operator statistics files.
Save Array Viewer View and modify saved arrays.
Font Preferences Change the font preferences for the CSPro Designer.
Window
Cascade Arrange windows in an overlapping fashion.
Tile Top to Bottom Arrange windows one above the other.
Tile Side by Side Arrange windows one beside the other.
Help
Help Topics Get help on current application.
Examples Folder Open the folder containing some CSPro examples.
Mailing List Signup Sign up to the mailing list to receive CSPro news updates.
Technical Support Get information about receiving technical support.
Android CSEntry Application Open Google Play to download the Android application.
About Get information about the software.
File
New Create a new application.
Open Open an existing application.
Close Close an application.
Save Save an application.
Save As Save the current dictionary to a new file name.
Add Files Insert a file in an existing application.
Drop Files Drop a file from an existing application.
Page Setup Change headers, footers, and margins for printed pages.
Print Setup Change orientation and paper size for printed pages.
Print Preview Preview the printed pages.
Print Print all or part of a document.
Edit
Undo Undo dictionary changes.
Redo Redo dictionary changes.
Cut Copy selected dictionary element to clipboard and delete it.
Copy Copy selected dictionary element to clipboard.
Paste Paste dictionary element on clipboard to selected location.
Modify Edit the selected dictionary element.
Add Add a dictionary element at the end of the list.
Insert Insert a dictionary element at the selected location.
Delete Delete selected dictionary element.
Languages Define the set of languages used in the dictionary.
Relations Define relations between items and records.
Notes Edit notes for selected dictionary element.
Find Find a label or name with the specified text.
Convert to Subitems Convert selected items to subitems and insert the item that contains them.
Flatten Occurrences Convert a multiply occurring item to many singly occurring items.
Occurrence Labels Edit the occurrence labels of a multiply occurring record or item.
Generate Value Set Generate a value set of intervals of a numeric data item.
Dictionary Macros Access special functionality for the dictionary.
View
Names in Trees Show names instead of labels in trees.
Append Labels to Names in Show both labels and names in trees.
Tree
Full Screen Hide the trees and show full screen view.
Layout Show record layout of file in the window.
Options
Relative Positions Select whether items stay next to each other with no gaps.
ZeroFill Default 'Yes' Select whether numeric data items will have leading zeros.
DecChar Default 'Yes' Specifies whether the item should be stored in the data file with an explicit decimal
character.
File
New Create a new application.
Open Open an existing application.
Close Close an application.
Save Save an application.
Save As Save the active application with a new name.
Add Files Add a dictionary or form to an existing application.
Drop Files Drop a dictionary or form from an existing application.
Compile Compile the logic in the application.
Run Run the application.
Run as Batch Run the application after finishing keying data for a file.
Publish Entry Application Generate a binary (compiled) version of the application.
Edit
Undo Undo the most recent change.
Redo Redo the last undo.
Cut Copy selected element to clipboard and delete it.
Copy Copy selected element to clipboard.
Paste Paste element on clipboard to selected location.
Add Form Add a form to the application.
Delete Form Delete a form from the application.
Generate Forms Generate forms using the dictionary.
Delete Delete selected objects.
Find Find text in the procedures.
Find Next Find the next occurrence of text in the procedures.
Replace Replace text with new text in the procedures.
Format Logic Format the logic using the default CSPro code formatting style.
View
Box Toolbar Show or hide the box drawing toolbar.
Names in Trees Show names instead of labels in trees.
Append Labels to Names in Show both labels and names in trees.
Tree
Full Screen Hide the trees and show full screen view.
Form Show forms in the right-hand window.
Logic Show logic procedures in the right-hand window.
CAPI Questions Show the screen to enter question text.
OnKey Character Map Calculate OnKey and OnChar codes.
User and Configuration Settings Open a dialog that allows you to edit the user and configuration settings.
Options
Data Entry Change the data entry options.
Drag Change the drag options.
Data Sources Change the data source options.
Align
Left Position to left-most item.
Center Center items as a group.
Right Position to right-most item.
Top Position to top-most item.
Mid Align mid-points of items as a group.
Bottom Position to bottom-most item.
Evenly Horizontal Space evenly left to right.
Evenly Vertical Space evenly top to bottom.
CAPI Options
Use CAPI Font 1 For CAPI text use first default font.
Use CAPI Font 2 For CAPI text use second default font.
Bold Switch between bold and non-bold text.
Italic Switch between italic and non-italic text.
Underline Switch between underlined and non-underlined text.
Color Change color of text.
Left Align selected text to left of question window.
Center Align selected text to center of question window.
Right Align selected text to right of question window.
Bullets Add bullets to selected or new text.
Insert Bitmap Insert bitmap (BMP file) image in question window.
Help Text Switch display between question text and help text.
Define CAPI Languages Define the set of CAPI languages used in application.
Change CAPI Font 1 Select the first default font for a CAPI language.
Change CAPI Font 2 Select the second default font for a CAPI language.
Fit Windows to Questions Adjust question windows to display their entire text.
File
New Create a new application.
Open Open an existing application.
Close Close an application.
Save Save an application.
Save As Save the active application with a new name.
Add Files Add a dictionary to an existing application.
Drop Files Drop a dictionary from an existing application.
Compile Compile the logic in the application.
Edit
Undo Undo latest cut/copy/paste operations.
Redo Redo the latest undo operations.
Cut Cut logic and place it on the clipboard.
Copy Copy logic and place it on the clipboard.
Paste Paste logic from the clipboard.
Find Find text in the logic.
Find Next Find the next occurrence of text in the logic.
Replace Replace text with new text in the logic.
Format Logic Format the logic using the default CSPro code formatting style.
View
Names in Trees Show names instead of labels in trees.
Append Labels to Names in Show both labels and names in trees.
Tree
Full Screen Hide the trees and show full screen view.
User and Configuration Settings Open a dialog that allows you to edit the user and configuration settings.
Options
Custom Order Allow user defined order of editing.
Deprecation Warnings Modify the setting for the kind of deprecation warnings to display during
compilation.
Tabulation Menu
The following is a summary of the menu options available while editing tabulation applications:
File
New Create a new application.
Open Open an existing application.
Close Close an application.
Save Save an application.
Save As Save the active application with a new name.
Save Tables Save current table results in a file.
Add Files Add a dictionary to an existing application.
Drop Files Drop a dictionary from an existing application.
Load Preferences Load stored table formatting preferences.
Save Preferences Save table formatting preferences.
Run Run the application.
Run in Parts Run Tabulate, Consolidate, or Format process.
Page Setup Change headers, footers, and margins for printed pages.
Print Setup Change orientation and paper size for printed pages.
Print Preview Preview the printed pages.
Print Print all or part of a document.
View
Names in Trees Show names instead of labels in trees.
Append Labels to Names in Tree Show both labels and names in trees.
Full Screen Hide the trees and show full screen view.
Hidden Parts Show or hide hidden parts of a table.
Previous Table Show the previous table.
Next Table Show the next table.
Zoom In Zoom in on a print preview.
Zoom Out Zoom out on a print preview.
Facing Pages Show facing pages (two) on a print preview.
Multiple Pages Show selected number of pages in print preview.
First Page Show first print page of first table.
Previous page Show previous print page.
Next page Show next print page.
Last page Show last print page of last table.
Edit Notes for dictionary, level, record, item, value set, or value.
Find a label or a name in the dictionary.
Get Help.
Get Help.
Get Help.
Tabulation Toolbar
The Tabulation toolbar is displayed across the top of the window, immediately below the menu bar. The toolbar provides
quick mouse access to many of the more frequently-used features found in Tabulation.
Click To
Run a tabulation.
Add a table.
Insert a table.
Delete the current table.
Get Help.
Note: The Convert Dictionary tool is no longer distributed with CSPro but is instead available on
www.csprousers.org.
CSPro allows the use of long names even though IMPS only allows 16 characters. Therefore, if any of your unique
names are longer than 16 characters, you will receive a message for each variable that its name will be truncated to 16.
CSPro does not support the use of short names, as described in IMPS. Therefore, if the first 8 characters of any item
name in CSPro are not unique, the converted IMPS dictionary will have duplicate shorts names. You should check the
converted IMPS dictionary to ensure that there are no duplicate short names.
Convert an ISSA data dictionary to CSPro data dictionary only, you will receive a CSPro dictionary (.dcf) only.
You can choose to convert either a regular data dictionary (.dic) or a working storage dictionary (.wst).
Convert an ISSA data dictionary to CSPro form and data dictionary, you will receive both a CSPro dictionary (.dcf)
and a CSPro form file (.fmf). You can choose to convert either a regular data dictionary (.dic) or a working storage
dictionary (.wst).
At any time in CSPro, you can go to the Tools menu option. Select Convert Dictionary from the menu, then choose to
convert Between CSPro and IMPS in the opening dialog box.
CSPro is much less constrained than CENTRY in the relationship between dictionary records and data entry forms. In
CSPro you can mix dictionary items from different records on the same form. See Issues to Consider When Designing a
Form for more details.
In CSPro you can make the equivalent of the CENTRY "Batch," "Questionnaire," and "Record" screens. If you use this
approach, you must be sure to make all the fields on the "Batch" screen persistent.
Note: The Convert Dictionary tool is no longer distributed with CSPro but is instead available on
www.csprousers.org.
At any time in CSPro, you can go to the Tools menu option. Select Convert Dictionary from the menu, then choose to
convert Between CSPro and ISSA in the opening dialog box.
Next, state that you'd like to convert from ISSA to a CSPro Forms and Data Dictionary. Provide the name of the
original ISSA dictionary file, and the name you would like to call the CSPro form file to be generated (a CSPro dictionary
file will also be created, and its name will be based on the form file name). Press OK when ready and the files will be
created for you. You are then ready to fine-tune the layout of the forms as desired.
Note that this creates a stand-alone dictionary and form file; it does not create a data entry application. Until there exists
Note: The Convert Dictionary tool is no longer distributed with CSPro but is instead available on
www.csprousers.org.
A survey can provide valuable infor ma tion about the population being studied, but because of the limitations of samples,
the results can be generalized only for relatively high-level geographic areas, such as the country as a whole, or
(depending on the way in which the sample was selected) for specific regions or areas of the country (e.g., urban vs.
rural). A census, however, attempts to cover the entire geographic area of the subject population, so it can provide
reliable information at very low levels of geography. In addition to counts (e.g., number of people, number of housing
units, number of farms, number of businesses, etc.), a census usually provides a profile of additional related
characteristics such as fertility, housing quality, acreage, number of employees, and so on.
Census and survey data are used to plan for education, health facilities, administration, and other needs. In order to
implement programs and activities, statistics are needed by government administrators and by private users, including
businesses, industries, research organizations, and the general public. These statistics also may serve as measures of
existing conditions for small areas, providing a basis for planning development programs, and perhaps establishing a
basis for action.
In order to obtain an accurate census or survey, the data must be as free as possible from errors and inconsistencies.
Statistics derived from 'dirty data' (that is, data which still contain errors) may produce an inaccurate profile of the country
or geographic area. Therefore, before any tabulation programs are run, the data should be checked for errors and
changed so that important data items are valid and consistent. This is not to say that correction of data after they are
collected can compensate for poorly collected data. It is not practical (if not impossible) to produce a data file which is
100 percent error free. Every effort at accuracy should be made in all stages of the census or survey.
Errors in Enumeration
Pa ge 667 of 684 Appendix F - Errors in Cens us es a nd Surveys
Two types of errors occur at the enumeration stage: the respondent sometimes errs when giving information to the
enumerator either by offering what the respondent believes to be the "proper response" (as opposed to a truthful
response) to the questions; or by misunderstanding the question; and the enumerator, in asking the questions, recording
the responses, and reviewing entries at the conclusion of the interview, also may add errors to the data.
Little can be done to improve the quality of responses from individuals, except through publicity for the census and well-
trained enumerators who explain the purpose of the census and reasons for asking the various questions. The quality of
enumerators and enumerator training often can be the crucial factors in census processing. Enumerators must be
properly trained in all relevant aspects of census procedures and made to understand why their part of the census is
important, and how the enumeration fits in with the other stages of the census. Pretesting should be used to eliminate
problems in the questionnaires and materials, and to help enumerators obtain the data and complete the enumeration in
the allotted time. Also, since enumerators come from many different backgrounds and have varying levels of education
and training, training must be developed to make certain the enumerators know how to ask the questions to obtain an
unbiased response.
Once the forms leave the enumeration areas, changes can no longer easily be made with the knowledge and help of the
respondents and enumerators, so other procedures must be used to cope with inaccurate, incomplete, or inconsistent
data. During preliminary census office editing, checks of crucial entries must be carried out quickly to determine
completeness and consistency in the collected data. Highly-aberrant forms may be sent back to the field, if time and
money permit. Place codes must be checked for validity, and relationships between numbers of expected individuals as
recorded on the household form, and the actual numbers of individual forms (if individual forms are used) must agree.
Errors in Coding
Precise, detailed instructions for coding in preparation for manual and computer editing must be determined after the
tabulation plans are developed, but before the enumeration is actually undertaken. Back in the census office, it is no
longer possible to make corrections while in contact with the respondents, so editing must be determined on the basis of
assumptions about what the most probable response would be. If computer editing is possible, manual office editing
should be minimized to checking for completeness.
If census data are collected on precoded questionnaires (coded either by respondents or enumerators), and machines
are used to convert the information to computer-readable data, then except for the introduction of errors due to stray
marks or physical problems with the questionnaires, the errors found should be minimal.
Errors may occur when the data are coded, since the coder may miscode some piece of information. If the miscode is
invalid, it should be caught during the computer editing; if the code is valid but incorrect (for example, if two digits are
reversed for the entry for birthplace), the computer will not note the errors, and the information will remain incorrect for the
tabulations. Coders must be trained to edit according to the edit specifications, and efforts must be made to obtain and
maintain quality of coders, 'weeding out' inefficient inaccurate coders. Spot checks and verification of samples from each
coder can help to identify persistent coding errors.
With keyer-based entry, errors are introduced into the data through miskeying. Verification (rekeying or double-keying)
can reduce these errors. A system called "intelligent data entry" (IDE) may be used to prevent invalid entries from ever
getting into the system. An IDE system ensures that the value for each field or data item is within the permissible range
With scanning technologies, errors can be more insidious, and proper verification is more time-consuming, because it
will require either manual comparison between the information captured by the scanner and the forms, or the
establishment of a separate keying operation so that keyed output can be compared with scanned output. It is extremely
important not to assume that the use of "advanced" technology reduces or eliminates the need for verification; errors can
and will occur, so they must be caught early in the cycle of capture so that corrective measures (technological or
manual) may be applied. If this is not done, systemic error can corrupt the data beyond repair.
Computer edit checks have been used in almost all censuses carried out since the 1980 round. For proper
implementation of this tool, there must be good communication between the subject-matter specialists and the
programmers. Subject-matter specialists should write complete and clear edit specifications. Programmers should review
these specifications and work closely with subject-matter specialists to resolve questions or difficulties in implementing
the specifications. Programmers also should make sure that subject-matter specialists are involved in testing the edit
programs, that is, in providing test data and reviewing the outputs to insure that all the necessary edits were included in
the specifications. It is the programmers' responsibility to produce an edit program free of errors. If these programs are
inadequately thought out or not completely tested, existing errors in the data may not be corrected and even more errors
may be introduced.
Errors in Tabulation
Errors can occur at the tabulation stage due to improper programming or use of unknown information. Errors at this stage
are difficult to correct without introducing new errors.
Errors in Publication
Errors can occur at the publication stage through lack of inter-tabulation checking, or through printing errors. If errors are
carried through all stages of the process to publication, they will be apparent and the results will be of questionable
value. Most importantly, obvious errors at this stage diminish the credibility of the organization presenting the data.
Finally, it is very important that error analysis be done to help in interpreting the extent and kind of errors in the census
and to aid in preparing for future censuses and surveys.
Dictionary
.dcf Data dictionary
Program Information
.pff Parameters describing how to run any CSPro application
.pffRunner Compilation of PFF files used by the Production Runner
Data File
.csdb CSPro DB data file
.csdbe Encrypted CSPro DB data file
Tabulation Application
.xtb Tabulation application
.dcf Data dictionary
.xts Table specifications
.xtb.apc Logic code
.xtb.mgf User-defined messages
Other
.cmp Comparison specifications for the Sort Data tool
.csds Deployment specifications for the Deploy Application tool
.cslog Log with paradata events stored during an application's run
.csrs Report script for templated reports
.exf Export specifications for the Export Data tool
.fqf Frequency specifications for the Tabulate Frequencies tool
.m4a Recorded audio files (AAC audio in an MPEG-4 container)
.ssf Sort specifications for the Sort Data tool
.trs Table retrieval file
.xl2cs Conversion specifications for the Excel to CSPro tool
1. Convert the data to a flat file with one record per line and all records for a case organized together. (Read more
about data requirements or CSPro's file structure.)
2. Create a CSPro dictionary that describes this data file.
There are many formats from which data can be converted, but converting data to CSPro format from Microsoft Excel is
Pa ge 671 of 684 Appendix G - File Types
the most common task. There is a tool, Excel to CSPro, that can do this automatically for you. It is the preferred way of
converting data. It is also possible to create a flat file yourself from within Excel. To convert data to CSPro format:
1. Open a file in Excel. We will demonstrate this conversion using the following file:
2. Remove any header information in the file. CSPro data files should contain only data and thus any metadata (e.g.,
column headings) should be removed:
4. Remove any specialized formatting attached to cells (unless that formatting is desired in the final output):
5. Set the column widths to appropriate values. When this information is finally saved, the width of each column
determines the length of each item in your CSPro dictionary.
6. Save As, choosing "Formatted Text (Space delimited) (*.prn)" as the output format:
It is a good idea to run frequencies on each item to make sure that the dictionary you created properly describes the .prn
file.
The above example is a rather simple one, but more complex imports, including importing cases with multiple records
(and thus a record type), are also possible.
.ent
.bch
.xtb
.dcf
.fmf
For instance:
[NoEdit]
[CSPro Application]
Version=CSPro 7.5
When the user tries to open any such data file, the CSPro Designer will give an error message. However, the user can
open the files in the context of running an application, whether that is running a batch program or exporting data. This
setting only affects the CSPro Designer. It is not particularly robust protection, as a knowledgeable user can remove
[NoEdit] from the file, but it will protect against most users modifying application files. With data entry applications, .pen
files can be distributed for more protection against editing.
Data entry applications run as they would with a file specified. Batch applications, however, do not run any logic
associated with elements of the dictionary. Instead only the application preproc and postproc are executed.
PROC GLOBAL
// this function concatenates all the .dat files in a directory by order of date
function ConcatenateDatFiles()
string file_listing_filename = "file_listing.txt";
string output_filename = "Output/CombinedFiles.dat";
// use the DOS dir command to create a file with the list of all .dat files
string listing_call = maketext('cmd /c "dir /b /od *.dat > %s"', file_listing_filename);
execsystem(listing_call, wait);
// open that file and read all of the contents
file file_listing_file;
list string file_list;
file_listing_file.open(file_listing_filename);
file_listing_file.read(file_list);
file_listing_file.close();
filedelete(file_listing_filename);
// create and run a pff to concatenate these files
pff concat_pff;
concat_pff.setproperty("AppType", "Concatenate");
concat_pff.setproperty("InputData", file_list);
concat_pff.setproperty("OutputData", output_filename);
concat_pff.setproperty("Listing", "ConcatenateDatFiles.lst");
concat_pff.setproperty("ViewListing", "OnError");
concat_pff.setproperty("ViewResults", "No");
concat_pff.exec();
end;
PROC CONCATENATE_FF
ConcatenateDatFiles();
Note that this information applies only to CSPro DB files created in version 7.4 or later. Data files created in earlier
versions store all the data from a case in a single database column so it is not possible to access individual items using
SQL.
Each record table has an auto-increment integer primary key column named <record name>-id. This column does not
correspond to any dictionary item and its value is set automatically by SQLite.
Records with multiple occurrences have an additional column named occ which represents the occurrence number. If a
case in the data file has, for example, three occurrences of a particular record then the values of the occ column for the
rows in the record table for that case will be 1, 2 and 3.
All CSPro DB files also have a table named cases that contains additional information about each case in the data file
such as the case label, whether or not the case was deleted, and whether or not it was partially saved. The ID column of
the cases table is a UUID stored as a string. The database table for the first level in the dictionary is linked to the cases
table by this ID column.
CSPro DB files also contain additional tables that store field notes as well as information required for data
synchronization.
Note the use of ` around level-1-id. This is required so that SQLite does not treat the "-" character in the name as a
minus sign which would cause a syntax error.
The query above could potentially include data from deleted cases. To exclude deleted cases, join with the cases table
and filter on the deleted column:
Using GROUP BY, the following query gets the number of households by enumeration area:
CSPro allows you to explicitly open, close and save data dictionary files independently of other application files. You
must be careful when you do so if more than one application uses the data dictionary.
CSPro applications may optionally contain data dictionaries which represent secondary files, such as lookup files, which
are opened during data entry.
The data dictionary file is a text file which may be viewed with any text editor, such as Text Viewer or Notepad. It is not
recommended to make changes to this file outside the CSPro environment.
The data entry application file is a text file which may be viewed with any text editor, such as Text Viewer or Notepad. It
is not recommended to make changes to this file outside the CSPro environment. Advanced users might do so, however,
to change the names of associated files from the CSPro assigned defaults.
CSPro allows you to explicitly open, close and save forms files independently of the application file. When you do so, the
associated data dictionary file is also opened, closed or saved. Note that if you open a forms file you will not have
access to its application's logic. Generally, only advanced users open forms files explicitly.
The forms file is a text file which may be viewed with any text editor, such as Text Viewer or Notepad. It is not
recommended to make changes to this file outside the CSPro environment. Advanced users might do so, however, for
example to change the name of the associated data dictionary file.
By default, the logic file has the same name as the application file, but with a different extension, .apc. This is not a
requirement, however. Advanced users who change the name of this file must also remember to change the
corresponding name in the application file.
You can associate multiple logic files with an application, which can be useful if you have a library of functions that you
would like to share across several applications. To do this, select File -> Add Files and then choose the external logic
Pa ge 677 of 684 Files Des crip on
file to add to the application. The names of any variables or functions defined in that logic file cannot clash with the
names used in the application's primary logic file. If you no longer want to use the external logic file, you can remove it
by selecting File -> Drop Files.
The logic file is a text file which may be viewed with any text editor, such as Text Viewer or Notepad. While you may
make changes to this file outside the CSPro environment, CSPro provides a powerful text editor which is integrated with
the CSPro compiler.
Prior to CSPro 6.0, logic files had the extension .app. Although the default extension has changed, CSPro can still read
applications that have logic files with the .app extension.
You can associate multiple message files with an application, which can be useful if you have messages defined in
multiple languages. To do this, select File -> Add Files and then choose the external message file to add to the
application. If you no longer want to use the external message file, you can remove it by selecting File -> Drop Files. If
the name of the external message file begins with CSProRuntime, then the messages in that file will override the CSPro
system messages.
Basic Messages
Each line in the message file contains one message. A message consists of a message number followed by text. The
message text can be up to 240 characters long. It is displayed on the entry screen when an errmsg function with the
message number is executed in a data entry application. It is written to the execution report when errmsg function with
the message number is executed in a batch application. The messages can also be accessed with other functions
(including maketext).
When an errmsg(1) function is executed in a data entry application, the message "This is the first message" is
displayed on screen. When an errmsg(2) function is executed, the message "This is the second message" is
displayed.
When the function is executed, it knows to use error message 1 and substitute the word "June" for %s in the message
text, the number 30 for the first %d, and the number 31 for the second %d. The message "The month of June has only 30
days. You entered 31!" will be displayed on the screen.
The more general the arguments of the message, the more flexible the message. In the example below, the value of the
variable HHDAY is used as a argument. The error message will use the value of HHDAY if the errmsg function is
PROC HHDAY
if HHMONTH = 6 and HHDAY > 30 then
errmsg(1, "June", 30, HHDAY);
reenter;
endif;
101 Hello
ZH(101) 你好
FR(101) Bonjour
CSPro will automatically display the correct message based on the current language. If no translated message exists for
the current language, the default message is shown.
Using the Language directive declares that all subsequent error messages are in a given language. For example, the
previous example could be rewritten as:
Language=EN
101 Hello
Language=ZH
101 你好
Language=FR
101 Bonjour
If you would like to define messages for multiple languages at the same time, separate the name of each language with a
comma:
Language=HI, PA
101 नम ते
You can also use the message file to translate string literals that are used along with the tr function. For example:
ZH("Hello") 你好
FR("Hello") Bonjour
Calling tr("Hello") in logic will display the correct string based on the current language.
Batch Edits
Compares
Concatenates
Deployments
Data Entry
Excel to CSPro Conversions
Exports
Frequencies
Indexing
Packs
Paradata Concatenates
Reformats
Sorts
Synchronizations
Tabulations
CSPro automatically generates an index file when creating a new data file. CSPro then coordinates between both files as
cases are added, modified, or deleted.
When CSPro opens a data file, it looks for the corresponding index file. If it does not find one, CSPro automatically
generates a new one. If you suspect, for any reason, that the index file has been corrupted, or does not match the data
file, simply delete the index file and let CSPro generate a new one.
An index is only created for text files. Prior to CSPro 7.0, the data file index had the extension .idx.
Note text can contain "\n" characters, which indicate a newline (Enter key) character. If the note text is automatically
wrapped in the note text box, there will be no newline characters.
The .csnot file can be processed by another CSPro application by creating a data dictionary for it. You can use the
Dictionary Macros function to automatically create this dictionary.
Notes can be created, edited, and deleted using the getnote, putnote, and editnote functions.
Notes for non-text files (such as CSPro DB files) are stored within the file itself and not in a .csnot file. Prior to CSPro
7.0, the notes file had the extension .not.
The .sts file is a text file and you can view this file in Text Viewer.
If necessary, you can modify this file in a text editor. However, you should manipulate this file with extreme caution. If
you modify it incorrectly, you can lose information about partially entered cases or which cases have been verified.
Status information for non-text files (such as CSPro DB files) is stored within the file itself and not in a .sts file.
You can combine multiple paradata logs into a single log by using the Paradata Concatenator tool.
Each record in the log file represents one data entry session. The record layout is as follows:
Position Contents
1-3 Mode (ADD, MOD, or VER)
4 comma
5 - 36 Operator ID (as entered)
37 comma
38 - 47 Start date (mm/dd/yyyy)
48 comma
49 - 56 Start time (hh:mm:ss)
57 comma
58 - 65 End time (hh:mm:ss)
66 comma
67 - 74 Total time (End time - Start time) (seconds)
75 comma
76 - 83 Pause time (seconds)
84 comma
85 - 92 Number of cases written
93 comma
94 - 101 Number of records written
102 comma
103 - 110 Number of keystrokes
111 comma
112 - 119 Number of bad keystrokes
120 comma
121 - 128 Number of fields with errors attributed to keyer
129 comma
130 - 137 Number of fields with errors attributed to verifyer
138 comma
139 - 146 Total number of fields verified
Listing File
The listing file contains information about how an application was run and contains the results of the program's
operations. It gives the following information:
The listing file associated with a batch edit application is often the primary output of the application. For most other
applications and tools, the listing file is a secondary output. For data entry applications, this file can be useful for
monitoring the work of keyers in training and identifying problem keyers.
Using any extension not in the table above will result in a text listing file.
The CSV (comma-separated values) and CSPro Data listing files can only be used with applications that contain an input
dictionary so they are not suitable for some tabulation applications.
Whereas the text listing file displays keys of cases that had errors in text format, the CSV and CSPro Data listing files
output these keys as separate columns/items (e.g., "0901" vs. PROVINCE: 9 / DISTRICT: 1).
Error messages that occur outside of a case (for example during the application procedure) are written to all listing file
formats except for CSPro Data, which only contains information about case-related errors. When using the CSPro Data
listing file, a dictionary that describes the data file will be output using the name of the listing file with the added
extension .dcf.
The batch edit application file is a text file that may be viewed with any text editor, such as Text Viewer or Notepad. It is
not recommended to make changes to this file outside the CSPro environment. Advanced users might do so, however, to
change the names of associated files from the CSPro assigned defaults.
CSPro creates the saved arrays file automatically when one or more arrays in a batch edit application are marked as a
saved array. The name of the saved arrays file is always the same as that of the application with the additional .sva file
extension added (e.g., application_name.bch.sva).
If you want to prevent the file from being updated after a run, which may be desirable if you want to produce the same
results from run to run, you can disable the update via logic:
setproperty("UpdateSaveArrayFile", "No");
CSPro allows you to open, close and save tabulation application files. When you do so, all other files associated with the
application are also opened, closed or saved.
The tabulation application file is a text file which may be viewed with any text editor, such as Text Viewer or Notepad. It
is not recommended to make changes to this file outside the CSPro environment. Advanced users might do so, however,
to change the names of associated files from the CSPro assigned defaults.
The tabulation specifications file is a text file that may be viewed with any text editor, such as Text Viewer or Notepad. It
is not recommended to make changes to this file outside the CSPro environment.
The Area Names file defines the hierarchical levels of geography and assigns text names to the numeric codes for each
geographic unit. The items must be defined in the common part of the data dictionary and should be listed in order from
major to minor division.
These files are used as output and input when tabulations are run in parts.
See also: Table Matrices Index File (.tabidx), Introduction to Run in Parts