KEMBAR78
SQL Reference Manual | PDF | Copyright | Source Code
0% found this document useful (0 votes)
4K views503 pages

SQL Reference Manual

Vertica(r) Analytic Database 3.5, Preview 2 SQL Reference Manual contains proprietary information, as well as trade secrets of Vertica Systems, Inc. Software is protected under international copyright law. This product or products depicted herein may be protected by one or more U.S. Or international patents or pending patents.
Copyright
© Attribution Non-Commercial (BY-NC)
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
4K views503 pages

SQL Reference Manual

Vertica(r) Analytic Database 3.5, Preview 2 SQL Reference Manual contains proprietary information, as well as trade secrets of Vertica Systems, Inc. Software is protected under international copyright law. This product or products depicted herein may be protected by one or more U.S. Or international patents or pending patents.
Copyright
© Attribution Non-Commercial (BY-NC)
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 503

Vertica® Analytic Database 3.

5, Preview 2

SQL Reference Manual Copyright© 2006-2009 Vertica Systems, Inc.

Date of Publication: September 22, 2009

CONFIDENTIAL

Copyright© 2006-2009 Vertica Systems, Inc. and its licensors. All rights reserved.
Vertica Systems, Inc.
8 Federal Street
Billerica, MA 01821
Phone: (978) 600-1000
Fax: (978) 600-1001
E-Mail: info@vertica.com
Web site: http://www.vertica.com (http://www.vertica.com)

The software described in this copyright notice is furnished under a license and may be used or
copied only in accordance with the terms of such license. Vertica Systems, Inc. software
contains proprietary information, as well as trade secrets of Vertica Systems, Inc., and is
protected under international copyright law. Reproduction, adaptation, or translation, in whole or
in part, by any means — graphic, electronic or mechanical, including photocopying, recording,

-i-
SQL Reference Manual

taping, or storage in an information retrieval system — of any part of this work covered by
copyright is prohibited without prior written permission of the copyright owner, except as
allowed under the copyright laws.
This product or products depicted herein may be protected by one or more U.S. or international
patents or pending patents.

Vertica Scripts
This documentation might reference sample Vertica scripts that demonstrate and/or enhance
functionality available in the Vertica® Analytic Database. These Vertica scripts are bound to the
Terms and Conditions referred to in the Vertica Script EULA Agreement
(http://www.vertica.com/termsofuse)

Trademarks
Vertica™ and the Vertica® Analytic Database™ are trademarks of Vertica Systems, Inc..
Adobe®, Acrobat®, and Acrobat® Reader® are registered trademarks of Adobe Systems Incorporated.
AMD™ is a trademark of Advanced Micro Devices, Inc., in the United States and other countries.
Fedora™ is a trademark of Red Hat, Inc.
Intel® is a registered trademark of Intel.
Linux® is a registered trademark of Linus Torvalds.
Microsoft® is a registered trademark of Microsoft Corporation.
Novell® is a registered trademark and SUSE™ is a trademark of Novell, Inc., in the United States and
other countries.
Oracle® is a registered trademark of Oracle Corporation.
Red Hat® is a registered trademark of Red Hat, Inc.
VMware® is a registered trademark or trademark of VMware, Inc., in the United States and/or other
jurisdictions.
Other products mentioned may be trademarks or registered trademarks of their respective
companies.

Open Source Software Acknowledgements


Vertica makes no representations or warranties regarding any third party software. All
third-party software is provided or recommended by Vertica on an AS IS basis.
This product includes cryptographic software written by Eric Young (eay@cryptsoft.com).

Boost
Boost Software License - Version 1.38 - February 8th, 2009
Permission is hereby granted, free of charge, to any person or organization obtaining a copy of
the software and accompanying documentation covered by this license (the "Software") to use,
reproduce, display, distribute, execute, and transmit the Software, and to prepare derivative
works of the Software, and to permit third-parties to whom the Software is furnished to do so, all
subject to the following:
The copyright notices in the Software and this entire statement, including the above license
grant, this restriction and the following disclaimer, must be included in all copies of the
Software, in whole or in part, and all derivative works of the Software, unless such copies or

-ii-
Contents

derivative works are solely in the form of machine-executable object code generated by a
source language processor.
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS
OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF
MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE, TITLE AND
NON-INFRINGEMENT. IN NO EVENT SHALL THE COPYRIGHT HOLDERS OR ANYONE
DISTRIBUTING THE SOFTWARE BE LIABLE FOR ANY DAMAGES OR OTHER LIABILITY,
WHETHER IN CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN
CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
SOFTWARE.

bzip2
This file is a part of bzip2 and/or libbzip2, a program and library for lossless, block-sorting data
compression.
Copyright © 1996-2005 Julian R Seward. All rights reserved.
1 Redistribution and use in source and binary forms, with or without modification, are
permitted provided that the following conditions are met:
2 Redistributions of source code must retain the above copyright notice, this list of conditions
and the following disclaimer.
3 The origin of this software must not be misrepresented; you must not claim that you wrote
the original software. If you use this software in a product, an acknowledgment in the
product documentation would be appreciated but is not required.
4 Altered source versions must be plainly marked as such, and must not be misrepresented
as being the original software.
5 The name of the author may not be used to endorse or promote products derived from this
software without specific prior written permission.
THIS SOFTWARE IS PROVIDED BY THE AUTHOR ``AS IS'' AND ANY EXPRESS OR
IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES
OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE DISCLAIMED.
IN NO EVENT SHALL THE AUTHOR BE LIABLE FOR ANY DIRECT, INDIRECT,
INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT
NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF
USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON
ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT
(INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF
THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
Julian Seward, Cambridge, UK.
jseward@bzip.org <mailto:jseward@bzip.org>
bzip2/libbzip2 version 1.0 of 21 March 2000
This program is based on (at least) the work of:
Mike Burrows
David Wheeler
Peter Fenwick

-iii-
SQL Reference Manual

Alistair Moffat
Radford Neal
Ian H. Witten
Robert Sedgewick
Jon L. Bentley

Ganglia Open Source License


Copyright © 2001 by Matt Massie and The Regents of the University of California.
All rights reserved.
Permission to use, copy, modify, and distribute this software and its documentation for any
purpose, without fee, and without written agreement is hereby granted, provided that the above
copyright notice and the following two paragraphs appear in all copies of this software.
IN NO EVENT SHALL THE UNIVERSITY OF CALIFORNIA BE LIABLE TO ANY PARTY FOR
DIRECT, INDIRECT, SPECIAL, INCIDENTAL, OR CONSEQUENTIAL DAMAGES ARISING
OUT OF THE USE OF THIS SOFTWARE AND ITS DOCUMENTATION, EVEN IF THE
UNIVERSITY OF CALIFORNIA HAS BEEN ADVISED OF THE POSSIBILITY OF SUCH
DAMAGE.
THE UNIVERSITY OF CALIFORNIA SPECIFICALLY DISCLAIMS ANY WARRANTIES,
INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY
AND FITNESS FOR A PARTICULAR PURPOSE. THE SOFTWARE PROVIDED HEREUNDER
IS ON AN "AS IS" BASIS, AND THE UNIVERSITY OF CALIFORNIA HAS NO OBLIGATION
TO PROVIDE MAINTENANCE, SUPPORT, UPDATES, ENHANCEMENTS, OR
MODIFICATIONS.

ICU (International Components for Unicode) License - ICU 1.8.1 and later
COPYRIGHT AND PERMISSION NOTICE
Copyright (c) 1995-2009 International Business Machines Corporation and others
All rights reserved.
Permission is hereby granted, free of charge, to any person obtaining a copy of this software
and associated documentation files (the "Software"), to deal in the Software without restriction,
including without limitation the rights to use, copy, modify, merge, publish, distribute, and/or sell
copies of the Software, and to permit persons to whom the Software is furnished to do so,
provided that the above copyright notice(s) and this permission notice appear in all copies of
the Software and that both the above copyright notice(s) and this permission notice appear in
supporting documentation.
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS
OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF
MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT
OF THIRD PARTY RIGHTS. IN NO EVENT SHALL THE COPYRIGHT HOLDER OR
HOLDERS INCLUDED IN THIS NOTICE BE LIABLE FOR ANY CLAIM, OR ANY SPECIAL
INDIRECT OR CONSEQUENTIAL DAMAGES, OR ANY DAMAGES WHATSOEVER
RESULTING FROM LOSS OF USE, DATA OR PROFITS, WHETHER IN AN ACTION OF
CONTRACT, NEGLIGENCE OR OTHER TORTIOUS ACTION, ARISING OUT OF OR IN

-iv-
Contents

CONNECTION WITH THE USE OR PERFORMANCE OF THIS SOFTWARE.


Except as contained in this notice, the name of a copyright holder shall not be used in
advertising or otherwise to promote the sale, use or other dealings in this Software without prior
written authorization of the copyright holder.
All trademarks and registered trademarks mentioned herein are the property of their respective
owners.

Lighttpd Open Source License


Copyright © 2004, Jan Kneschke, incremental
All rights reserved.
1 Redistribution and use in source and binary forms, with or without modification, are
permitted provided that the following conditions are met:
2 Redistributions of source code must retain the above copyright notice, this list of conditions
and the following disclaimer.
3 Redistributions in binary form must reproduce the above copyright notice, this list of
conditions and the following disclaimer in the documentation and/or other materials provided
with the distribution.
4 Neither the name of the 'incremental' nor the names of its contributors may be used to
endorse or promote products derived from this software without specific prior written
permission.
THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS
"AS IS" AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED
TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A
PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT
OWNER OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL,
SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED
TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR
PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF
LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING
NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS
SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
MIT Kerberos
Copyright © 1985-2007 by the Massachusetts Institute of Technology.
Export of software employing encryption from the United States of America may require a
specific license from the United States Government. It is the responsibility of any person or
organization contemplating export to obtain such a license before exporting.
WITHIN THAT CONSTRAINT, permission to use, copy, modify, and distribute this software and
its documentation for any purpose and without fee is hereby granted, provided that the above
copyright notice appear in all copies and that both that copyright notice and this permission
notice appear in supporting documentation, and that the name of M.I.T. not be used in
advertising or publicity pertaining to distribution of the software without specific, written prior
permission. Furthermore if you modify this software you must label your software as modified
software and not distribute it in such a fashion that it might be confused with the original MIT

-v-
SQL Reference Manual

software. M.I.T. makes no representations about the suitability of this software for any purpose.
It is provided “as is” without express or implied warranty.
Individual source code files are copyright MIT, Cygnus Support, Novell, OpenVision
Technologies, Oracle, Red Hat, Sun Microsystems, FundsXpress, and others.
Project Athena, Athena, Athena MUSE, Discuss, Hesiod, Kerberos, Moira, and Zephyr are
trademarks of the Massachusetts Institute of Technology (MIT). No commercial use of these
trademarks may be made without prior written permission of MIT.
“Commercial use” means use of a name in a product or other for-profit manner. It does NOT
prevent a commercial firm from referring to the MIT trademarks in order to convey information
(although in doing so, recognition of their trademark status should be given).
Portions of src/lib/crypto have the following copyright:
Copyright © 1998 by the FundsXpress, INC.
All rights reserved.
Export of this software from the United States of America may require a specific license from
the United States Government. It is the responsibility of any person or organization
contemplating export to obtain such a license before exporting.
WITHIN THAT CONSTRAINT, permission to use, copy, modify, and distribute this software and
its documentation for any purpose and without fee is hereby granted, provided that the above
copyright notice appear in all copies and that both that copyright notice and this permission
notice appear in supporting documentation, and that the name of FundsXpress. not be used in
advertising or publicity pertaining to distribution of the software without specific, written prior
permission. FundsXpress makes no representations about the suitability of this software for any
purpose. It is provided “as is” without express or implied warranty.
THIS SOFTWARE IS PROVIDED “AS IS” AND WITHOUT ANY EXPRESS OR IMPLIED
WARRANTIES, INCLUDING, WITHOUT LIMITATION, THE IMPLIED WARRANTIES OF
MERCHANTIBILITY AND FITNESS FOR A PARTICULAR PURPOSE.
The implementation of the AES encryption algorithm in src/lib/crypto/aes has the following
copyright:
Copyright © 2001, Dr Brian Gladman <brg@gladman.uk.net>, Worcester, UK.
All rights reserved.
LICENSE TERMS
The free distribution and use of this software in both source and binary form is allowed (with or
without changes) provided that:
1 Distributions of this source code include the above copyright notice, this list of conditions
and the following disclaimer.
2 Distributions in binary form include the above copyright notice, this list of conditions and the
following disclaimer in the documentation and/or other associated materials.
3 The copyright holder's name is not used to endorse products built using this software
without specific written permission.

-vi-
Contents

DISCLAIMER
This software is provided 'as is' with no explicit or implied warranties in respect of any
properties, including, but not limited to, correctness and fitness for purpose.
The implementations of GSSAPI mechglue in GSSAPI-SPNEGO in src/lib/gssapi, including the
following files:
lib/gssapi/generic/gssapi_err_generic.et
lib/gssapi/mechglue/g_accept_sec_context.c
lib/gssapi/mechglue/g_acquire_cred.c
lib/gssapi/mechglue/g_canon_name.c
lib/gssapi/mechglue/g_compare_name.c
lib/gssapi/mechglue/g_context_time.c
lib/gssapi/mechglue/g_delete_sec_context.c
lib/gssapi/mechglue/g_dsp_name.c
lib/gssapi/mechglue/g_dsp_status.c
lib/gssapi/mechglue/g_dup_name.c
lib/gssapi/mechglue/g_exp_sec_context.c
lib/gssapi/mechglue/g_export_name.c
lib/gssapi/mechglue/g_glue.c
lib/gssapi/mechglue/g_imp_name.c
lib/gssapi/mechglue/g_imp_sec_context.c
lib/gssapi/mechglue/g_init_sec_context.c
lib/gssapi/mechglue/g_initialize.c
lib/gssapi/mechglue/g_inquire_context.c
lib/gssapi/mechglue/g_inquire_cred.c
lib/gssapi/mechglue/g_inquire_names.c
lib/gssapi/mechglue/g_process_context.c
lib/gssapi/mechglue/g_rel_buffer.c
lib/gssapi/mechglue/g_rel_cred.c
lib/gssapi/mechglue/g_rel_name.c
lib/gssapi/mechglue/g_rel_oid_set.c
lib/gssapi/mechglue/g_seal.c
lib/gssapi/mechglue/g_sign.c
lib/gssapi/mechglue/g_store_cred.c
lib/gssapi/mechglue/g_unseal.c
lib/gssapi/mechglue/g_userok.c
lib/gssapi/mechglue/g_utils.c
lib/gssapi/mechglue/g_verify.c
lib/gssapi/mechglue/gssd_pname_to_uid.c
lib/gssapi/mechglue/mglueP.h
lib/gssapi/mechglue/oid_ops.c
lib/gssapi/spnego/gssapiP_spnego.h
lib/gssapi/spnego/spnego_mech.c
are subject to the following license:

-vii-
SQL Reference Manual

Copyright © 2004 Sun Microsystems, Inc.


Permission is hereby granted, free of charge, to any person obtaining a copy of this software
and associated documentation files (the “Software”), to deal in the Software without restriction,
including without limitation the rights to use, copy, modify, merge, publish, distribute,
sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is
furnished to do so, subject to the following conditions:
The above copyright notice and this permission notice shall be included in all copies or
substantial portions of the Software.
THE SOFTWARE IS PROVIDED “AS IS”, WITHOUT WARRANTY OF ANY KIND, EXPRESS
OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF
MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT.
IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY
CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT,
TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE
SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.

Npgsql-.Net Data Provider for Postgresql


Copyright © 2002-2008, The Npgsql Development Team
Permission to use, copy, modify, and distribute this software and its documentation for any
purpose, without fee, and without a written agreement is hereby granted, provided that the
above copyright notice and this paragraph and the following two paragraphs appear in all
copies.
IN NO EVENT SHALL THE NPGSQL DEVELOPMENT TEAM BE LIABLE TO ANY PARTY
FOR DIRECT, INDIRECT, SPECIAL, INCIDENTAL, OR CONSEQUENTIAL DAMAGES,
INCLUDING LOST PROFITS, ARISING OUT OF THE USE OF THIS SOFTWARE AND ITS
DOCUMENTATION, EVEN IF THE NPGSQL DEVELOPMENT TEAM HAS BEEN ADVISED
OF THE POSSIBILITY OF SUCH DAMAGE.
THE NPGSQL DEVELOPMENT TEAM SPECIFICALLY DISCLAIMS ANY WARRANTIES,
INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY
AND FITNESS FOR A PARTICULAR PURPOSE. THE SOFTWARE PROVIDED HEREUNDER
IS ON AN "AS IS" BASIS, AND THE NPGSQL DEVELOPMENT TEAM HAS NO
OBLIGATIONS TO PROVIDE MAINTENANCE, SUPPORT, UPDATES, ENHANCEMENTS,
OR MODIFICATIONS.

Open LDAP
The OpenLDAP Public License
Version 2.8, 17 August 2003
Redistribution and use of this software and associated documentation ("Software"), with or
without modification, are permitted provided that the following conditions are met:
1 Redistributions in source form must retain copyright statements and notices,
2 Redistributions in binary form must reproduce applicable copyright statements and notices,
this list of conditions, and the following disclaimer in the documentation and/or other
materials provided with the distribution, and

-viii-
Contents

3 Redistributions must contain a verbatim copy of this document.


The OpenLDAP Foundation may revise this license from time to time. Each revision is
distinguished by a version number. You may use this Software under terms of this license
revision or under the terms of any subsequent revision of the license.
THIS SOFTWARE IS PROVIDED BY THE OPENLDAP FOUNDATION AND ITS
CONTRIBUTORS ``AS IS'' AND ANY EXPRESSED OR IMPLIED WARRANTIES, INCLUDING,
BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS
FOR A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE
OPENLDAP FOUNDATION, ITS CONTRIBUTORS, OR THE AUTHOR(S) OR OWNER(S) OF
THE SOFTWARE BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL,
EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED TO,
PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR
PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF
LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING
NEGLIGENCE OR OTHERWISE) ARISING IN
ANY WAY OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE
POSSIBILITY OF SUCH DAMAGE.
The names of the authors and copyright holders must not be used in advertising or otherwise to
promote the sale, use or other dealing in this Software without specific, written prior permission.
Title to copyright in this Software shall at all times remain with copyright holders.
OpenLDAP is a registered trademark of the OpenLDAP Foundation.
Copyright 1999-2003 The OpenLDAP Foundation, Redwood City, California, USA. All Rights Reserved.
Permission to copy and distribute verbatim copies of this document is granted.

Open SSL
OpenSSL License
Copyright © 1998-2008 The OpenSSL Project. All rights reserved.
Redistribution and use in source and binary forms, with or without modification, are permitted
provided that the following conditions are met:
1 Redistributions of source code must retain the above copyright notice, this list of conditions
and the following disclaimer.
2 Redistributions in binary form must reproduce the above copyright notice, this list of
conditions and the following disclaimer in the documentation and/or other materials provided
with the distribution.
3 All advertising materials mentioning features or use of this software must display the
following acknowledgment: "This product includes software developed by the OpenSSL
Project for use in the OpenSSL Toolkit. (http://www.openssl.org/)"
4 The names "OpenSSL Toolkit" and "OpenSSL Project" must not be used to endorse or
promote products derived from this software without prior written permission. For written
permission, please contact openssl-core@openssl.org.
5 Products derived from this software may not be called "OpenSSL" nor may "OpenSSL"
appear in their names without prior written permission of the OpenSSL Project.

-ix-
SQL Reference Manual

6 Redistributions of any form whatsoever must retain the following acknowledgment: "This
product includes software developed by the OpenSSL Project for use in the OpenSSL
Toolkit (http://www.openssl.org/)"
THIS SOFTWARE IS PROVIDED BY THE OpenSSL PROJECT ``AS IS'' AND ANY
EXPRESSED OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE
IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR
PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE OpenSSL PROJECT OR ITS
CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL,
EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED TO,
PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR
PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF
LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING
NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS
SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.

Perl Artistic License


The "Artistic License"
Preamble
The intent of this document is to state the conditions under which a Package may be copied,
such that the Copyright Holder maintains some semblance of artistic control over the
development of the package, while giving the users of the package the right to use and
distribute the Package in a more-or-less customary fashion, plus the right to make reasonable
modifications.
Definitions:
"Package" refers to the collection of files distributed by the Copyright Holder, and derivatives
of that collection of files created through textual modification.
"Standard Version" refers to such a Package if it has not been modified, or has been
modified in accordance with the wishes of the Copyright Holder as specified below.
"Copyright Holder" is whoever is named in the copyright or copyrights for the package.
"You" is you, if you're thinking about copying or distributing this Package.
"Reasonable copying fee" is whatever you can justify on the basis of media cost, duplication
charges, time of people involved, and so on. (You will not be required to justify it to the
Copyright Holder, but only to the computing community at large as a market that must bear
the fee.)
"Freely Available" means that no fee is charged for the item itself, though there may be fees
involved in handling the item.
It also means that recipients of the item may redistribute it under the same conditions they
received it.
1 You may make and give away verbatim copies of the source form of the Standard Version
of this Package without restriction, provided that you duplicate all of the original copyright
notices and associated disclaimers.
2 You may apply bug fixes, portability fixes and other modifications derived from the Public
Domain or from the Copyright Holder. A Package modified in such a way shall still be
considered the Standard Version.

-x-
Contents

3 You may otherwise modify your copy of this Package in any way, provided that you insert a
prominent notice in each changed file stating how and when you changed that file, and
provided that you do at least ONE of the following:
a) place your modifications in the Public Domain or otherwise make them Freely Available,
such as by posting said modifications to Usenet or an equivalent medium, or placing the
modifications on a major archive site such as uunet.uu.net, or by allowing the Copyright
Holder to include your modifications in the Standard Version of the Package.
b) use the modified Package only within your corporation or organization.
c) rename any non-standard executables so the names do not conflict with standard
executables, which must also be provided, and provide a separate manual page for each
non-standard executable that clearly documents how it differs from the Standard Version.
d) make other distribution arrangements with the Copyright Holder.
4 You may distribute the programs of this Package in object code or executable form,
provided that you do at least ONE of the following:
a) distribute a Standard Version of the executables and library files, together with
instructions (in the manual page or equivalent) on where to get the Standard Version.
b) accompany the distribution with the machine-readable source of the Package with your
modifications.
c) give non-standard executables non-standard names, and clearly document the
differences in manual pages (or equivalent), together with instructions on where to get the
Standard Version.
d) make other distribution arrangements with the Copyright Holder.
5 You may charge a reasonable copying fee for any distribution of this Package. You may
charge any fee you choose for support of this Package. You may not charge a fee for this
Package itself. However, you may distribute this Package in aggregate with other (possibly
commercial) programs as part of a larger (possibly commercial) software distribution
provided that you do not advertise this Package as a product of your own. You may embed
this Package's interpreter within an executable of yours (by linking); this shall be construed
as a mere form of aggregation, provided that the complete Standard Version of the
interpreter is so embedded.
6 The scripts and library files supplied as input to or produced as output from the programs of
this Package do not automatically fall under the copyright of this Package, but belong to
whoever generated them, and may be sold commercially, and may be aggregated with this
Package. If such scripts or library files are aggregated with this Package via the so-called
"undump" or "unexec" methods of producing a binary executable image, then distribution of
such an image shall neither be construed as a distribution of this Package nor shall it fall
under the restrictions of Paragraphs 3 and 4, provided that you do not represent such an
executable image as a Standard Version of this Package.
7 C subroutines (or comparably compiled subroutines in other languages) supplied by you
and linked into this Package in order to emulate subroutines and variables of the language
defined by this Package shall not be considered part of this Package, but are the equivalent
of input as in Paragraph 6, provided these subroutines do not change the language in any
way that would cause it to fail the regression tests for the language.
8 Aggregation of this Package with a commercial distribution is always permitted provided that
the use of this Package is embedded; that is, when no overt attempt is made to make this

-xi-
SQL Reference Manual

Package's interfaces visible to the end user of the commercial distribution. Such use shall
not be construed as a distribution of this Package.
9 The name of the Copyright Holder may not be used to endorse or promote products derived
from this software without specific prior written permission.
10 THIS PACKAGE IS PROVIDED "AS IS" AND WITHOUT ANY EXPRESS OR IMPLIED
WARRANTIES, INCLUDING, WITHOUT LIMITATION, THE IMPLIED WARRANTIES OF
MERCHANTIBILITY AND FITNESS FOR A PARTICULAR PURPOSE.

PHP License
The PHP License, version 3.01
Copyright © 1999 - 2009 The PHP Group. All rights reserved.
Redistribution and use in source and binary forms, with or without modification, is permitted
provided that the following conditions are met:
1 Redistributions of source code must retain the above copyright notice, this list of conditions
and the following disclaimer.
2 Redistributions in binary form must reproduce the above copyright notice, this list of
conditions and the following disclaimer in the documentation and/or other materials provided
with the distribution.
3 The name "PHP" must not be used to endorse or promote products derived from this
software without prior written permission. For written permission, please contact
group@php.net.
4 Products derived from this software may not be called "PHP", nor may "PHP" appear in their
name, without prior written permission from group@php.net. You may indicate that your
software works in conjunction with PHP by saying "Foo for PHP" instead of calling it "PHP
Foo" or "phpfoo"
5 The PHP Group may publish revised and/or new versions of the license from time to time.
Each version will be given a distinguishing version number.
Once covered code has been published under a particular version of the license, you may always
continue to use it under the terms of that version. You may also choose to use such covered code
under the terms of any subsequent version of the license published by the PHP Group. No one other
than the PHP Group has the right to modify the terms applicable to covered code created under
this License.
6 Redistributions of any form whatsoever must retain the following acknowledgment:
"This product includes PHP software, freely available from <http://www.php.net/software/>".
THIS SOFTWARE IS PROVIDED BY THE PHP DEVELOPMENT TEAM ``AS IS'' AND ANY
EXPRESSED OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE
IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR
PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE PHP DEVELOPMENT TEAM OR
ITS CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL,
EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED TO,
PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR
PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF
LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING
NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS
SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.

-xii-
Contents

This software consists of voluntary contributions made by many individuals on behalf of the
PHP Group.
The PHP Group can be contacted via Email at group@php.net.
For more information on the PHP Group and the PHP project, please see <http://www.php.net>.
PHP includes the Zend Engine, freely available at <http://www.zend.com>.

PostgreSQL
This product uses the PostgreSQL Database Management System(formerly known as
Postgres, then as Postgres95)
Portions Copyright © 1996-2005, The PostgreSQL Global Development Group
Portions Copyright © 1994, The Regents of the University of California
Permission to use, copy, modify, and distribute this software and its documentation for any
purpose, without fee, and without a written agreement is hereby granted, provided that the
above copyright notice and this paragraph and the following two paragraphs appear in all
copies.
IN NO EVENT SHALL THE UNIVERSITY OF CALIFORNIA BE LIABLE TO ANY PARTY FOR
DIRECT, INDIRECT, SPECIAL, INCIDENTAL, OR CONSEQUENTIAL DAMAGES,
INCLUDING LOST PROFITS, ARISING OUT OF THE USE OF THIS SOFTWARE AND ITS
DOCUMENTATION, EVEN IF THE UNIVERSITY OF CALIFORNIA HAS BEEN ADVISED OF
THE POSSIBILITY OF SUCH DAMAGE.
THE UNIVERSITY OF CALIFORNIA SPECIFICALLY DISCLAIMS ANY WARRANTIES,
INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY
AND FITNESS FOR A PARTICULAR PURPOSE. THE SOFTWARE PROVIDED HEREUNDER
IS ON AN "AS IS" BASIS, AND THE UNIVERSITY OF CALIFORNIA HAS NO OBLIGATIONS
TO PROVIDE MAINTENANCE, SUPPORT, UPDATES, ENHANCEMENTS, OR
MODIFICATIONS.

Python Dialog
The Administration Tools part of this product uses Python Dialog, a Python module for doing
console-mode user interaction.
Upstream Author:
Peter Astrand <peter@cendio.se>
Robb Shecter <robb@acm.org>
Sultanbek Tezadov <http://sultan.da.ru>
Florent Rougon <flo@via.ecp.fr>
Copyright © 2000 Robb Shecter, Sultanbek Tezadov
Copyright © 2002, 2003, 2004 Florent Rougon
License:

-xiii-
SQL Reference Manual

This package is free software; you can redistribute it and/or modify it under the terms of the
GNU Lesser General Public License as published by the Free Software Foundation; either
version 2 of the License, or (at your option) any later version.
This package is distributed in the hope that it is useful, but WITHOUT ANY WARRANTY;
without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR
PURPOSE. See the GNU Lesser General Public License for more details.
You should have received a copy of the GNU Lesser General Public License along with this
package; if not, write to the Free Software Foundation, Inc., 51 Franklin St, Fifth Floor, Boston,
MA 02110-1301 USA
The complete source code of the Python dialog package and complete text of the GNU Lesser
General Public License can be found on the Vertica Systems Web site at
http://www.vertica.com/licenses/pythondialog-2.7.tar.bz2
http://www.vertica.com/licenses/pythondialog-2.7.tar.bz2

RRDTool Open Source License


Note: rrdtool is a dependency of using the ganglia-web third-party tool. RRDTool allows the
graphs displayed by ganglia-web to be produced.
RRDTOOL - Round Robin Database Tool
A tool for fast logging of numerical data graphical display of this data.
Copyright © 1998-2008 Tobias Oetiker
All rights reserved.
GNU GPL License
This program is free software; you can redistribute it and/or modify it under the terms of the
GNU General Public License as published by the Free Software Foundation; either version 2 of
the License, or (at your option) any later version.
This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY;
without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR
PURPOSE. See the GNU General Public License for more details.
You should have received a copy of the GNU General Public License along with this program; if
not, write to the Free Software Foundation, Inc., 59 Temple Place - Suite 330, Boston, MA
02111-1307, USA
FLOSS License Exception
(Adapted from http://www.mysql.com/company/legal/licensing/foss-exception.html)
I want specified Free/Libre and Open Source Software ("FLOSS") applications to be able to use
specified GPL-licensed RRDtool libraries (the "Program") despite the fact that not all FLOSS
licenses are compatible with version 2 of the GNU General Public License (the "GPL").
As a special exception to the terms and conditions of version 2.0 of the GPL:
You are free to distribute a Derivative Work that is formed entirely from the Program and one or
more works (each, a "FLOSS Work") licensed under one or more of the licenses listed below,

-xiv-
Contents

as long as:
1 You obey the GPL in all respects for the Program and the Derivative Work, except for
identifiable sections of the Derivative Work which are not derived from the Program, and
which can reasonably be considered independent and separate works in themselves
2 All identifiable sections of the Derivative Work which are not derived from the Program, and
which can reasonably be considered independent and separate works in themselves
§ are distributed subject to one of the FLOSS licenses listed below, and
§ the object code or executable form of those sections are accompanied by the complete
corresponding machine-readable source code for those sections on the same medium
and under the same FLOSS license as the corresponding object code or executable
forms of those sections.
3 Any works which are aggregated with the Program or with a Derivative Work on a volume of
a storage or distribution medium in accordance with the GPL, can reasonably be considered
independent and separate works in themselves which are not derivatives of either the
Program, a Derivative Work or a FLOSS Work.
If the above conditions are not met, then the Program may only be copied, modified, distributed
or used under the terms and conditions of the GPL.
FLOSS License List
License name Version(s)/Copyright Date
Academic Free License 2.0
Apache Software License 1.0/1.1/2.0
Apple Public Source License 2.0
Artistic license From Perl 5.8.0
BSD license "July 22 1999"
Common Public License 1.0
GNU Library or "Lesser" General Public License (LGPL) 2.0/2.1
IBM Public License, Version 1.0
Jabber Open Source License 1.0
MIT License (As listed in file MIT-License.txt) -
Mozilla Public License (MPL) 1.0/1.1
Open Software License 2.0
OpenSSL license (with original SSLeay license) "2003" ("1998")
PHP License 3.0
Python license (CNRI Python License) -
Python Software Foundation License 2.1.1
Sleepycat License "1999"
W3C License "2001"
X11 License "2001"
Zlib/libpng License -
Zope Public License 2.0/2.1

Spread

-xv-
SQL Reference Manual

This product uses software developed by Spread Concepts LLC for use in the Spread toolkit.
For more information about Spread see http://www.spread.org (http://www.spread.org).
Copyright (c) 1993-2006 Spread Concepts LLC. All rights reserved.
Redistribution and use in source and binary forms, with or without modification, are permitted
provided that the following conditions are met:
1 Redistributions of source code must retain the above copyright notice, this list of conditions
and the following disclaimer and request.
2 Redistributions in binary form must reproduce the above copyright notice, this list of
conditions and the following disclaimer and request in the documentation and/or other
materials provided with the distribution.
3 All advertising materials (including web pages) mentioning features or use of this software,
or software that uses this software, must display the following acknowledgment: "This
product uses software developed by Spread Concepts LLC for use in the Spread toolkit. For
more information about Spread see http://www.spread.org"
4 The names "Spread" or "Spread toolkit" must not be used to endorse or promote products
derived from this software without prior written permission.
5 Redistributions of any form whatsoever must retain the following acknowledgment:
6 "This product uses software developed by Spread Concepts LLC for use in the Spread
toolkit. For more information about Spread, see http://www.spread.org"
7 This license shall be governed by and construed and enforced in accordance with the laws
of the State of Maryland, without reference to its conflicts of law provisions. The exclusive
jurisdiction and venue for all legal actions relating to this license shall be in courts of
competent subject matter jurisdiction located in the State of Maryland.
TO THE MAXIMUM EXTENT PERMITTED BY APPLICABLE LAW, SPREAD IS PROVIDED
UNDER THIS LICENSE ON AN AS IS BASIS, WITHOUT WARRANTY OF ANY KIND, EITHER
EXPRESSED OR IMPLIED, INCLUDING, WITHOUT LIMITATION, WARRANTIES THAT
SPREAD IS FREE OF DEFECTS, MERCHANTABLE, FIT FOR A PARTICULAR PURPOSE
OR NON-INFRINGING. ALL WARRANTIES ARE DISCLAIMED AND THE ENTIRE RISK AS
TO THE QUALITY AND PERFORMANCE OF THE CODE IS WITH YOU. SHOULD ANY
CODE PROVE DEFECTIVE IN ANY RESPECT, YOU (NOT THE COPYRIGHT HOLDER OR
ANY OTHER CONTRIBUTOR) ASSUME THE COST OF ANY NECESSARY SERVICING,
REPAIR OR CORRECTION. THIS DISCLAIMER OF WARRANTY CONSTITUTES AN
ESSENTIAL PART OF THIS LICENSE. NO USE OF ANY CODE IS AUTHORIZED
HEREUNDER EXCEPT UNDER THIS DISCLAIMER.
TO THE MAXIMUM EXTENT PERMITTED BY APPLICABLE LAW, IN NO EVENT SHALL THE
COPYRIGHT HOLDER OR ANY OTHER CONTRIBUTOR BE LIABLE FOR ANY SPECIAL,
INCIDENTAL, INDIRECT, OR CONSEQUENTIAL DAMAGES FOR LOSS OF PROFITS,
REVENUE, OR FOR LOSS OF INFORMATION OR ANY OTHER LOSS.
YOU EXPRESSLY AGREE TO FOREVER INDEMNIFY, DEFEND AND HOLD HARMLESS
THE COPYRIGHT HOLDERS AND CONTRIBUTORS OF SPREAD AGAINST ALL CLAIMS,
DEMANDS, SUITS OR OTHER ACTIONS ARISING DIRECTLY OR INDIRECTLY FROM
YOUR ACCEPTANCE AND USE OF SPREAD.
Although NOT REQUIRED, we at Spread Concepts would appreciate it if active users of

-xvi-
Contents

Spread put a link on their web site to Spread's web site when possible. We also encourage
users to let us know who they are, how they are using Spread, and any comments they have
through either e-mail (spread@spread.org) or our web site at
(http://www.spread.org/comments).

SNMP
Various copyrights apply to this package, listed in various separate parts below. Please make
sure that you read all the parts. Up until 2001, the project was based at UC Davis, and the first
part covers all code written during this time. From 2001 onwards, the project has been based at
SourceForge, and Networks Associates Technology, Inc hold the copyright on behalf of the
wider Net-SNMP community, covering all derivative work done since then. An additional
copyright section has been added as Part 3 below also under a BSD license for the work
contributed by Cambridge Broadband Ltd. to the project since 2001. An additional copyright
section has been added as Part 4 below also under a BSD license for the work contributed by
Sun Microsystems, Inc. to the project since 2003.
Code has been contributed to this project by many people over the years it has been in
development, and a full list of contributors can be found in the README file under the THANKS
section.

Part 1: CMU/UCD copyright notice: (BSD like)


Copyright © 1989, 1991, 1992 by Carnegie Mellon University
Derivative Work - 1996, 1998-2000
Copyright © 1996, 1998-2000 The Regents of the University of California
All Rights Reserved
Permission to use, copy, modify and distribute this software and its documentation for any
purpose and without fee is hereby granted, provided that the above copyright notice appears in
all copies and that both that copyright notice and this permission notice appear in supporting
documentation, and that the name of CMU and The Regents of the University of California not
be used in advertising or publicity pertaining to distribution of the software without specific
written permission.
CMU AND THE REGENTS OF THE UNIVERSITY OF CALIFORNIA DISCLAIM ALL
WARRANTIES WITH REGARD TO THIS SOFTWARE, INCLUDING ALL IMPLIED
WARRANTIES OF MERCHANTABILITY AND FITNESS. IN NO EVENT SHALL CMU OR
THE REGENTS OF THE UNIVERSITY OF CALIFORNIA BE LIABLE FOR ANY SPECIAL,
INDIRECT OR CONSEQUENTIAL DAMAGES OR ANY DAMAGES WHATSOEVER
RESULTING FROM THE LOSS OF USE, DATA OR PROFITS, WHETHER IN AN ACTION OF
CONTRACT, NEGLIGENCE OR OTHER TORTIOUS ACTION, ARISING OUT OF OR IN
CONNECTION WITH THE USE OR PERFORMANCE OF THIS SOFTWARE.
Part 2: Networks Associates Technology, Inc copyright notice (BSD)
Copyright © 2001-2003, Networks Associates Technology, Inc
All rights reserved.

-xvii-
SQL Reference Manual

Redistribution and use in source and binary forms, with or without modification, are permitted
provided that the following conditions are met:
• Redistributions of source code must retain the above copyright notice, this list of conditions
and the following disclaimer.
• Redistributions in binary form must reproduce the above copyright notice, this list of
conditions and the following disclaimer in the documentation and/or other materials
provided with the distribution.
• Neither the name of the Networks Associates Technology, Inc nor the names of its
contributors may be used to endorse or promote products derived from this software without
specific prior written permission.
THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS
``AS IS'' AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED
TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A
PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT
HOLDERS OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL,
SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED
TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR
PROFITS OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF
LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING
NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS
SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
Part 3: Cambridge Broadband Ltd. copyright notice (BSD)
Portions of this code are copyright (c) 2001-2003, Cambridge Broadband Ltd.
All rights reserved.
Redistribution and use in source and binary forms, with or without modification, are permitted
provided that the following conditions are met:
• Redistributions of source code must retain the above copyright notice, this list of conditions
and the following disclaimer.
• Redistributions in binary form must reproduce the above copyright notice, this list of
conditions and the following disclaimer in the documentation and/or other materials
provided with the distribution.
• The name of Cambridge Broadband Ltd. may not be used to endorse or promote products
derived from this software without specific prior written permission.
THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDER ``AS IS'' AND ANY
EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED
WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE
DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT HOLDER BE LIABLE FOR ANY
DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL
DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE
GOODS OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS
INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN
CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE)
ARISING IN ANY WAY OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE

-xviii-
Contents

POSSIBILITY OF SUCH DAMAGE.


Part 4: Sun Microsystems, Inc. copyright notice (BSD)
Copyright © 2003 Sun Microsystems, Inc., 4150 Network Circle, Santa Clara,
California 95054, U.S.A. All rights reserved.
Use is subject to license terms below.
This distribution may include materials developed by third parties.
Sun, Sun Microsystems, the Sun logo and Solaris are trademarks or registered trademarks
of Sun Microsystems, Inc. in the U.S. and other countries.
Redistribution and use in source and binary forms, with or without modification, are permitted
provided that the following conditions are met:
• Redistributions of source code must retain the above copyright notice, this list of conditions
and the following disclaimer.
• Redistributions in binary form must reproduce the above copyright notice, this list of
conditions and the following disclaimer in the documentation and/or other materials
provided with the distribution.
• Neither the name of the Sun Microsystems, Inc. nor the names of its contributors may be
used to endorse or promote products derived from this software without specific prior written
permission.
THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS
``AS IS'' AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED
TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A
PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT
HOLDERS OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL,
SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED
TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR
PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF
LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING
NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS
SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
Part 5: Sparta, Inc copyright notice (BSD)
Copyright © 2003-2006, Sparta, Inc
All rights reserved.
Redistribution and use in source and binary forms, with or without modification, are permitted
provided that the following conditions are met:
• Redistributions of source code must retain the above copyright notice, this list of conditions
and the following disclaimer.
• Redistributions in binary form must reproduce the above copyright notice, this list of
conditions and the following disclaimer in the documentation and/or other materials
provided with the distribution.
• Neither the name of Sparta, Inc nor the names of its contributors may be used to endorse or
promote products derived from this software without specific prior written permission.

-xix-
SQL Reference Manual

THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS


``AS IS'' AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED
TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A
PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT
HOLDERS OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL,
SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED
TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR
PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF
LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING
NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS
SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
Part 6: Cisco/BUPTNIC copyright notice (BSD)
Copyright © 2004, Cisco, Inc and Information Network Center of Beijing University of Posts
and Telecommunications.
All rights reserved.
Redistribution and use in source and binary forms, with or without modification, are permitted
provided that the following conditions are met:
• Redistributions of source code must retain the above copyright notice, this list of conditions
and the following disclaimer.
• Redistributions in binary form must reproduce the above copyright notice, this list of
conditions and the following disclaimer in the documentation and/or other materials
provided with the distribution.
• Neither the name of Cisco, Inc, Beijing University of Posts and Telecommunications, nor the
names of their contributors may be used to endorse or promote products derived from this
software without specific prior written permission.
THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS
``AS IS'' AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED
TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A
PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT
HOLDERS OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL,
SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED
TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR
PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF
LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING
NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS
SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
Part 7: Fabasoft R&D Software GmbH & Co KG copyright notice (BSD)
Copyright © Fabasoft R&D Software GmbH & Co KG, 2003
oss@fabasoft.com
Author: Bernhard Penz
Redistribution and use in source and binary forms, with or without modification, are permitted
provided that the following conditions are met:
• Redistributions of source code must retain the above copyright notice, this list of conditions

-xx-
Contents

and the following disclaimer.


• Redistributions in binary form must reproduce the above copyright notice, this list of
conditions and the following disclaimer in the documentation and/or other materials
provided with the distribution.
• The name of Fabasoft R&D Software GmbH & Co KG or any of its subsidiaries, brand or
product names may not be used to endorse or promote products derived from this software
without specific prior written permission.
THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDER ``AS IS'' AND ANY
EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED
WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE
DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT HOLDER BE
LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR
CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF
SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS
INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN
CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE)
ARISING IN ANY WAY OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE
POSSIBILITY OF SUCH DAMAGE.

Tecla Command-line Editing


Copyright (c) 2000 by Martin C. Shepherd.
All rights reserved.
Permission is hereby granted, free of charge, to any person obtaining a copy of this software
and associated documentation files (the "Software"), to deal in the Software without restriction,
including without limitation the rights to use, copy, modify, merge, publish, distribute, and/or sell
copies of the Software, and to permit persons to whom the Software is furnished to do so,
provided that the above copyright notice(s) and this permission notice appear in all copies of
the Software and that both the above copyright notice(s) and this permission notice appear in
supporting documentation.
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS
OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF
MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT
OF THIRD PARTY RIGHTS. IN NO EVENT SHALL THE COPYRIGHT HOLDER OR
HOLDERS INCLUDED IN THIS NOTICE BE LIABLE FOR ANY CLAIM, OR ANY SPECIAL
INDIRECT OR CONSEQUENTIAL DAMAGES, OR ANY DAMAGES WHATSOEVER
RESULTING FROM LOSS OF USE, DATA OR PROFITS, WHETHER IN AN ACTION OF
CONTRACT, NEGLIGENCE OR OTHER TORTIOUS ACTION, ARISING OUT OF OR IN
CONNECTION WITH THE USE OR PERFORMANCE OF THIS SOFTWARE.
Except as contained in this notice, the name of a copyright holder shall not be used in
advertising or otherwise to promote the sale, use or other dealings in this Software without prior
written authorization of the copyright holder.

Webmin Open Source License


Copyright © Jamie Cameron

-xxi-
SQL Reference Manual

All rights reserved.


Redistribution and use in source and binary forms, with or without modification, are permitted
provided that the following conditions are met:
1 Redistributions of source code must retain the above copyright notice, this list of conditions
and the following disclaimer.
2 Redistributions in binary form must reproduce the above copyright notice, this list of
conditions and the following disclaimer in the documentation and/or other materials provided
with the distribution.
3 Neither the name of the developer nor the names of contributors may be used to endorse or
promote products derived from this software without specific prior written permission.
THIS SOFTWARE IS PROVIDED BY THE DEVELOPER ``AS IS'' AND ANY EXPRESS OR
IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES
OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE DISCLAIMED.
IN NO EVENT SHALL THE DEVELOPER OR CONTRIBUTORS BE LIABLE FOR ANY
DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL
DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE
GOODS OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS
INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN
CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE)
ARISING IN ANY WAY OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE
POSSIBILITY OF SUCH DAMAGE.

zlib
This is used by the project to load zipped files directly by COPY command. www.zlib.net/
zlib.h -- interface of the 'zlib' general purpose compression library version 1.2.3, July 18th, 2005
Copyright © 1995-2005 Jean-loup Gailly and Mark Adler
This software is provided 'as-is', without any express or implied warranty. In no event will the
authors be held liable for any damages arising from the use of this software.
Permission is granted to anyone to use this software for any purpose, including commercial
applications, and to alter it and redistribute it freely, subject to the following restrictions:
1 The origin of this software must not be misrepresented; you must not claim that you wrote
the original software. If you use this software in a product, an acknowledgment in the
product documentation would be appreciated but is not required.
2 Altered source versions must be plainly marked as such, and must not be misrepresented
as being the original software.
3 This notice may not be removed or altered from any source distribution.
Jean-loup Gailly jloup@gzip.org
Mark Adler madler@alumni.caltech.edu

-xxii-
Contents

Contents

Technical Support 33

About the Documentation 35


Where to Find the Vertica Documentation ............................................................................................ 35
Reading the Online Documentation ...................................................................................................... 35
Printing Full Books .............................................................................................................................. 36
Suggested Reading Paths ...................................................................................................................... 36
Where to Find Additional Information .................................................................................................. 39
Typographical Conventions .................................................................................................................. 40

Preface 43

SQL Overview 45

System Limits 49

SQL Language Elements 51


Keywords and Reserved Words ............................................................................................................ 51
Keywords .................................................................................................................................. 51
Reserved Words ........................................................................................................................ 54
Identifiers............................................................................................................................................. 55
Constants ............................................................................................................................................. 55
Numeric Constants .................................................................................................................... 56
String Constants (Dollar-Quoted)............................................................................................... 57
String Constants (Standard) ....................................................................................................... 57
Date/Time Constants ................................................................................................................. 58
Operators ............................................................................................................................................. 62
Binary Operators ....................................................................................................................... 62
Boolean Operators ..................................................................................................................... 65
Comparison Operators ............................................................................................................... 65
Data Type Coercion Operators (CAST)...................................................................................... 66
Date/Time Operators ................................................................................................................. 67
Mathematical Operators............................................................................................................. 68
NULL Operators ....................................................................................................................... 69

-xxiii-
SQL Reference Manual

String Concatenation Operators ................................................................................................. 70


Expressions .......................................................................................................................................... 70
Aggregate Expressions .............................................................................................................. 72
CASE Expressions .................................................................................................................... 73
Column References ................................................................................................................... 74
Comments ................................................................................................................................. 75
Date/Time Expressions .............................................................................................................. 76
NULL Value ............................................................................................................................. 78
Numeric Expressions ................................................................................................................. 78
Predicates............................................................................................................................................. 79
BETWEEN-predicate ................................................................................................................ 79
Boolean-predicate...................................................................................................................... 80
column-value-predicate ............................................................................................................. 81
IN-predicate .............................................................................................................................. 82
join-predicate ............................................................................................................................ 83
LIKE-predicate.......................................................................................................................... 84
NULL-predicate ........................................................................................................................ 86
Search Conditions ................................................................................................................................ 86

SQL Data Types 89


Binary Data Types ............................................................................................................................... 89
Boolean Data Type ............................................................................................................................... 93
Character Data Types ........................................................................................................................... 94
Date/Time Data Types .......................................................................................................................... 96
DATE ....................................................................................................................................... 96
TIME ........................................................................................................................................ 97
TIMESTAMP ........................................................................................................................... 99
INTERVAL ............................................................................................................................ 102
Numeric Data Types........................................................................................................................... 103
DOUBLE PRECISION (FLOAT) ............................................................................................ 105
INTEGER ............................................................................................................................... 107
NUMERIC .............................................................................................................................. 107

SQL Functions 111


Aggregate Functions .......................................................................................................................... 112
AVG ....................................................................................................................................... 112
BIT_AND ............................................................................................................................... 113
BIT_OR .................................................................................................................................. 114
BIT_XOR ............................................................................................................................... 115
COUNT .................................................................................................................................. 116
COUNT(*) .............................................................................................................................. 119
MAX....................................................................................................................................... 120
MIN ........................................................................................................................................ 120
STDDEV ................................................................................................................................ 121
STDDEV_POP........................................................................................................................ 121
STDDEV_SAMP .................................................................................................................... 122
SUM ....................................................................................................................................... 122
SUM_FLOAT ......................................................................................................................... 123
VAR_POP............................................................................................................................... 123
VAR_SAMP ........................................................................................................................... 124
VARIANCE ............................................................................................................................ 124

-xxiv-
Contents

Analytic Functions ............................................................................................................................. 125


FIRST_VALUE / LAST_VALUE ........................................................................................... 128
LEAD / LAG........................................................................................................................... 131
RANK / DENSE_RANK ......................................................................................................... 135
ROW_NUMBER .................................................................................................................... 137
Date/Time Functions .......................................................................................................................... 139
ADD_MONTHS ..................................................................................................................... 140
AGE........................................................................................................................................ 142
CLOCK_TIMESTAMP ........................................................................................................... 143
CURRENT_DATE .................................................................................................................. 143
CURRENT_TIME................................................................................................................... 144
CURRENT_TIMESTAMP ...................................................................................................... 144
DATE_PART .......................................................................................................................... 145
DATE_TRUNC....................................................................................................................... 146
DATEDIFF ............................................................................................................................. 147
EXTRACT .............................................................................................................................. 153
GETDATE .............................................................................................................................. 153
GETUTCDATE ...................................................................................................................... 154
ISFINITE ................................................................................................................................ 154
LAST_DAY ............................................................................................................................ 155
LOCALTIME ......................................................................................................................... 156
LOCALTIMESTAMP ............................................................................................................. 156
MONTHS_BETWEEN ........................................................................................................... 157
NOW ...................................................................................................................................... 159
OVERLAPS ............................................................................................................................ 159
STATEMENT_TIMESTAMP ................................................................................................. 160
SYSDATE .............................................................................................................................. 161
TIME_SLICE .......................................................................................................................... 161
TIMEOFDAY ......................................................................................................................... 165
TRANSACTION_TIMESTAMP ............................................................................................. 165
Formatting Functions ......................................................................................................................... 166
TO_CHAR .............................................................................................................................. 166
TO_DATE .............................................................................................................................. 168
TO_TIMESTAMP................................................................................................................... 169
TO_NUMBER ........................................................................................................................ 170
Template Patterns for Date/Time Formatting............................................................................ 171
Template Patterns for Numeric Formatting............................................................................... 174
Mathematical Functions ..................................................................................................................... 176
ABS ........................................................................................................................................ 176
ACOS ..................................................................................................................................... 176
ASIN....................................................................................................................................... 177
ATAN ..................................................................................................................................... 177
ATAN2 ................................................................................................................................... 177
CBRT ..................................................................................................................................... 178
CEILING (CEIL) .................................................................................................................... 178
COS ........................................................................................................................................ 178
COT ........................................................................................................................................ 179
DEGREES .............................................................................................................................. 179
EXP ........................................................................................................................................ 180
FLOOR ................................................................................................................................... 180
HASH ..................................................................................................................................... 181
LN .......................................................................................................................................... 181
LOG........................................................................................................................................ 182

-xxv-
SQL Reference Manual

MOD....................................................................................................................................... 182
MODULARHASH .................................................................................................................. 183
PI ............................................................................................................................................ 184
POWER .................................................................................................................................. 184
RADIANS............................................................................................................................... 184
RANDOM............................................................................................................................... 185
RANDOMINT ........................................................................................................................ 185
ROUND .................................................................................................................................. 186
SIGN....................................................................................................................................... 187
SIN ......................................................................................................................................... 187
SQRT ...................................................................................................................................... 188
TAN........................................................................................................................................ 188
TRUNC................................................................................................................................... 188
WIDTH_BUCKET .................................................................................................................. 189
NULL-handling Functions.................................................................................................................. 191
COALESCE ............................................................................................................................ 191
ISNULL .................................................................................................................................. 192
NULLIF .................................................................................................................................. 193
NVL........................................................................................................................................ 193
NVL2 ...................................................................................................................................... 195
String Functions ................................................................................................................................. 197
ASCII ..................................................................................................................................... 197
BIT_LENGTH ........................................................................................................................ 198
BITCOUNT ............................................................................................................................ 198
BITSTRING_TO_BINARY .................................................................................................... 199
BTRIM ................................................................................................................................... 200
CHARACTER_LENGTH ....................................................................................................... 200
CHR........................................................................................................................................ 201
CLIENT_ENCODING ............................................................................................................ 201
DECODE ................................................................................................................................ 202
GREATEST ............................................................................................................................ 203
HEX_TO_BINARY ................................................................................................................ 204
INET_ATON .......................................................................................................................... 205
INET_NTOA .......................................................................................................................... 206
INITCAP ................................................................................................................................ 207
INSTR .................................................................................................................................... 207
LEAST.................................................................................................................................... 209
LEFT ...................................................................................................................................... 210
LENGTH ................................................................................................................................ 211
LOWER .................................................................................................................................. 211
LPAD ..................................................................................................................................... 212
LTRIM.................................................................................................................................... 213
MD5 ....................................................................................................................................... 213
OCTET_LENGTH .................................................................................................................. 214
OVERLAY ............................................................................................................................. 214
POSITION .............................................................................................................................. 215
QUOTE_IDENT ..................................................................................................................... 216
QUOTE_LITERAL ................................................................................................................. 217
REPEAT ................................................................................................................................. 217
REPLACE............................................................................................................................... 218
RIGHT .................................................................................................................................... 219
RPAD ..................................................................................................................................... 220
RTRIM ................................................................................................................................... 220
SPLIT_PART .......................................................................................................................... 221

-xxvi-
Contents

STRPOS ................................................................................................................................. 222


SUBSTR ................................................................................................................................. 222
SUBSTRING .......................................................................................................................... 223
TO_BITSTRING..................................................................................................................... 224
TO_HEX................................................................................................................................. 225
TRANSLATE ......................................................................................................................... 226
TRIM ...................................................................................................................................... 226
UPPER.................................................................................................................................... 227
V6_ATON .............................................................................................................................. 228
V6_NTOA .............................................................................................................................. 229
V6_SUBNETA ....................................................................................................................... 230
V6_SUBNETN ....................................................................................................................... 230
V6_TYPE ............................................................................................................................... 231
System Information Functions ............................................................................................................ 234
CURRENT_DATABASE ........................................................................................................ 234
CURRENT_SCHEMA ............................................................................................................ 234
CURRENT_USER .................................................................................................................. 235
HAS_TABLE_PRIVILEGE .................................................................................................... 235
SESSION_USER .................................................................................................................... 236
USER ...................................................................................................................................... 237
VERSION ............................................................................................................................... 237
Vertica Functions ............................................................................................................................... 238
ADD_DESIGN_TABLES ....................................................................................................... 238
ADD_LOCATION .................................................................................................................. 239
ADVANCE_EPOCH............................................................................................................... 240
ALTER_LOCATION_USE ..................................................................................................... 240
ANALYZE_CONSTRAINTS ................................................................................................. 241
ANALYZE_STATISTICS....................................................................................................... 247
CANCEL_DEPLOYMENT..................................................................................................... 248
CANCEL_REFRESH .............................................................................................................. 248
CLEAR_DESIGN_SEGMENTATION_TABLE...................................................................... 249
CLEAR_DESIGN_TABLES ................................................................................................... 249
CLEAR_QUERY_REPOSITORY ........................................................................................... 250
CLOSE_ALL_SESSIONS ....................................................................................................... 250
CLOSE_SESSION .................................................................................................................. 252
CONFIGURE_DEPLOYMENT .............................................................................................. 253
CREATE_DESIGN ................................................................................................................. 255
CREATE_DESIGN_CONFIGURATION ................................................................................ 255
CREATE_DESIGN_CONTEXT ............................................................................................. 256
CREATE_DESIGN_QUERIES_TABLE ................................................................................. 257
CREATE_PROJECTION_DESIGN ........................................................................................ 257
DEPLOY_DESIGN................................................................................................................. 259
DISABLE_DUPLICATE_KEY_ERROR ................................................................................ 261
DISPLAY_LICENSE .............................................................................................................. 263
DO_TM_TASK....................................................................................................................... 263
DROP_LOCATION ................................................................................................................ 265
DROP_PARTITION ............................................................................................................... 266
DUMP_CATALOG ................................................................................................................ 269
DUMP_LOCKTABLE ............................................................................................................ 270
DUMP_PARTITION_KEYS ................................................................................................... 270
DUMP_PROJECTION_PARTITION_KEYS .......................................................................... 270
DUMP_TABLE_PARTITION_KEYS..................................................................................... 271
EXPORT_CATALOG............................................................................................................. 271

-xxvii-
SQL Reference Manual

EXPORT_DESIGN_CONFIGURATION ................................................................................ 272


EXPORT_DESIGN_TABLES................................................................................................. 273
EXPORT_STATISTICS.......................................................................................................... 273
GET_AHM_EPOCH ............................................................................................................... 274
GET_AHM_TIME .................................................................................................................. 274
GET_CURRENT_EPOCH ...................................................................................................... 275
GET_DESIGN_SCRIPT ......................................................................................................... 275
GET_LAST_GOOD_EPOCH ................................................................................................. 276
GET_NUM_ACCEPTED_ROWS ........................................................................................... 276
GET_NUM_REJECTED_ROWS ............................................................................................ 276
GET_PROJECTION_STATUS ............................................................................................... 277
GET_PROJECTIONS, GET_TABLE_PROJECTIONS ........................................................... 277
IMPLEMENT_TEMP_DESIGN ............................................................................................. 279
INTERRUPT_STATEMENT .................................................................................................. 279
LOAD_DATA_STATISTICS ................................................................................................. 282
LOAD_DESIGN_QUERIES ................................................................................................... 282
MAKE_AHM_NOW............................................................................................................... 283
MARK_DESIGN_KSAFE ...................................................................................................... 285
MEASURE_LOCATION_PERFORMANCE .......................................................................... 286
MERGE_PARTITIONS .......................................................................................................... 287
PARTITION_PROJECTION ................................................................................................... 288
PARTITION_TABLE ............................................................................................................. 289
PURGE ................................................................................................................................... 290
PURGE_PROJECTION .......................................................................................................... 290
PURGE_TABLE ..................................................................................................................... 290
READ_DATA_STATISTICS.................................................................................................. 291
REENABLE_DUPLICATE_KEY_ERROR ............................................................................ 292
REMOVE_DESIGN................................................................................................................ 292
REMOVE_DESIGN_CONTEXT ............................................................................................ 292
REMOVE_DEPLOYMENT_ENTRY ..................................................................................... 293
RESET_DESIGN_QUERIES_TABLE .................................................................................... 293
RESTORE_LOCATION ......................................................................................................... 294
RETIRE_LOCATION ............................................................................................................. 294
REVERT_DEPLOYMENT ..................................................................................................... 295
RUN_DEPLOYMENT ............................................................................................................ 296
SAVE_DESIGN_VERSION ................................................................................................... 297
SAVE_QUERY_REPOSITORY ............................................................................................. 297
SELECT CURRENT_SCHEMA ............................................................................................. 298
SET_AHM_EPOCH................................................................................................................ 298
SET_AHM_TIME ................................................................................................................... 300
SET_DESIGN_KSAFETY ...................................................................................................... 301
SET_DESIGN_LOG_FILE ..................................................................................................... 302
SET_DESIGN_LOG_LEVEL ................................................................................................. 302
SET_DESIGN_PARAMETER ................................................................................................ 303
SET_DESIGN_QUERIES_TABLE ......................................................................................... 303
SET_DESIGN_QUERY_CLUSTER_LEVEL ......................................................................... 304
SET_DESIGN_SEGMENTATION_COLUMN ....................................................................... 305
SET_DESIGN_SEGMENTATION_TABLE ........................................................................... 306
SET_DESIGN_TABLE_ROWS .............................................................................................. 306
SET_LOCATION_PERFORMANCE ..................................................................................... 307
START_REFRESH ................................................................................................................. 308
SYNC_CURRENT_DESIGN .................................................................................................. 309
TEMP_DESIGN_SCRIPT ....................................................................................................... 309
UPDATE_DESIGN................................................................................................................. 310

-xxviii-
Contents

WAIT_DEPLOYMENT .......................................................................................................... 311

SQL Statements 313


ALTER PROJECTION ...................................................................................................................... 314
ALTER SCHEMA ............................................................................................................................. 314
ALTER TABLE ................................................................................................................................. 316
table-constraint ........................................................................................................................ 319
ALTER USER ................................................................................................................................... 321
COMMIT........................................................................................................................................... 322
COPY ................................................................................................................................................ 323
CREATE PROJECTION.................................................................................................................... 337
encoding-type .......................................................................................................................... 340
hash-segmentation-clause ........................................................................................................ 341
range-segmentation-clause ....................................................................................................... 343
CREATE SCHEMA........................................................................................................................... 344
CREATE TABLE .............................................................................................................................. 346
column-definition .................................................................................................................... 348
column-constraint .................................................................................................................... 349
CREATE TEMPORARY TABLE ...................................................................................................... 351
CREATE USER ................................................................................................................................. 355
CREATE VIEW................................................................................................................................. 356
DELETE ............................................................................................................................................ 358
DROP PROJECTION ........................................................................................................................ 360
DROP SCHEMA ............................................................................................................................... 360
DROP TABLE ................................................................................................................................... 362
DROP USER ..................................................................................................................................... 364
DROP VIEW ..................................................................................................................................... 364
EXPLAIN .......................................................................................................................................... 365
GRANT (Schema).............................................................................................................................. 370
GRANT (Table) ................................................................................................................................. 371
GRANT (View) ................................................................................................................................. 372
INSERT ............................................................................................................................................. 373
LCOPY .............................................................................................................................................. 374
RELEASE SAVEPOINT ................................................................................................................... 374
REVOKE (Schema) ........................................................................................................................... 376
REVOKE (Table)............................................................................................................................... 377
REVOKE (View) ............................................................................................................................... 377
ROLLBACK ...................................................................................................................................... 379
ROLLBACK TO SAVEPOINT .......................................................................................................... 379
SAVEPOINT ..................................................................................................................................... 380
SELECT ............................................................................................................................................ 382
FROM Clause ......................................................................................................................... 384
WHERE Clause....................................................................................................................... 386
GROUP BY Clause ................................................................................................................. 388
HAVING Clause ..................................................................................................................... 390
ORDER BY Clause ................................................................................................................. 391
LIMIT Clause.......................................................................................................................... 393
OFFSET Clause ...................................................................................................................... 394
SET ................................................................................................................................................... 394
DATESTYLE ......................................................................................................................... 396
SEARCH_PATH..................................................................................................................... 397

-xxix-
SQL Reference Manual

SESSION CHARACTERISTICS............................................................................................. 398


TIME ZONE ........................................................................................................................... 399
SHOW ............................................................................................................................................... 402
SHOW SEARCH_PATH ................................................................................................................... 402
TRUNCATE TABLE ......................................................................................................................... 403
UNION .............................................................................................................................................. 404
UPDATE ........................................................................................................................................... 408

SQL System Tables (Monitoring APIs) 409


V_CATALOG Schema ...................................................................................................................... 411
COLUMNS ............................................................................................................................. 411
FOREIGN_KEYS ................................................................................................................... 412
GRANTS ................................................................................................................................ 413
PRIMARY_KEYS .................................................................................................................. 415
PROJECTIONS....................................................................................................................... 416
TABLE_CONSTRAINTS ....................................................................................................... 418
TABLES ................................................................................................................................. 419
TYPES .................................................................................................................................... 420
USERS.................................................................................................................................... 421
VIEW_COLUMNS ................................................................................................................. 421
VIEWS ................................................................................................................................... 422
SYSTEM_TABLES ................................................................................................................ 423
V_MONITOR Schema ....................................................................................................................... 424
ACTIVE_EVENTS ................................................................................................................. 424
COLUMN_STORAGE ............................................................................................................ 426
CURRENT_SESSION ............................................................................................................ 428
DISK_RESOURCE_REJECTIONS......................................................................................... 431
DISK_STORAGE ................................................................................................................... 432
EVENT_CONFIGURATIONS ................................................................................................ 435
EXECUTION_ENGINE_PROFILES ...................................................................................... 436
HOST_RESOURCES .............................................................................................................. 437
LOAD_STREAMS ................................................................................................................. 440
LOCAL_NODES .................................................................................................................... 441
LOCKS ................................................................................................................................... 441
NODE_RESOURCES ............................................................................................................. 444
PARTITIONS ......................................................................................................................... 445
PROJECTION_REFRESHES .................................................................................................. 446
PROJECTION_STORAGE ..................................................................................................... 448
QUERY_METRICS ................................................................................................................ 449
QUERY_PROFILES ............................................................................................................... 450
RESOURCE_REJECTIONS ................................................................................................... 452
RESOURCE_USAGE ............................................................................................................. 454
SESSION_PROFILES ............................................................................................................. 457
SESSIONS .............................................................................................................................. 458
STORAGE_CONTAINERS .................................................................................................... 460
SYSTEM ................................................................................................................................ 461
TUPLE_MOVER_OPERATIONS........................................................................................... 462
WOS_CONTAINER_STORAGE ............................................................................................ 463
Deprecated System Tables .................................................................................................................. 467
VT_ACTIVE_EVENTS .......................................................................................................... 468
VT_COLUMN_STORAGE ..................................................................................................... 469
VT_CURRENT_SESSION ..................................................................................................... 469

-xxx-
Contents

VT_DISK_RESOURCE_REJECTIONS.................................................................................. 472
VT_DISK_STORAGE ............................................................................................................ 472
VT_EE_PROFILING .............................................................................................................. 473
VT_GRANT ........................................................................................................................... 474
VT_LOAD_STREAMS........................................................................................................... 476
VT_LOCK .............................................................................................................................. 476
VT_NODE_INFO ................................................................................................................... 479
VT_PARTITIONS .................................................................................................................. 480
VT_PROJECTION.................................................................................................................. 481
VT_PROJECTION_REFRESH ............................................................................................... 482
VT_PROJECTION_STORAGE .............................................................................................. 484
VT_QUERY_METRICS ......................................................................................................... 484
VT_QUERY_PROFILING ...................................................................................................... 485
VT_RESOURCE_REJECTIONS ............................................................................................ 486
VT_RESOURCE_USAGE ...................................................................................................... 487
VT_SCHEMA......................................................................................................................... 488
VT_SESSION ......................................................................................................................... 489
VT_SESSION_PROFILING ................................................................................................... 490
VT_SYSTEM ......................................................................................................................... 491
VT_TABLE ............................................................................................................................ 492
VT_TABLE_STORAGE ......................................................................................................... 493
VT_TUPLE_MOVER ............................................................................................................. 493
VT_VIEW............................................................................................................................... 494
VT_WOS_STORAGE ............................................................................................................. 494

Index 497

-xxxi-
Technical Support
To submit problem reports, questions, comments, and suggestions, use the Technical Support
page on the Vertica Systems, Inc. Web site.

Note: You must be a registered user in order to access the support page.
1 Go to http://www.vertica.com/support (http://www.vertica.com/support).
2 Click My Support.
You can also email verticahelp@vertica.com.
Before reporting a problem, run the Diagnostics Utility described in the Troubleshooting Guide
and attach the resulting .zip file.

-33-
About the Documentation
This section describes how to access and print Vertica documentation. It also includes
suggested reading paths (page 36).

Where to Find the Vertica Documentation


You can read or download the Vertica documentation for the current release of Vertica® Analytic
Database from the Vertica Systems, Inc. Web site's Product Documentation Page
http://www.vertica.com/v-zone/product_documentation. You must be a registered user to access
this page.
The documentation is available as an rpm (which you can install on the database server system
in the directory: /opt/vertica/doc), .tar.gz, or .zip file. You need a V-Zone login to access the
documentation.
Note: The documentation on the Vertica Systems, Inc. Web site is updated each time a
new release is issued. If you are using an older version, refer to the documentation on your
database server or client systems.
See Installing Vertica Documentation.

Reading the Online Documentation


Reading the HTML Documentation Files
The Vertica documentation files are provided in HTML browser format for platform
independence. The HTML files require only a browser that displays frames properly with
JavaScript enabled. The HTML files do not require a Web (HTTP) server.
The Vertica documentation has been tested on the following browsers:
• Internet Explorer 7
• FireFox
Please report any script, image rendering, or text formatting problems to Technical Support (on
page 33).
Note: The Vertica documentation contains links to Web sites of other companies or
organizations that Vertica does not own or control. If you find broken links, please let us
know.

Printing the HTML Documentation Files


Mozilla Firefox
1 Right-click the frame containing the information you want to print.
2 From the menu bar, select File > Print.
3 In the Print Frames window, make sure The Selected Frame is checked, and click OK.

-35-
SQL Reference Manual

Note: In later versions of Firefox, you can right-click inside the source frame and select This
Frame > Print Frame > OK from the submenu.
Internet Explorer 7
1 Drag select the information you want to print.
2 From the menu bar, select File > Print.
3 Under the General tab, click Selection from the Page Range window and click Print.

Printing Full Books


Vertica also publishes books as Adobe Acrobat™ PDF. The books are designed to be printed on
standard 8½ x 11 paper using full duplex (two-sided printing).
Note: Vertica manuals are topic driven and not meant to be read in a linear fashion.
Therefore, the PDFs do not resemble the format of typical books. Because each topic starts
a new page, some of the pages are very short, and there are blank pages between each
topic.
Open and print the PDF documents using the Adobe Reader. You can download the latest
version of the free Acrobat Reader from the Adobe Web site
(http://www.adobe.com/products/acrobat/readstep2.html).
The following list provides links to the PDFs.
• Release Notes
• Concepts Guide
• Installation and Configuration Guide
• Getting Started Guide
• Administrator's Guide
• Programmer's Guide
• SQL Reference Manual
• Troubleshooting Guide

Suggested Reading Paths


This section provides a suggested reading path for various users. Vertica recommends that you
read the manuals listed under All Users first.

All Users
• Release Notes — Release-specific information, including new features and behavior
changes to the product and documentation
• Concepts Guide — Basic concepts critical to understanding Vertica
• Getting Started Guide — Step-by-step guide to getting Vertica up and running

System Administrators
• Installation and Configuration Guide — Platform configuration and software installation

-36-
About the Documentation

• Release Notes — Release-specific information, including new features and behavior


changes to the product and documentation
• Troubleshooting Guide — General troubleshooting information

Database Administrators
• Installation and Configuration Guide — Platform configuration and software installation
• Administrator's Guide — Database configuration, loading, security, and maintenance
• Troubleshooting Guide — General troubleshooting information

Application Developers
• Programmer's Guide — Connecting to a database, queries, transactions, and so on
• SQL Reference Manual — SQL and Vertica-specific language information
• Troubleshooting Guide — General troubleshooting information

-37-
Where to Find Additional Information
Visit the Vertica Systems, Inc. Web site (http://www.vertica.com) to keep up to date with:
• Downloads
• Frequently Asked Questions (FAQs)
• Discussion forums
• News, tips, and techniques

-39-
40

Typographical Conventions
The following are the typographical and syntatax conventions used in the Vertica documentation.

Typographical Convention Description


Bold Indicates areas of emphasis, such as a special menu command.

Code SQL and program code displays in a monospaced (fixed-width) font.


Database objects Names of database objects, such as tables, are shown in san-serif type.

Emphasis Indicates emphasis and the titles of other documents or system files; for
example, vertica.log.
monospace Indicates literal interactive or programmatic input/output.

monospace italics Indicates user-supplied information in interactive or programmatic


input/output.
UPPERCASE Indicates the name of a SQL command or keyword.
SQL keywords are case insensitive; SELECT is the same as Select,
which is the same as select.
User input Text entered by the user is shown in bold san serif type.
indicates the Return/Enter key; implicit on all user input that includes text
Right-angle bracket > Indicates a flow of events, usually from a drop-down menu.
Click Indicates that the reader should click options, such as menu command
buttons, radio buttons, and mouse selections; for example, "Click OK to
proceed."
Press Indicates that the reader perform some action on the keyboard; for
example, "Press Enter."

Syntax Convention Description


Alternatives { } When precisely one of the options must be chosen, the alternatives are
enclosed in curly braces; for example:
QUOTES { ON | OFF }
It indicates that exactly one of ON or OFF must be provided.You do not
type the braces.
Backslash \ Continuation character used to indicate text that is too long to fit on a
single line.
Brackets [ ] Indicate optional items; for example, CREATE TABLE
[schema_name.]table_name
The brackets indicate that the schema_name is optional. Do not type the
square brackets.

-40-
About the Documentation

Ellipses ... Indicate a repetition of the previous parameter. For example,


option[,...] means that you can enter multiple, comma-separated
options.
Indentation Is an attempt to maximize readability; SQL is a free-form language.

Placeholders Items that must be replaced with appropriate identifiers or expressions are
shown in italics.
Vertical bar | When none or only one of a list of items must be chosen, the items are
separated by vertical bars (also called a pipe) with the list enclosed in
square brackets; for example: [ ASC | DESC ]
The option indicates that you can choose one of ASC, DESC, or neither.
You do not type the square brackets.
Vertical ellipses Indicate an optional sequence of similar items or that part of the text has
been omitted for readability.

-41-
Preface
This document provides a reference description of the Vertica SQL database language.

Audience
This document is intended for anyone who uses Vertica. It assumes that you are familiar with the
basic concepts and terminology of the SQL language and relational database management
systems.

-43-
SQL Overview
SQL (Structured Query Language) is a widely-used, industry standard data definition and data
manipulation language for relational databases.
Note: In Vertica, use a semicolon to end a statement or combine multiple statements.

Vertica Support for ANSI SQL Standards


Vertica SQL supports a subset of ANSI SQL 99. Over time, Vertica SQL expands and eventually
converges with ANSI SQL 99.
See BNF Grammar for SQL-99 (http://savage.net.au/SQL/sql-99.bnf.html)

Vertica Major Extensions to SQL


Vertica provides several extensions to SQL that allow you to use the unique aspects of its
column store architecture:
• AT EPOCH LATEST SELECT... runs a SQL query in snapshot isolation mode in which
Vertica does not hold locks or block other processes, such as data loads.
• AT TIME 'timestamp' SELECT... runs historical queries against a snapshot of the database a
specific date and time.
• CONSTRAINT ... CORRELATION (column) REFERENCES (column) captures Functional
Dependencies, which the Database Designer can use to produce more efficient projections.
• COPY is used for bulk loading data. It reads data from a text file and inserts tuples into the
WOS (Write Optimized Store) or directly into the ROS (Read Optimized Store).
• CREATE/DROP/ALTER PROJECTION manipulate projections. CREATE PROJECTION
commands are generated for you by the Database Designer, as described in the
Administrator's Guide.
• SELECT Vertica <Function> executes special Vertica functions (page 238).
• SET DATESTYLE chooses the format in which date and time values display.
• SET SEARCH_PATH specifies the order in which Vertica searches through multiple
schemas when a SQL statement contains an unqualified table name.
• SET TIME ZONE specifies the TIME ZONE run-time parameter for the current session.
• SHOW displays run-time parameters for the current session.

Support for Historical Queries


Unlike most databases, the DELETE (page 358) command in Vertica does not delete data; it
simply marks records as deleted. The UPDATE (page 408) command performs an INSERT and
a DELETE. This behavior is necessary for historical queries. You can control how much historical
data is stored on disk.

Non-standard Syntax and Semantics


In case of non-standard SQL syntax or semantics, Vertica SQL follows Oracle whenever
possible. For Oracle SQL documentation, you need a web account to access the library. See
Oracle Database 10g Documentation Library (http://www.oracle.com/pls/db102/homepage).

-45-
SQL Reference Manual

Joins
Vertica supports typical data warehousing query joins, such as joins on columns with primary
key-foreign key relationships, inner joins, and some outer joins with restrictions based on the
physical schema. For details, see Joins in the Programmer's Guide.

Transactions
Session-scoped isolation levels determine transaction characteristics for transactions within a
specific user session. You set them through the SET SESSION CHARACTERISTICS (page 398)
command. Specifically, they determine what data a transaction can access when other
transactions are running concurrently.
Although the Vertica query parser understands all four standard SQL isolation levels (READ
UNCOMMITTED, READ COMMITTED, REPEATABLE READ, and SERIALIZABLE) for a user
session, internally Vertica uses only READ COMMITTED and SERIALIZABLE. Vertica
automatically translates READ UNCOMMITTED to READ COMMITTED and REPEATABLE
READ to SERIALIZABLE. Therefore, the isolation level Vertica uses could be more strict than
the isolation level you choose.
By default, Vertica uses the SERIALIZABLE isolation level. However, you can change the
isolation level for the database or individual transactions. See Changing Transaction Isolation
Levels.
The following table highlights the behaviors of READ COMMITTED and SERIALIZABLE
isolation. For specific information see, SERIALIZABLE Isolation and READ COMMITTED
Isolation.

Isolation Level Epoch Used Dirty Read Non Repeatable Read Phantom Read
READ COMMITTED Last epoch for reads Not Possible Possible Possible
and current epoch
for writes
SERIALIZABLE Current epoch for Not Possible Not Possible Not Possible
reads and writes

Implementation Details
Vertica supports conventional SQL transactions with standard ACID properties:
• Vertica supports ANSI SQL 92 style-implicit transactions. You do not need to execute a
BEGIN or START TRANSACTION command.
• Vertica does not use a redo/undo log or two-phase commit.
• The COPY (page 323) command automatically commits itself and any current transaction
(except when loading temp tables). Vertica recommends that you COMMIT (page 322) or
ROLLBACK (page 379) the current transaction before using COPY.

-46-
SQL Overview

Automatic Rollback
A rollback reverts data in a database to an earlier state by discarding any changes to the
database state that have been performed by a transaction's statements. In addition, it releases
any locks that the transaction might have held. A rollback can be done automatically in response
to an error or through an explicit rollback transaction.
Vertica supports transaction-level and statement-level rollbacks. A transaction-level rollback
discards all modifications made by a transaction. A statement-level rollback undoes just the
effects made by a particular statement. Most errors caused by a statement result in a
statement-level rollback to undo the effects of the erroneous statement. Vertica uses ERROR
messages to indicate this type of error. DDL errors, systemic failures, dead locks, and resource
constraints result in transaction-level rollback. Vertica uses ROLLBACK messages to indicate
this type of error.
To implement automatic, statement-level rollbacks in response to errors, Vertica automatically
inserts an implicit savepoint after each successful statement one at a time. This marker allows
the next statement, and only the next statement, to be rolled back if it results in an error. If the
statement is successful, the marker automatically rolls forward. Implicit savepoints are available
to Vertica only and cannot be referenced directly.
To explicitly roll back an entire transaction, use the ROLLBACK (page 379) statement. To
explicitly roll back individual statements, use explicit savepoints.

Savepoints
Vertica supports using savepoints. A savepoint is a special mark inside a transaction that allows
all commands that are executed after it was established to be rolled back. This restores the
transaction to the state it was in at the point in which the savepoint was established.
Savepoints are useful when creating nested transactions. For example, a savepoint could be
created at the beginning of a subroutine. That way, the result of the subroutine could be rolled
back if necessary.
Use the SAVEPOINT (page 380) statement to establish a savepoint, the RELEASE
SAVEPOINT (page 374) statement to destroy it, or the ROLLBACK TO SAVEPOINT (page 379)
statement to roll back all operations that occur within the transaction after the savepoint was
established.

-47-
System Limits
This section describes system limits on the size and number of objects in a Vertica database. In
most cases, computer memory and disk drive are the limiting factors.

Item Limit

Database size Approximates the number of files times the file size on a platform,
depending on the maximum disk configuration
Table size 2^64 rows per node, or 2^63 bytes per column, whichever is smaller
Row size 8MB. The row size is approximately the sum of its maximum column
sizes, where, for example a varchar(80) has a maximum size of 80
bytes.

Key size 1600 x 4000

Number of tables/projections per database Limited by physical RAM, as the catalog must fit in memory

Number of concurrent connections per Limited by physical RAM (or threads per process), typically 1024
node

Number of concurrent connections per Limited by physical RAM of a single node (or threads per process),
cluster typically 1024
Number of columns per table 1600
Number of rows per load 2^64, or 2^63 bytes per column, whichever is smaller
Number of partitions 256
Note: The maximum number of partitions varies with the number of
columns in the table, as well as system RAM. Vertica recommends a
maximum of 20 partitions. Ideally, create no more than 12.
Length for a fixed-length column 65000 bytes
Length for a variable-length column 65000 bytes
Length of basic names 128 bytes. Basic names include table names, column names, etc.
Depth of nesting subqueries Unlimited in FROM or WHERE or HAVING clause

-49-
SQL Language Elements
This chapter presents detailed descriptions of the language elements and conventions of Vertica
SQL.

Keywords and Reserved Words


Keywords are words that have a specific meaning in the SQL language. Although SQL is not
case-sensitive with respect to keywords, they are generally shown in uppercase letters
throughout this documentation for readability purposes.
Some SQL keywords are also reserved words that cannot be used in an identifier unless
enclosed in double quote (") characters.

Keywords
ABORT ABSOLUTE ACCESS ACTION ADD
AFTER AGGREGATE ALL ALSO ALTER
ANALYSE ANALYZE AND ANY ARRAY
AS ASC ASSERTION ASSIGNMENT AT
AUTHORIZATIO
N
BACKWARD BEFORE BEGIN BETWEEN BIGINT
BINARY BIT BLOCK_DICT BLOCKDICT_CO BOOLEAN
MP
BOTH BY
CACHE CALLED CASCADE CASE CAST
CATALOGPATH CHAIN CHAR CHARACTER CHARACTERISTIC
S
CHECK CHECKPOINT CLASS CLOSE CLUSTER
COALESCE COLLATE COLUMN COMMENT COMMIT
COMMITTED COMMONDEL CONSTRAINT CONSTRAINTS CONVERSION
TA_COMP
CONVERT COPY CORRELATION CREATE CREATEDB
CREATEUSER CROSS CSV CURRENT_DATA CURRENT_DATE
BASE
CURRENT_TIM CURRENT_TI CURRENT_USER CURRENT_SCHE CURSOR
E MESTAMP MA
CYCLE
DATA DATABASE DATAPATH DAY DEALLOCATE
DEC DECIMAL DECLARE DEFAULT DEFAULTS

-51-
SQL Reference Manual

DEFERRABLE DEFERRED DEFINER DELETE DELIMITER


DELIMITERS DELTARANGE DELTARANGE_C DELTAVAL DESC
_COMP OMP_SP
DETERMINES DIRECT DISTINCT DISTVALINDEX DO
DOMAIN DOUBLE DROP
EACH ELSE ENCODING ENCRYPTED END
EPOCH ERROR ESCAPE EXCEPT EXCEPTIONS
EXCLUDING EXCLUSIVE EXECUTE EXISTS EXPLAIN
EXTERNAL EXTRACT
FALSE FETCH FILLER FIRST FLOAT
FOR FORCE FOREIGN FORWARD FREEZE
FROM FULL FUNCTION
GLOBAL GRANT GROUP
HANDLER HAVING HOLD HOUR
ILIKE IMMEDIATE IMMUTABLE IMPLICIT IN
INCLUDING INCREMENT INDEX INHERITS INITIALLY
INNER INOUT INPUT INSENSITIVE INSERT
INSTEAD INT INTEGER INTERSECT INTERVAL
INTO INVOKER IS ISNULL ISOLATION
JOIN
KEY
LANCOMPILER LANGUAGE LARGE LAST LATEST
LEADING LEFT LESS LEVEL LIKE
LIMIT LISTEN LOAD LOCAL LOCALTIME
LOCALTIMESTA LOCATION LOCK
MP
MATCH MAXVALUE MERGEOUT MINUTE MINVALUE
MOBUF MODE MONTH MOVE MOVEOUT
MULTIALGORIT MULTIALGORI
HM_COMP THM_COMP_
SP
NAMES NATIONAL NATURAL NCHAR NEW
NEXT NO NOCREATEDB NOCREATEUSER NODE
NODES NONE NOT NOTHING NOTIFY
NOTNULL NOWAIT NULL NULLIF NUMERIC

-52-
SQL Language Elements

OBJECT OF OFF OFFSET OIDS


OLD ON ONLY OPERATOR OPTION
OR ORDER OUT OUTER OVERLAPS
OVERLAY OWNER
PARTIAL PASSWORD PINNED PLACING POSITION
PRECISION PREPARE PRESERVE PRIMARY PRIOR
PRIVILEGES PROCEDURA PROCEDURE PROJECTION
L
QUOTE
READ REAL RECHECK RECORD RECOVER
REFERENCES REFRESH REINDEX REJECTED RELATIVE
RELEASE RENAME REPEATABLE REPLACE RESET
RESTART RESTRICT RETURNS REVOKE RIGHT
RLE ROLLBACK ROW ROWS RULE
SAVEPOINT SCHEMA SCROLL SECOND SECURITY
SEGMENTED SELECT SEQUENCE SERIALIZABLE SESSION
SESSION_USE SET SETOF SHARE SHOW
R
SIMILAR SIMPLE SMALLINT SOME SPLIT
STABLE START STATEMENT STATISTICS STDIN
STDOUT STORAGE STRICT SUBSTRING SYSDATE
SYSID
TABLE TABLESPACE TEMP TEMPLATE TEMPORARY
TERMINATOR THAN THEN TIME TIMESTAMPK
TIMESTAMPTZ TIMETZ TO TOAST TRAILING
TRANSACTION TREAT TRIGGER TRIM TRUE
TRUNCATE TRUSTED TYPE
UNCOMMITTED UNENCRYPT UNION UNIQUE UNKNOWN
ED
UNLISTEN UNSEGMENT UNTIL UPDATE USAGE
ED
USER USING
VACUUM VALID VALIDATOR VALINDEX VALUES
VARCHAR VARYING VERBOSE VIEW VOLATILE
WHEN WHERE WITH WITHOUT WORK

-53-
SQL Reference Manual

WRITE
YEAR
ZONE

Reserved Words
ALL ANALYSE ANALYZE

AND ANY ARRAY

AS ASC BOTH

CASE CAST CHECK

COLLATE COLUMN CONSTRAINT

CREATE CURRENT_DATABASE CURRENT_DATE

CURRENT_TIME CURRENT_TIMESTAMP CURRENT_USER

DEFAULT DEFERRABLE DESC

DISTINCT DO ELSE

END EXCEPT FALSE

FILLER FOR FOREIGN

FROM GRANT GROUP

HAVING IN_P INITIALLY

INTERSECT INTO LEADING

LIMIT LOCALTIME LOCALTIMESTAMP

NEW NODE NODES

NOT NULL OFF

OFFSET OLD ON

ONLY OR ORDER

PLACING PRIMARY REFERENCES

-54-
SQL Language Elements

SCHEMA SELECT SESSION_USER

SOME TABLE THEN

TO TRAILING TRUE_P

UNIQUE UNION USER

USING WHEN WHERE

Identifiers
Identifiers (names) of objects such as schema, table, projection, column names, and so on, can
be up to 128 bytes in length.

Unquoted Identifiers
Unquoted SQL identifiers must begin with one of the following:
• An alphabetic character (A-Z or a-z, including letters with diacritical marks and non-Latin
letters)
• Underscore (_)
Subsequent characters in an identifier can be:
• Alphabetic
• Numeric (0-9)
• Dollar sign ($). Dollar sign is not allowed in identifiers according to the SQL standard and
could cause application portability problems.

Quoted Identifiers
Identifiers enclosed in double quote (") characters can contain any character. If you want to
include a double quote, you need a pair of them; for example '""'. You can use names that
would otherwise be invalid, such as names that include only numeric characters ("123") or
contain space characters, punctuation marks, keywords, and so on.
Note: Identifiers are not case-sensitive. Thus, identifiers "ABC", "ABc", and "aBc" are
synonymous, as are ABC, ABc, and aBc.

Constants
Constants are numbers or strings.

-55-
SQL Reference Manual

Numeric Constants
Syntax
digits
| digits.[digits][e[+-]digits]
| [digits].digits[e[+-]digits]
| digitse[+-]digits
| NaN

Parameters
digits represents one or more numeric characters (0 through 9).
Note: You can use NaN in expressions.

Notes
• At least one digit must be before or after the decimal point, if one is used.
• At least one digit must follow the exponent marker (e), if one is present.
• There cannot be any spaces or other characters embedded in the constant.
• Leading plus or minus signs are not actually considered part of the constant; they are unary
operators applied to the constant.
• A numeric constant that contains neither a decimal point nor an exponent is initially
presumed to be type INTEGER if its value fits; otherwise it is presumed to be DOUBLE
PRECISION.
• In most cases a numeric constant is automatically coerced to the most appropriate type
depending on context. When necessary, you can force a numeric value to be interpreted as a
specific data type by casting it as described in Data Type Coercion Operators (CAST)
(page 66).
• Vertica follows the IEEE specification for floating point, including NaN (not a number).
• A NaN is not greater than and at the same time not less than anything, even itself. In other
words, comparisons always return false whenever a NaN is involved. See Numeric
Expressions (page 78) for examples.

Examples
42
3.5
4.
.001
5e2
1.925e-3

-56-
SQL Language Elements

String Constants (Dollar-Quoted)


The standard syntax for specifying string constants can be difficult to understand when the
desired string contains many single quotes or backslashes. To allow more readable queries in
such situations, Vertica SQL provides "dollar quoting." Dollar quoting is not part of the SQL
standard, but it is often a more convenient way to write complicated string literals than the
standard-compliant single quote syntax. It is particularly useful when representing string
constants inside other constants.

Syntax
$$characters$$

Parameters
characters is an arbitrary sequence of UTF-8 characters bounded by paired dollar signs ($$).
Dollar-quoted string content is treated as a literal. Single quote, backslash, and dollar sign
characters have no special meaning within a dollar-quoted string.

Notes
• A dollar-quoted string that follows a keyword or identifier must be separated from it by
whitespace; otherwise the dollar quoting delimiter would be taken as part of the preceding
identifier.
• The string functions do not handle multibyte UTF-8 sequences correctly. They treat each
byte as a character.

Examples
SELECT $$Fred's\n car$$;
?column?
-------------------
Fred's\n car
(1 row)

String Constants (Standard)


Syntax
'characters'

Parameters
characters is an arbitrary sequence of UTF-8 characters bounded by single quotes (').

Using Single Quotes in a String


The SQL standard way of writing a single-quote character within a string constant is to write two
adjacent single quotes. for example:
'Chester''s gorilla'

-57-
SQL Reference Manual

Vertica SQL also allows single quotes to be escaped with a backslash (\). For example:
'Chesters\'s gorilla'

C-style Backslash Escapes


Vertica SQL also supports the following C-style backslash escapes. Any other character
following a backslash is taken literally.
• \\ is a backslash
• \b is a backspace
• \f is a form feed
• \n is a newline
• \r is a carriage return
• \t is a tab
• \xxx, where xxx is an octal number representing a byte with the corresponding code. (It is
your responsibility that the byte sequences you create are valid characters in the server
character set encoding. The character with the code zero cannot be in a string constant.)

Notes
• Vertica supports the UTF-8 character set.
• The string functions not handle multibyte UTF-8 sequences correctly. They treat each byte as
a character.

Examples
SELECT 'This is a string';
?column?
------------------
This is a string
(1 row)

Date/Time Constants
Date or time literal input must be enclosed in single quotes. Input is accepted in almost any
reasonable format, including ISO 8601, SQL-compatible, traditional POSTGRES, and others.
Vertica is more flexible in handling date/time input than the SQL standard requires.The exact
parsing rules of date/time input and for the recognized text fields including months, days of the
week, and time zones are described in Date/Time Expressions (page 76).
Time Zone Values
Vertica attempts to be compatible with the SQL standard definitions for time zones. However, the
SQL standard has an odd mix of date and time types and capabilities. Obvious problems are:
• Although the DATE (page 96) type does not have an associated time zone, the TIME (page
97) type can. Time zones in the real world have little meaning unless associated with a date
as well as a time, since the offset can vary through the year with daylight-saving time
boundaries.

-58-
SQL Language Elements

• Vertica assumes your local time zone for any data type containing only date or time.
• The default time zone is specified as a constant numeric offset from UTC. It is therefore not
possible to adapt to daylight-saving time when doing date/time arithmetic across DST
boundaries.
To address these difficulties, Vertica recommends using DATE/TIME types that contain both
date and time when using time zones. We recommend not using the type TIME WITH TIME
ZONE, even though Vertica supports it for legacy applications and for compliance with the SQL
standard.
Time zones and time-zone conventions are influenced by political decisions, not just earth
geometry. Time zones around the world became somewhat standardized during the 1900's, but
continue to be prone to arbitrary changes, particularly with respect to daylight-savings rules.
Vertica currently supports daylight-savings rules over the time period 1902 through 2038
(corresponding to the full range of conventional UNIX system time). Times outside that range are
taken to be in "standard time" for the selected time zone, no matter what part of the year they fall
in.

Example Description
PST Pacific Standard Time

-8:00 ISO-8601 offset for PST

-800 ISO-8601 offset for PST

-8 ISO-8601 offset for PST

zulu Military abbreviation for UTC

z Short form of zulu

Day of the Week Names


The following tokens are recognized as names of days of the week:

Day Abbreviations
SUNDAY SUN

MONDAY MON

TUESDAY TUE, TUES

WEDNESD WED, WEDS


AY

-59-
SQL Reference Manual

THURSDAY THU, THUR,


THURS

FRIDAY FRI

SATURDAY SAT

Month Names
The following tokens are recognized as names of months:

Month Abbreviations
JANUARY JAN

FEBRUAR FEB
Y

MARCH MAR

APRIL APR

MAY MAY

JUNE JUN

JULY JUL

AUGUST AUG

SEPTEMB SEP, SEPT


ER

OCTOBER OCT

NOVEMBE NOV
R

DECEMBE DEC
R

Interval Values
An interval value represents the duration between two points in time.

-60-
SQL Language Elements

Syntax
[ @ ] quantity unit [ quantity unit... ] [ AGO ]

Parameters
@ (at sign) is optional and ignored
quantity Is an integer numeric constant (page 56)
unit Is one of the following units or abbreviations or plurals of the
following units:
SECOND MONTH
MINUTE YEAR
HOUR DECADE
DAY CENTURY
WEEK MILLENNIUM
AGO [Optional] specifies a negative interval value (an interval going
back in time). 'AGO' is a synonym for '-'.

The amounts of different units are implicitly added up with appropriate sign accounting.

Notes
• Quantities of days, hours, minutes, and seconds can be specified without explicit unit
markings. For example:
'1 12:59:10' is read the same as '1 day 12 hours 59 min 10 sec'
• The boundaries of an interval constant are:
§ '9223372036854775807 usec' to '9223372036854775807 usec ago'
§ 296533 years 3 mons 21 days 04:00:54.775807 to -296533 years -3 mons -21 days
-04:00:54.775807
• The range of an interval constant is +/- 263 - 1 (plus or minus two to the sixty-third minus one)
microseconds.
• In Vertica, the interval fields are additive and accept large floating point numbers.

Examples
SELECT INTERVAL '1 12:59:10';
interval
----------------
1 day 12:59:10
(1 row)
SELECT INTERVAL '9223372036854775807 usec';
interval
---------------------------------------------
296533 years 3 mons 21 days 04:00:54.775807
(1 row)
SELECT INTERVAL '-9223372036854775807 usec';
interval
-------------------------------------------------
-296533 years -3 mons -21 days -04:00:54.775807
(1 row)
SELECT INTERVAL '-1 day 48.5 hours';

-61-
SQL Reference Manual

interval
----------------
1 day 00:30:00
(1 row)
SELECT TIMESTAMP 'Apr 1, 07' - TIMESTAMP 'Mar 1, 07';
?column?
----------
1 mon
(1 row)
SELECT TIMESTAMP 'Mar 1, 07' - TIMESTAMP 'Feb 1, 07';
?column?
----------
1 mon
(1 row)
SELECT TIMESTAMP 'Feb 1, 07' + INTERVAL '30 days';
?column?
---------------------
2007-03-01 00:00:00
(1 row)
SELECT TIMESTAMP WITHOUT TIME ZONE '1999-10-01' + INTERVAL '1 month - 1 second' AS "Oct 31"
Oct 31
---------------------
1999-10-30 23:59:59
(1 row)
INSERT INTO timestamp_arth values (1,'-infinity','epoch','allballs','18days 09 hours 09 mins 999999999999 secs',

Vertica:
t1 |t2 |t3 | t4 | t5 | t6 | t7
----+-----------+------------+----------+------------------------------------+------------------------+----------
1 | -infinity | 1970-01-01 | 00:00:00 | 32150 years 3 mons 2 days 10:55:39 | 1999-01-27 09:09:09-05 | 09:09:09.

Operators
Operators are logical, mathematical, and equality symbols used in SQL to evaluate, compare, or
calculate values.

Binary Operators
Each of the functions in the table below works with binary and varbinary data types. Since binary
can implicitly be converted to varbinary, these functions work for binary types as well. Keep in
mind that when the binary value 'ab'::binary(3) is translated to varbinary, the result is
equivalent to 'ab\\000'::varbinary(3).

Operato Function Description


r
'=' binary_eq Equal to
'<>' binary_ne Not equal to
'<' binary_lt Less than

-62-
SQL Language Elements

'<=' binary_le Less than or equal to


'>' binary_gt Greater than
'>=' binary_ge Greater than or equal to
'&' binary_and And
'~' binary_not Not
'|' binary_or Or
'#' binary_xor Either or
'||' binary_cat Concatenate

Notes
If the arguments vary in length these operators treat the values as though they are all equal in
length by right-extending the smaller values with the zero byte to the full width of the column
(except when using the binary_cat function). Given values 'ff' and 'f', for example, value
'f' is treated as 'f0'.
Operators are strict with respect to nulls. That is, the result is null if any argument is null. For
example, null <> 'a'::binary returns null.

Examples
Note that the zero byte is not removed when values are concatenated. For example:
SELECT 'ab'::BINARY(3) || 'cd'::BINARY(2) AS cat1, 'ab'::VARBINARY(3) ||
'cd'::VARBINARY(2) AS cat2 ;
cat1 | cat2
----------+------
ab\000cd | abcd
(1 row)
The remaining examples illustrate the behavioral differences for binary operands:
The & operator:
SELECT TO_HEX(HEX_TO_BINARY('0x10') & HEX_TO_BINARY('0xF0'));
to_hex
--------
10
(1 row)
The | operator:
SELECT TO_HEX(HEX_TO_BINARY('0x10') | HEX_TO_BINARY('0xF0'));
to_hex
--------
f0
(1 row)
The # operator:
SELECT TO_HEX(HEX_TO_BINARY('0x10') # HEX_TO_BINARY('0xF0'));
to_hex

-63-
SQL Reference Manual

--------
e0
(1 row)
The ~ operator:
SELECT TO_HEX(~ HEX_TO_BINARY('0xF0'));
to_hex
--------
0f
(1 row)

-64-
65

Boolean Operators
Syntax
[ AND | OR | NOT ]

Parameters
SQL uses a three-valued Boolean logic where the null value represents "unknown."

a b a AND a OR
b b

TRUE TRUE TRUE TRUE

TRUE FALSE FALSE TRUE

TRUE NULL NULL TRUE

FALSE FALSE FALSE FALSE

FALSE NULL FALSE NULL

NULL NULL NULL NULL

a NOT a

TRUE FALSE

FALS TRUE
E

NULL NULL

Notes
• The operators AND and OR are commutative, that is, you can switch the left and right operand
without affecting the result. However, the order of evaluation of subexpressions is not
defined. When it is essential to force evaluation order, use a CASE (page 73) construct.
• Do not confuse Boolean operators with the Boolean-predicate (page 80) or the Boolean
(page 93) data type, which can have only two values: true and false.

Comparison Operators
Comparison operators are available for all data types where comparison makes sense. All
comparison operators are binary operators that return values of True, False, or NULL.

-65-
SQL Reference Manual

Syntax and Parameters

< less than

> greater than

<= less than or equal to

>= greater than or equal to

= or <=> equal

<> or != not equal

Notes
• The != operator is converted to <> in the parser stage. It is not possible to implement != and
<> operators that do different things.
• The comparison operators return NULL (signifying "unknown") when either operand is null.
• The <=> operator performs an equality comparison like the = operator, but it returns true,
instead of NULL, if both operands are NULL, and false, instead of NULL, if one operand is
NULL.

Data Type Coercion Operators (CAST)


Data type coercion (casting) passes an expression value to an input conversion routine for a
specified data type, resulting in a constant of the indicated type.

Syntax
CAST ( expression AS data-type )
expression::data-type
data-type 'string'

Parameters
expression Is an expression of any type
data-type Converts the value of expression to one of the following data types:
BINARY (page 89)
BOOLEAN (page 93)
CHARACTER (page 94)
DATE/TIME (page 96)
NUMERIC (page 103)

Notes
Vertica syntax requires the '::' operator to perform data type coercion (casting).

-66-
SQL Language Elements

Type coercion format of data-type 'string' can be used only to specify the data type of a quoted
string constant.
The explicit type cast can be omitted if there is no ambiguity as to the type the constant must be.
For example, when a constant is assigned directly to a column, it is automatically coerced to the
column's data type.
If a binary value is cast (implicitly or explicitly) to a binary type with a smaller length, the value is
silently truncated. For example:
SELECT 'abcd'::BINARY(2);
binary
--------
ab
(1 row)
No casts other than BINARY to and from VARBINARY and resize operations are currently
supported.
On binary data that contains a value with fewer bytes than the target column, values are
right-extended with the zero byte '\0' to the full width of the column. Trailing zeroes on variable
length binary values are not right-extended:
SELECT 'ab'::BINARY(4), 'ab'::VARBINARY(4);
binary | varbinary
------------+-----------
ab\000\000 | ab
(1 row)

Examples
SELECT CAST((2 + 2) AS VARCHAR);
varchar
---------
4
(1 row)
SELECT (2 + 2)::VARCHAR;
varchar
---------
4
(1 row)
SELECT '2.2' + 2;
ERROR: invalid input syntax for integer: "2.2"
SELECT FLOAT '2.2' + 2;
?column?
----------
4.2
(1 row)

Date/Time Operators
Syntax
[ + | - | * | / ]

-67-
SQL Reference Manual

Parameters
+ Addition
- Subtraction
* Multiplication
/ Division

Notes
• The operators described below that take TIME or TIMESTAMP inputs actually come in two
variants: one that takes TIME WITH TIME ZONE or TIMESTAMP WITH TIME ZONE, and one
that takes TIME WITHOUT TIME ZONE or TIMESTAMP WITHOUT TIME ZONE. For brevity,
these variants are not shown separately.
• The + and * operators come in commutative pairs (for example both DATE + INTEGER and
INTEGER + DATE); only one of each such pair is shown.

Example Result Type Result


DATE '2001-09-28' + INTEGER '7' DATE '2001-10-05'
DATE '2001-09-28' + INTERVAL '1 HOUR' TIMESTAMP '2001-09-28 01:00:00'
DATE '2001-09-28' + TIME '03:00' TIMESTAMP '2001-09-28 03:00:00'
INTERVAL '1 DAY' + INTERVAL '1 HOUR' INTERVAL '1 DAY 01:00:00'
TIMESTAMP '2001-09-28 01:00' + INTERVAL '23 HOURS' TIMESTAMP '2001-09-29 00:00:00'
TIME '01:00' + INTERVAL '3 HOURS' TIME '04:00:00'
- INTERVAL '23 HOURS' INTERVAL '-23:00:00'
DATE '2001-10-01' - DATE '2001-09-28' INTEGER '3'
DATE '2001-10-01' - INTEGER '7' DATE '2001-09-24'
DATE '2001-09-28' - INTERVAL '1 HOUR' TIMESTAMP '2001-09-27 23:00:00'
TIME '05:00' - TIME '03:00' INTERVAL '02:00:00'
TIME '05:00' - INTERVAL '2 HOURS' TIME '03:00:00'
TIMESTAMP '2001-09-28 23:00' - INTERVAL '23 HOURS' TIMESTAMP '2001-09-28 00:00:00'
INTERVAL '1 DAY' - INTERVAL '1 HOUR' INTERVAL '1 DAY -01:00:00'
TIMESTAMP '2001-09-29 03:00' - TIMESTAMP '2001-09-27 12:00' INTERVAL '1 DAY 15:00:00'
900 * INTERVAL '1 SECOND' INTERVAL '00:15:00'
21 * INTERVAL '1 DAY' INTERVAL '21 DAYS'
DOUBLE PRECISION '3.5' * INTERVAL '1 HOUR' INTERVAL '03:30:00'
INTERVAL '1 HOUR' / DOUBLE PRECISION '1.5' INTERVAL '00:40:00'

Mathematical Operators
Mathematical operators are provided for many data types.

Operator Description Example Result


! Factorial 5 ! 120
+ Addition 2 + 3 5

-68-
SQL Language Elements

- Subtraction 2 - 3 -1
* Multiplication 2 * 3 6
/ Division (integer division truncates results) 4 / 2 2
% Modulo (remainder) 5 % 4 1
^ Exponentiation 2.0 ^ 3.0 8
|/ Square root |/ 25.0 5
||/ Cube root ||/ 27.0 3
!! Factorial (prefix operator) !! 5 120
@ Absolute value @ -5.0 5
& Bitwise AND 91 & 15 11
| Bitwise OR 32 | 3 35
# Bitwise XOR 17 # 5 20
~ Bitwise NOT ~1 -2
<< Bitwise shift left 1 << 4 16
>> Bitwise shift right 8 >> 2 2

Notes
• The bitwise operators work only on integer data types, whereas the others are available for
all numeric data types.
• Vertica supports the use of the factorial operators on positive and negative floating point
(DOUBLE PRECISION (page 105)) numbers as well as integers. For example:
SELECT 4.98!;
?column?
------------------
115.978600750905
(1 row)
• Factorial is defined in term of the gamma function, where (-1) = Infinity and the other negative
integers are undefined. For example
(-4)! = NaN
-4! = -(4!) = -24.
• Factorial is defined as z! = gamma(z+1) for all complex numbers z. See the Handbook of
Mathematical Functions http://www.math.sfu.ca/~cbm/aands/ (1964) Section 6.1.5.

NULL Operators
To check whether a value is or is not NULL, use the constructs:
expression IS NULL expression IS NOT NULL
Alternatively, use equivalent, but nonstandard, constructs:
expression ISNULL expression NOTNULL

-69-
SQL Reference Manual

Do not write expression = NULL because NULL is not "equal to" NULL. (The null value represents
an unknown value, and it is not known whether two unknown values are equal.) This behavior
conforms to the SQL standard.
Note: Some applications might expect that expression = NULL returns true if expression
evaluates to the null value. Vertica strongly recommends that these applications be
modified to comply with the SQL standard.

String Concatenation Operators


To concatenate two strings on a single line, use the concatenation operator (two consecutive
vertical bars).

Syntax
string || string

Parameters

string Is an expression of type CHAR or VARCHAR

Notes
Two consecutive strings within a single SQL statement on separate lines are concatenated.

Examples
SELECT 'auto' || 'mobile';
?column?
----------
automobile
(1 row)
SELECT 'auto'
'mobile';
?column?
----------
automobile
(1 row)

Expressions
SQL expressions are the components of a query that compare a value or values against other
values. They can also perform calculations. Expressions found inside any SQL command are
usually in the form of a conditional statement.

Operator Precedence
The following table shows operator precedence in decreasing (high to low) order.

-70-
SQL Language Elements

Note: When an expression includes more than one operator, Vertica Systems, Inc.
recommends that you specify the order of operation using parentheses, rather than relying
on operator precedence.

Operator/Element Associativity Description


.
left table/column name separator
::
left typecast
[ ]
left array element selection
-
right unary minus
^
left exponentiation
* / %
left multiplication, division, modulo
+ -
left addition, subtraction
IS
IS TRUE, IS FALSE, IS UNKNOWN, IS
NULL
IN
set membership
BETWEEN
range containment
OVERLAPS
time interval overlap
LIKE
string pattern matching
< >
less than, greater than
=
right equality, assignment
NOT
right logical negation
AND
left logical conjunction
OR
left logical disjunction

Expression Evaluation Rules


The order of evaluation of subexpressions is not defined. In particular, the inputs of an operator
or function are not necessarily evaluated left-to-right or in any other fixed order. To force
evaluation in a specific order, use a CASE (page 73) construct. For example, this is an
untrustworthy way of trying to avoid division by zero in a WHERE clause:
SELECT x, y
WHERE x <> 0 AND y/x > 1.5;

-71-
SQL Reference Manual

But this is safe:


SELECT x, y
WHERE
CASE
WHEN x <> 0 THEN y/x > 1.5
ELSE false
END;
A CASE construct used in this fashion defeats optimization attempts, so it should only be done
when necessary. (In this particular example, it would doubtless be best to sidestep the problem
by writing y > 1.5*x instead.)

Aggregate Expressions
An aggregate expression represents the application of an aggregate function (page 112)
across the rows or groups of rows selected by a query.
Using AVG as an example, the syntax of an aggregate expression is one of the following.
Invokes the aggregate across all input rows for which the given expression yields a non-null
value:
AVG (expression)
Is the same as AVG(expression), since ALL is the default:
AVG (ALL expression)
Invokes the AVG function across all input rows for all distinct, non-null values of the expression,
where expression is any value expression that does not itself contain an aggregate expression.
AVG (DISTINCT expression)
An aggregate expression only can appear in the select list or HAVING clause of a SELECT
statement. It is forbidden in other clauses, such as WHERE, because those clauses are
evaluated before the results of aggregates are formed.

-72-
73

CASE Expressions
The CASE expression is a generic conditional expression that can be used wherever an
expression is valid. It is similar to case and if/then/else statements in other languages.

Syntax (form 1)
CASE
WHEN condition THEN result
[ WHEN condition THEN result ]...
[ ELSE result ]
END

Parameters
condition Is an expression that returns a boolean (true/false) result.
If the result is false, subsequent WHEN clauses are
evaluated in the same manner.
result Specifies the value to return when the associated
condition is true.
ELSE result If no condition is true then the value of the CASE
expression is the result in the ELSE clause. If the ELSE
clause is omitted and no condition matches, the result is
null.

Syntax (form 2)
CASE expression
WHEN value THEN result
[ WHEN value THEN result ]...
[ ELSE result ]
END

Parameters
expression Is an expression that is evaluated and compared to all
the value specifications in the WHEN clauses until one is
found that is equal.
value Specifies a value to compare to the expression.
result Specifies the value to return when the expression is
equal to the specified value.
ELSE result Specifies the value to return when the expression is not
equal to any value; if no ELSE clause is specified, the
value returned is null.

Notes
The data types of all the result expressions must be convertible to a single output type.

-73-
SQL Reference Manual

Examples
SELECT * FROM test;

a
---
1
2
3
SELECT a,
CASE WHEN a=1 THEN 'one'
WHEN a=2 THEN 'two'
ELSE 'other'
END
FROM test;

a | case
---+-------
1 | one
2 | two
3 | other
SELECT a,
CASE a WHEN 1 THEN 'one'
WHEN 2 THEN 'two'
ELSE 'other'
END
FROM test;

a | case
---+-------
1 | one
2 | two
3 | other

Special Example
A CASE expression does not evaluate subexpressions that are not needed to determine the
result. You can use this behavior to avoid division-by-zero errors:
SELECT x
FROM T1
WHERE
CASE WHEN x <> 0 THEN y/x > 1.5
ELSE false
END;

Column References
Syntax
[ [ schemaname. ] tablename. ] columnname

Parameters
schemaname Is the name of the schema

-74-
SQL Language Elements

tablename Is one of:


The name of a table
An alias for a table defined by means of a FROM clause in a query
columnname Is the name of a column that must be unique across all the tables being used in a query

Notes
There are no space characters in a column reference.
If you do not specify a schemaname, Vertica searches the existing schemas according to the
order defined in the SET SEARCH_PATH command.

Example
This example uses the schema from the VMart Example Database.
In the following command, transaction_type and transaction_time are the unique
column references, store is the name of the schema, and store_sales_fact is the table
name:
SELECT transaction_type, transaction_time FROM store.store_sales_fact
ORDER BY transaction_time;
transaction_type | transaction_time
------------------+------------------
purchase | 00:00:23
purchase | 00:00:32
purchase | 00:00:54
purchase | 00:00:54
purchase | 00:01:15
purchase | 00:01:30
purchase | 00:01:50
return | 00:03:34
return | 00:03:35
purchase | 00:03:39
purchase | 00:05:13
purchase | 00:05:20
purchase | 00:05:23
purchase | 00:05:27
purchase | 00:05:30
purchase | 00:05:35
purchase | 00:05:35
purchase | 00:05:42
return | 00:06:36
purchase | 00:06:39
(20 rows)

Comments
A comment is an arbitrary sequence of characters beginning with two consecutive hyphen
characters and extending to the end of the line. For example:
-- This is a standard SQL comment

-75-
SQL Reference Manual

A comment is removed from the input stream before further syntax analysis and is effectively
replaced by white space.
Alternatively, C-style block comments can be used where the comment begins with /* and
extends to the matching occurrence of */.
/* multiline comment
* with nesting: /* nested block comment */
*/
These block comments nest, as specified in the SQL standard. Unlike C, you can comment out
larger blocks of code that might contain existing block comments.

Date/Time Expressions
Vertica uses an internal heuristic parser for all date/time input support. Dates and times are input
as strings, and are broken up into distinct fields with a preliminary determination of what kind of
information might be in the field. Each field is interpreted and either assigned a numeric value,
ignored, or rejected. The parser contains internal lookup tables for all textual fields, including
months, days of the week, and time zones.
The date/time type inputs are decoded using the following procedure.
• Break the input string into tokens and categorize each token as a string, time, time zone, or
number.
• If the numeric token contains a colon (:), this is a time string. Include all subsequent digits
and colons.
• If the numeric token contains a dash (-), slash (/), or two or more dots (.), this is a date string
which might have a text month.
• If the token is numeric only, then it is either a single field or an ISO 8601 concatenated date
(for example, 19990113 for January 13, 1999) or time (for example, 141516 for 14:15:16).
• If the token starts with a plus (+) or minus (-), then it is either a time zone or a special field.
• If the token is a text string, match up with possible strings.
• Do a binary-search table lookup for the token as either a special string (for example, today),
day (for example, Thursday), month (for example, January), or noise word (for example, at,
on).
• Set field values and bit mask for fields. For example, set year, month, day for today, and
additionally hour, minute, second for now.
• If not found, do a similar binary-search table lookup to match the token with a time zone.
• If still not found, throw an error.
• When the token is a number or number field:
• If there are eight or six digits, and if no other date fields have been previously read, then
interpret as a "concatenated date" (for example, 19990118 or 990118). The interpretation is
YYYYMMDD or YYMMDD.
• If the token is three digits and a year has already been read, then interpret as day of year.
• If four or six digits and a year has already been read, then interpret as a time (HHMM or
HHMMSS).

-76-
SQL Language Elements

• If three or more digits and no date fields have yet been found, interpret as a year (this forces
yy-mm-dd ordering of the remaining date fields).
• Otherwise the date field ordering is assumed to follow the DateStyle setting: mm-dd-yy,
dd-mm-yy, or yy-mm-dd. Throw an error if a month or day field is found to be out of range.
• If BC has been specified, negate the year and add one for internal storage. (There is no year
zero in the Gregorian calendar, so numerically 1 BC becomes year zero.)
• If BC was not specified, and if the year field was two digits in length, then adjust the year to
four digits. If the field is less than 70, then add 2000, otherwise add 1900.
Tip: Gregorian years AD 1-99 can be entered by using 4 digits with leading zeros (for example,
0099 is AD 99).

Month Day Year Ordering


For some formats, ordering of month, day, and year in date input is ambiguous and there is
support for specifying the expected ordering of these fields. See Date/Time Run-Time
Parameters for information about output styles.

Special Date/Time Values


Vertica supports several special date/time values for convenience, as shown below. All of these
values need to be written in single quotes when used as constants in SQL statements.
The values INFINITY and -INFINITY are specially represented inside the system and are
displayed the same way. The others are simply notational shorthands that are converted to
ordinary date/time values when read. (In particular, NOW and related strings are converted to a
specific time value as soon as they are read.)

String Valid Data Types Description


epoch DATE, TIMESTAMP 1970-01-01 00:00:00+00 (UNIX SYSTEM
TIME ZERO)

INFINITY TIMESTAMP Later than all other time stamps

-INFINITY TIMESTAMP Earlier than all other time stamps

NOW DATE, TIME, Current transaction's start time


TIMESTAMP
Note: NOW is not the same as the NOW (on
page 159) function.
TODAY DATE, TIMESTAMP Midnight today

TOMORROW DATE, TIMESTAMP Midnight tomorrow

YESTERDAY DATE, TIMESTAMP Midnight yesterday

ALLBALLS TIME 00:00:00.00 UTC

-77-
SQL Reference Manual

The following SQL-compatible functions can also be used to obtain the current time value for the
corresponding data type:
• CURRENT_DATE (page 143)
• CURRENT_TIME (page 144)
• CURRENT_TIMESTAMP (page 144)
• LOCALTIME (page 156)
• LOCALTIMESTAMP (page 156)
The latter four accept an optional precision specification. (See Date/Time Functions (page
139).) Note however that these are SQL functions and are not recognized as data input strings.

NULL Value
NULL is a reserved keyword used to indicate that a data value is unknown.
Be very careful when using NULL in expressions. NULL is not greater than, less than, equal to,
or not equal to any other expression. Use the Boolean-predicate (on page 80) for determining
whether or not an expression value is NULL.

Notes
NULL appears last (largest) in ascending order.
Vertica also accepts NUL characters ('\0') in constant strings and no longer removes null
characters from VARCHAR fields on input or output. NUL is the ASCII abbreviation of the NULL
character.

See Also
GROUP BY Clause (page 388)

Numeric Expressions
Vertica follows the IEEE specification for floating point, including NaN.
A NaN is not greater than and at the same time not less than anything, even itself. In other
words, comparisons always return false whenever a NaN is involved.

Examples
SELECT CBRT('Nan'); -- cube root
cbrt
------
NaN
(1 row)
SELECT 'Nan' > 1.0;
?column?
----------
f
(1 row)

-78-
SQL Language Elements

Predicates
In general, predicates are truth-valued functions; that is, when invoked, they return a truth value.
Predicates have a set of parameters and arguments.
For example, in the following example WHERE clause:
WHERE name = 'Smith';
• name = 'Smith' is the predicate
• 'Smith' is an expression

BETWEEN-predicate
The special BETWEEN predicate is available as a convenience.

Syntax
a BETWEEN x AND y

Notes
a BETWEEN x AND y
Is equivalent to:
a >= x AND a <= y
Similarly:
a NOT BETWEEN x AND y
is equivalent to
a < x OR a > y

-79-
80

Boolean-predicate
The Boolean predicate retrieves rows where the value of an expression is true, false, or
unknown (null).

Syntax
expression IS [NOT] TRUE
expression IS [NOT] FALSE
expression IS [NOT] UNKNOWN

Notes
• A Boolean predicate always return true or false, never null, even when the operand is null. A
null input is treated as the value UNKNOWN.
• Do not confuse the boolean-predicate with Boolean Operators (on page 65) or the Boolean
(page 93) data type, which can have only two values: true and false.
• IS UNKNOWN and IS NOT UNKNOWN are effectively the same as the NULL-predicate (page
86), except that the input expression does not have to be a single column value. To check a
single column value for NULL, use the NULL-predicate (page 86).

-80-
81

column-value-predicate
Syntax
column-name comparison-op constant-expression

Parameters
column-name Is a single column of one the tables specified in the FROM clause (page
384).
comparison-op Is one of the comparison operators (on page 65).
constant-expression Is a constant value of the same data type as the column-name

Notes
To check a column value for NULL, use the NULL-predicate (page 86).

Examples
Dimension.column1 = 2
Dimension.column2 = 'Seafood'

-81-
82

IN-predicate
Syntax
column-expression [ NOT ] IN ( list-expression )

Parameters
column-expression A single column of one the tables specified in the FROM clause (page
384).
list-expression A comma-separated list of constant values matching the data type of the
column-expression

Examples
x IN (5, 6, 7)

-82-
83

join-predicate
Vertica supports only equi-joins based on a primary key-foreign key relationship between the
joined tables.

Syntax
column-reference (see "Column References" on page 74) = column-reference (see
"Column References" on page 74)

Parameters
column-reference Refers to a column of one the tables specified in the FROM clause
(page 384).

See Also
Adding Primary Key and Foreign Key Constraints in the Administrator's Guide

-83-
84

LIKE-predicate
Retrieves rows where the string value of a column matches a specified pattern. The pattern can
contain one or more wildcard characters, which match all valid characters. ILIKE is equivalent to
LIKE except that the match is case-insensitive (non-standard extension).

Syntax
string { LIKE | ILIKE } pattern [ESCAPE escape-character]
string NOT { LIKE | ILIKE } pattern [ESCAPE escape-character]

Parameters
string (CHAR or VARCHAR) is the column value to be compared to the pattern.
NOT Returns true if LIKE returns false, and the reverse; equivalent to NOT
string LIKE pattern.
pattern Specifies a string containing wildcard characters.
Underscore (_) matches any single character.
Percent sign (%) matches any string of zero or more characters.
ESCAPE Specifies an escape-character. A null escape character ('') disables the
escape mechanism. To match the escape character itself, use two
consecutive escape characters.
escape-charact Causes character to be treated as a literal, rather than a wildcard, when
er preceding an underscore or percent sign character in the pattern. The
default escape character is the backslash (/) character.

Notes
• LIKE requires the entire string expression to match the pattern. To match a sequence of
characters anywhere within a string, the pattern must start and end with a percent sign.
• The LIKE predicate does not ignore trailing "white space" characters. If the data values that
you want to match have unknown numbers of trailing spaces, tabs, etc., terminate each LIKE
predicate pattern with the percent sign wildcard character.
• To use a backslash character as a literal, specify a different escape character and use two
backslashes. For example, if the escape character is circumflex (^), you would use ^\\ to
specify a literal backslash. (Alternative: simply use four backslashes.)
• The use of a column data type other than character or character varying (implicit string
conversion) is not supported and not recommended.
• Error messages caused by the LIKE predicate could refer to it by the following symbols
instead of the actual keywords:
~~ LIKE
~~* ILIKE
!~~ NOT LIKE
!~~* NOT ILIKE

-84-
SQL Language Elements

Examples
'abc' LIKE 'abc' true
'abc' LIKE 'a%' true
'abc' LIKE '_b_' true
'abc' LIKE 'c' false

-85-
86

NULL-predicate
Syntax
column-name IS [ NOT ] NULL

Parameters
column-name Is a single column of one the tables specified in the FROM
clause (page 384).

Examples

a IS NULL
b IS NOT NULL

See Also
NULL Value (page 78)

Search Conditions
Function
To specify a search condition for a WHERE clause, HAVING clause, or JOIN clause.

Syntax
{ expression compare expression
| expression IS [ NOT ] NULL
| expression [ NOT ] LIKE expression
| expression [ NOT ] IN ( { expression | subquery
| ... expr1 , expr2 [, expr3 ] ... } )
| [NOT] EXISTS ( subquery )
| condition AND condition
| condition OR condition
| ( condition )
| ( condition , estimate )
| condition IS [ NOT ] { TRUE | FALSE | UNKNOWN }

Compare Parameters
{ = | > | < | >= | <= | <> | != }

= equal

> greater than

< less than

>= greater than or equal to

-86-
SQL Language Elements

<= less than or equal to

<> or != not equal

Notes
• SQL conditions do not follow Boolean logic, where conditions are either true or false. In SQL,
every condition evaluates as one of TRUE, FALSE, or UNKNOWN. This is called
three-valued logic. The result of a comparison is UNKNOWN if either value being compared
is the NULL value.
• Rows satisfy a search condition if and only if the result of the condition is TRUE. Rows for
which the condition is UNKNOWN do not satisfy the search condition. For more information,
see NULL Value (page 78).
• Subqueries form an important class of expression that is used in many search conditions.

See Also
Expressions (page 70)
Subquery Expressions in the Programmer's Guide

-87-
SQL Data Types
Implicit Data Type Coercion
When there is no ambiguity as to the data type of an expression value, it is implicitly coerced to
match the expected data type. For example:
SELECT 2 + '2';
?column?
----------
4
(1 row)
The quoted string constant '2' is implicitly coerced into an INTEGER value so that it can be the
operand of an arithmetic operator (addition).
SELECT 2 + 2 || 2;
?column?
----------
42
(1 row)
The result of the arithmetic expression 2 + 2 and the INTEGER constant 2 are implicitly coerced
into VARCHAR values so that they can be concatenated.

See Also
Data Type Coercion Operators (CAST) (page 66)

Binary Data Types


Store raw-byte data, such as IP addresses, up to 65000 bytes.

Syntax
BINARY (length)
{ VARBINARY | BINARY VARYING | BYTEA } (max-length)

Parameters
length | max-length Specifies the length of the string.

A binary string is a sequence of octets, or bytes. Binary strings store raw-byte data, while
character strings store text.
The binary data types, BINARY and VARBINARY, are similar to the character data types (page
94), CHAR and VARCHAR, respectively, except that binary data types contain byte strings,
rather than character strings. The allowable maximum length is the same for binary data types
as it is for character data types, except that the length for BINARY and VARBINARY is a length
in bytes, rather than in characters.

BINARY VARBINARY

-89-
SQL Reference Manual

A fixed-width string of length bytes, where the A variable-width string up to a length of


number of bytes is declared as an optional max-length bytes, where the maximum number
specifier to the type. If length is omitted, the of bytes is declared as an optional specifier to
default is 1. Where necessary, values are the type. The default is the default attribute size,
right-extended to the full width of the column which is 80, and the maximum length is 65000
with the zero byte. bytes. Varbinary values are not extended to the
full width of the column. For example:
For example:
SELECT TO_HEX('ab'::VARBINARY(4));
SELECT TO_HEX('ab'::BINARY(4)); to_hex
to_hex --------
---------- 6162
61620000 (1 row)
(1 row)

Notes
• BYTEA is a synonym for VARBINARY.
• You can use several formats when working with binary values (see Loading Binary Data), but
the hexadecimal format is generally the most straightforward and is emphasized in Vertica
documentation.
• The &, ~, | and # binary operands have special behavior for binary data types. See Binary
Operators (page 62) for details and examples.

Inputs
On input, strings are translated from hexadecimal representation to a binary value using the
HEX_TO_BINARY (page 204) function. Strings are translated from bitstring representation to
binary values using the BITSTRING_TO_BINARY (page 199) function. Both functions take a
VARCHAR argument and return a VARBINARY value. See the Examples section below.
Binary values can also be represented in octal format by prefixing the value with a backslash
'\'.
Note: If you use vsql, you must use the escape character (\) when inserting another
backslash on input; for example, input '\141' as '\\141'.
You can also input values represented by printable characters. For example, the hexadecimal
value '0x61' can also be represented by the glyph 'a'.
See Loading Binary Data in the Administrator's Guide for additional binary load formats.

Outputs
Like the input format the output format is a hybrid of octal codes and printable ASCII characters.
A byte in the range of printable ASCII characters (the range [0x20, 0x7e]) is represented by
the corresponding ASCII character, with the exception of the backslash ('\'), which is escaped
as '\\'. All other byte values are represented by their corresponding octal values. For example,
the bytes {97,92,98,99}, which in ASCII are {a,\,b,c}, are translated to text as 'a\\bc'.

Supported Aggregate Functions


The following aggregate functions are supported for binary data types:

-90-
SQL Data Types

• BIT_AND (page 113)


• BIT_OR (page 114)
• BIT_XOR (page 115)
• MAX (page 120)
• MIN (page 120)
BIT_AND, BIT_OR, and BIT_XOR are bitwise operations that are applied to each non-null value
in a group, while MAX and MIN are bytewise comparisons of binary values.
Like their binary operator (page 62) counterparts, if the values in a group vary in length, the
aggregate functions treat the values as though they are all equal in length by extending shorter
values with zero bytes to the full width of the column. For example, given a group containing the
values 'ff', null, and 'f', a binary aggregate ignores the null value and treats the value
'f' as 'f0'. Also, like their binary operator counterparts, these aggregate functions operate on
VARBINARY types explicitly and operate on BINARY types implicitly through casts. See Data
Type Coercion Operators (CAST) (page 66).

Examples
The following example shows VARBINARY HEX_TO_BINARY (page 204)(VARCHAR) and
VARCHAR TO_HEX (page 225)(VARBINARY) usage.
Table t and and its projection are created with binary columns:
CREATE TABLE t (c BINARY(1));
CREATE PROJECTION t_p (c) AS SELECT c FROM t;
Insert minimum byte and maximum byte values:
INSERT INTO t values(HEX_TO_BINARY('0x00'));
INSERT INTO t values(HEX_TO_BINARY('0xFF'));
Binary values can then be formatted in hex on output using the TO_HEX function:
SELECT TO_HEX(c) FROM t;
to_hex
--------
00
ff
(2 rows)
The BIT_AND, BIT_OR, and BIT_XOR functions are interesting when operating on a group of
values. For example, create a sample table and projections with binary columns:
CREATE TABLE t (c VARBINARY(2));
SELECT IMPLEMENT_TEMP_DESIGN('');
INSERT INTO t values(HEX_TO_BINARY('0xFF'));
Note that the aggregate functions treat '0xff' like '0xFF00':
INSERT INTO t values(HEX_TO_BINARY('0xFFFF'));
INSERT INTO t values(HEX_TO_BINARY('0xF00F'));
Query table t to see column c output:
SELECT TO_HEX(c) FROM t;
to_hex

-91-
SQL Reference Manual

--------
ff
ffff
f00f
(3 rows)

Now issue the bitwise AND operation. Because these are aggregate functions, an implicit
GROUP BY operation is performed on results using (ff00&(ffff)&f00f):
SELECT TO_HEX(BIT_AND(c)) FROM t;
to_hex
--------
f000
(1 row)
Issue the bitwise OR operation on (ff00|(ffff)|f00f):
SELECT TO_HEX(BIT_OR(c)) FROM t;
to_hex
--------
ffff
(1 row)
Issue the bitwise XOR operation on (ff00#(ffff)#f00f):
SELECT TO_HEX(BIT_XOR(c)) FROM t;
to_hex
--------
f0f0
(1 row)

See Also
Aggregate functions BIT_AND (page 113), BIT_OR (page 114), BIT_XOR (page 115), MAX
(page 120), and MIN (page 120)
Binary Operators (page 62)
COPY (page 323) for examples loading binary data
Data Type Coercion Operators (CAST) (page 66)
IP conversion function INET_ATON (page 205), INET_NTOA (page 206), V6_ATON (page 228),
V6_NTOA (page 229), V6_SUBNETA (page 230), V6_SUBNETN (page 230), V6_TYPE (page
231)
Loading Binary Data in the Administrator's Guide
String functions BITCOUNT (page 198), BITSTRING_TO_BINARY (page 199),
HEX_TO_BINARY (page 204), LENGTH (page 211), REPEAT (page 217), SUBSTRING (page
223), TO_HEX (page 225), and TO_BITSTRING (page 224)

-92-
SQL Data Types

Boolean Data Type


Vertica provides the standard SQL type BOOLEAN, which has two states: true and false. The
third state in SQL boolean logic is unknown, which is represented by the NULL value.

Syntax
BOOLEAN

Parameters
Valid literal data values for input are:

TRUE 't' 'true' 'y' 'yes' '1'

FALSE 'f' 'false' 'n' 'no' '0'

Notes
• Do not confuse the BOOLEAN data type with Boolean Operators (on page 65) or the
Boolean-predicate (on page 80).
• The keywords TRUE and FALSE are preferred and are SQL-compliant.
• All other values must be enclosed in single quotes.
• Boolean values are output using the letters t and f.

-93-
SQL Reference Manual

Character Data Types


Stores strings of letters, numbers and symbols. Character data can be stored as fixed-length or
variable-length strings; the difference is that fixed-length strings are right-padded with spaces,
and variable-length strings are not padded.

Syntax
[ CHARACTER | CHAR ] ( length )
[ VARCHAR | CHARACTER VARYING ] ( length )

Parameters
length Specifies the length of the string.

Usage
CHAR is a fixed-length, blank padded string. The default length is 1 and the maximum length is
65000 bytes. CHAR columns are right-extended with zeroes to the full width of the column, as
needed. In CHAR fields, NULL characters are handled as ordinary characters.
VARCHAR is a variable-length character data type, with no blank padding added to the storage
of the strings. The VARCHAR(length) field is processed internally as a NULL-padded string of
maximum length n. The default length is 80 and the maximum length is 65000 bytes. Values
terminate at their first NULL byte, if any.

Notes
• When you define character columns, you specify the maximum size of any string to be stored
in the column. For example, if you want to store strings up to 24 characters in length, you
could use either of the following definitions:
CHAR(24) /* fixed-length */
VARCHAR(24) /* variable-length */
• If the data being loaded into a text column exceeds the maximum size for that type, data is
truncated to the specified number of characters.
• String literals in SQL statements must be enclosed in single quotes.
• Remember to include the extra bytes required for multibyte characters in the column-width
declaration.
• Due to compression in Vertica, the cost of over-estimating the length of these fields is
incurred primarily at load time and during sorts.
• NULL appears last (largest) in ascending order. See also GROUP BY Clause (page 388) for
additional information about null ordering.

ASCII NULs
VARCHAR data types accept ASCII NULs.
The following example casts the input string containing NUL values to VARCHAR:
SELECT 'vert\0ica'::CHARACTER VARYING;
varchar

-94-
SQL Data Types

---------
vert
(1 row)
The following example casts the input string containing NUL values to VARBINARY:
SELECT 'vert\0ica'::BINARY VARYING;
varbinary
-------------
vert\000ica
(1 row)
In both cases, the result contains 8 characters, but in the VARCHAR case, the '\000' is not
visible:
SELECT LENGTH('vert\0ica'::CHARACTER VARYING);
length
--------
8
(1 row)
SELECT LENGTH('vert\0ica'::BINARY VARYING);
length
--------
8
(1 row)

-95-
96

Date/Time Data Types


Vertica supports the full set of SQL date and time types. In most cases, a combination of DATE,
TIME, TIMESTAMP WITHOUT TIME ZONE, and TIMESTAMP WITH TIME ZONE should provide a
complete range of date/time functionality required by any application. However, in compliance
with the SQL standard, Vertica also supports the TIME WITH TIME ZONE data type.

Notes
• Vertica uses Julian dates for all date/time calculations. They can correctly predict and
calculate any date more recent than 4713 BC to far into the future, based on the assumption
that the length of the year is 365.2425 days.
• In Vertica, TIME ZONE is a synonym for TIMEZONE.
• All date/time types are stored in eight bits.
• A date/time value of NULL appears first (smallest) in ascending order.
• All the date/time data types accept the special literal value NOW to specify the current date
and time. For example:
SELECT TIMESTAMP 'NOW';
• In Vertica, intervals (page 102) are represented internally as some number of microseconds
and printed as up to 60 seconds, 60 minutes, 24 hours, 30 days, 12 months, and as many
years as necessary. Fields are either positive or negative.

DATE
Consists of a month, day, and year.

Syntax
DATE

Parameters

Low Value High Value Resolution


4713 BC 32767 AD 1 DAY

See SET DATESTYLE (page 396) for information about ordering.

Example Description
January 8, 1999 Unambiguous in any datestyle input mode

1999-01-08 ISO 8601; January 8 in any mode (recommended format)

1/8/1999 January 8 in MDY mode; August 1 in DMY mode

1/18/1999 January 18 in MDY mode; rejected in other modes

-96-
SQL Data Types

01/02/03 January 2, 2003 in MDY mode


February 1, 2003 in DMY mode
February 3, 2001 in YMD mode

1999-Jan-08 January 8 in any mode

Jan-08-1999 January 8 in any mode

08-Jan-1999 January 8 in any mode

99-Jan-08 January 8 in YMD mode, else error

08-Jan-99 January 8, except error in YMD mode

Jan-08-99 January 8, except error in YMD mode

19990108 ISO 8601; January 8, 1999 in any mode

990108 ISO 8601; January 8, 1999 in any mode

1999.008 Year and day of year

J2451187 Julian day

January 8, 99 Year 99 before the Common Era


BC

TIME
Consists of a time of day with or without a time zone.

Syntax
TIME [ (p) ] [ { WITH | WITHOUT } TIME ZONE ] | TIMETZ
[ AT TIME ZONE (see "TIME AT TIME ZONE" on page 99) ]

Parameters
p (Precision) specifies the number of fractional digits retained in the
seconds field. By default, there is no explicit bound on precision.
The allowed range 0 to 6.
WITH TIME ZONE Specifies that valid values must include a time zone

-97-
SQL Reference Manual

WITHOUT TIME ZONE Specifies that valid values do not include a time zone (default). If a
time zone is specified in the input it is silently ignored.
TIMETZ Is the same as TIME WITH TIME ZONE with no precision

Limits

Data Type Low Value High Value Resolution


TIME [p]
00:00:00.00 23:59:59.99 1 MS / 14 digits

TIME [p] WITH TIME ZONE


00:00:00.00+12 23:59:59.99-1 1 ms / 14 digits
2

Example Description
04:05:06.789 ISO 8601

04:05:06 ISO 8601

04:05 ISO 8601

040506 ISO 8601

04:05 AM Same as 04:05; AM does not affect


value

04:05 PM Same as 16:05; input hour must be <=


12

04:05:06.789-8 ISO 8601

04:05:06-08:00 ISO 8601

04:05-08:00 ISO 8601

040506-08 ISO 8601

04:05:06 PST Time zone specified by name

-98-
SQL Data Types

TIME AT TIME ZONE


The TIME AT TIME ZONE construct converts TIMESTAMP and TIMESTAMP WITH ZONE types
to different time zones.
TIME ZONE is a synonym for TIMEZONE. Both are allowed in Vertica syntax.

Syntax
timestamp AT TIME ZONE zone

Parameters
timestamp TIMESTAMP Converts UTC to local time in given time
zone
TIMESTAMP WITH TIME ZONE Converts local time in given time zone to
UTC
TIME WITH TIME ZONE Converts local time across time zones
zone Is the desired time zone specified either as a text string (for example: 'PST') or as
an interval (for example: INTERVAL '-08:00'). In the text case, the available zone
names are abbreviations.

Examples
The local time zone is PST8PDT. The first example takes a zone-less timestamp and interprets it
as MST time (UTC- 7) to produce a UTC timestamp, which is then rotated to PST (UTC-8) for
display:
SELECT TIMESTAMP '2001-02-16 20:38:40' AT TIME ZONE 'MST';
timezone
------------------------
2001-02-16 22:38:40-05
(1 row)
The second example takes a timestamp specified in EST (UTC-5) and converts it to local time in
MST (UTC-7):
SELECT TIMESTAMP WITH TIME ZONE '2001-02-16 20:38:40-05' AT TIME ZONE 'MST';
timezone
---------------------
2001-02-16 18:38:40
(1 row)

TIMESTAMP
Consists of a date and a time with or without a time zone and with or without a historical epoch
(AD or BC).

Syntax
TIMESTAMP [ (p) ] [ { WITH | WITHOUT } TIME ZONE ] | TIMESTAMPTZ
[ AT TIME ZONE (see "TIME AT TIME ZONE" on page 99) ]

-99-
SQL Reference Manual

Parameters
p (Precision) specifies the number of fractional digits retained in the
seconds field. By default, there is no explicit bound on precision. The
allowed range 0 to 6.
WITH TIME ZONE Specifies that valid values must include a time zone. All TIMESTAMP
WITH TIME ZONE values are stored internally in UTC.
They are converted to local time in the zone specified by the time
zone configuration parameter before being displayed to the client.
WITHOUT TIME ZONE Specifies that valid values do not include a time zone (default). If a
time zone is specified in the input it is silently ignored.
TIMESTAMPTZ Is the same as TIMESTAMP WITH TIME ZONE.

Limits

Data Type Low Value High Value Resolution


(rounded) (rounded)
TIMESTAMP [ (p) ] [ WITHOUT TIME ZONE ]
290279 BC 294277 AD 1 US / 14 digits

TIMESTAMP [ (p) ] WITH TIME ZONE


290279 BC 294277 AD 1 US / 14 digits

Notes
• AD/BC can appear before the time zone, but this is not the preferred ordering.
• The SQL standard differentiates TIMESTAMP WITHOUT TIME ZONE and TIMESTAMP WITH
TIME ZONE literals by the existence of a "+"; or "-". Hence, according to the standard:
TIMESTAMP '2004-10-19 10:23:54' is a TIMESTAMP WITHOUT TIME ZONE.
TIMESTAMP '2004-10-19 10:23:54+02' is a TIMESTAMP WITH TIME ZONE.
Note: Vertica differs from the standard by requiring that TIMESTAMP WITH TIME ZONE
literals be explicitly typed:
TIMESTAMP WITH TIME ZONE '2004-10-19 10:23:54+02'
• If a literal is not explicitly indicated as being of TIMESTAMP WITH TIME ZONE, Vertica silently
ignores any time zone indication in the literal. That is, the resulting date/time value is derived
from the date/time fields in the input value, and is not adjusted for time zone.
• For TIMESTAMP WITH TIME ZONE, the internally stored value is always in UTC. An input
value that has an explicit time zone specified is converted to UTC using the appropriate
offset for that time zone. If no time zone is stated in the input string, then it is assumed to be
in the time zone indicated by the system's TIME ZONE parameter, and is converted to UTC
using the offset for the TIME ZONE zone.

-100-
SQL Data Types

• When a TIMESTAMP WITH TIME ZONE value is output, it is always converted from UTC to
the current TIME ZONE zone and displayed as local time in that zone. To see the time in
another time zone, either change TIME ZONE or use the AT TIME ZONE (page 99) construct.
• Conversions between TIMESTAMP WITHOUT TIME ZONE and TIMESTAMP WITH TIME ZONE
normally assume that the TIMESTAMP WITHOUT TIME ZONE value should be taken or given
as TIME ZONE local time. A different zone reference can be specified for the conversion
using AT TIME ZONE.
• TIMESTAMPTZ and TIMETZ are not parallel SQL constructs. TIMESTAMPTZ records a time
and date in GMT, converting from the specified TIME ZONE. TIMETZ records the specified
time and the specified time zone, in minutes, from GMT.timezone
• The following list represents typical date/time input variations:
§ 1999-01-08 04:05:06
§ 1999-01-08 04:05:06 -8:00
§ January 8 04:05:06 1999 PST
• Vertica supports adding a floating-point (in days) to a TIMESTAMP or TIMESTAMPTZ value.
• In Vertica, intervals (page 102) are represented internally as some number of microseconds
and printed as up to 60 seconds, 60 minutes, 24 hours, 30 days, 12 months, and as many
years as necessary. Fields are either positive or negative.

Examples
In the following example, Vertica returns results in years, months, and days, whereas other
RDBMS might return results in days only:
SELECT TIMESTAMP WITH TIME ZONE '02/02/294276'- TIMESTAMP WITHOUT TIME ZONE
'02/20/2009' AS result;
result
------------------------------
292266 years 11 mons 12 days
(1 row)
To specify a specific time zone, add it to the statement, such as the use of 'ACST' in the
following example:
SELECT T1 AT TIME ZONE 'ACST', t2 FROM test;
timezone | t2
---------------------+-------------
2009-01-01 04:00:00 | 02:00:00-07
2009-01-01 01:00:00 | 02:00:00-04
2009-01-01 04:00:00 | 02:00:00-06
You can specify a floating point in days:
SELECT 'NOW'::TIMESTAMPTZ + INTERVAL '1.5 day' AS '1.5 days from now';
1.5 days from now
-------------------------------
2009-03-18 21:35:23.633-04
(1 row)
You can return infinity by simply specifying 'infinity':
SELECT TIMESTAMP 'infinity';

-101-
SQL Reference Manual

timestamp
-----------
infinity
(1 row)
The following example illustrates the difference between TIMESTAMPTZ with and without a
precision specified:
SELECT TIMESTAMPTZ(3) 'now', TIMESTAMPTZ 'now';
timestamptz | timestamptz
----------------------------+-------------------------------
2009-02-24 11:40:26.177-05 | 2009-02-24 11:40:26.177368-05
(1 row)
The following statement returns an error because the TIMESTAMP is out of range:
SELECT TIMESTAMP '294277-01-09 04:00:54.775808';
ERROR: date/time field value out of range: "294277-01-09 04:00:54.775808"

INTERVAL
Measures the difference between two points in time. It is represented internally as a number of
microseconds and printed out as up to:
• 60 seconds
• 60 minutes
• 24 hours
• 30 days
• 12 months
• As many years as necessary
All the fields are either positive or negative.

Syntax
INTERVAL [ (p) ]

Parameters
p (Precision) specifies the number of fractional digits retained in the seconds field in the
range 0 to 6. The default is the precision of the input literal.

Notes
Intervals can be expressed as a combination of fields, some positive and some negative. Vertica
adds these all together to get some number of microseconds, counting days as 24 hours and
months as 30 days.

Examples
SELECT INTERVAL '-1 +02:03' AS "22 hours ago...";
22 hours ago...
-----------------
-21:57:00
(1 row)

-102-
SQL Data Types

SELECT INTERVAL '-1 days +02:03' AS "22 hours ago...";


22 hours ago...
-----------------
-21:57:00
(1 row)
SELECT INTERVAL '10 years -11 month -12 days +13:14' AS "9 years...";
9 years...
--------------------------
9 years 18 days 13:14:00
(1 row)
SELECT'4 millenniums 5 centuries 4 decades 1 year 4 months 4 days 17 minutes 31 seconds'::i
interval
-----------------------------------
4541 years 4 mons 4 days 00:17:31
(1 row)

See Also
Interval Values (page 60) for a description of the values that can be represented in an
INTERVAL type.

Numeric Data Types


Numeric data types are numbers (such as integers) stored in columns in the Vertica® Analytic
Database.
In Vertica, overflows in floats generate +/-infinity, and 0.0/0.0 returns NaN, per the IEEE floating
point standard:
SELECT 0.0/0;
?column?
----------
NaN
(1 row)
For integers, dividing by zero by zero returns zero:
SELECT 0/0;
?column?
----------
0
(1 row)
Dividing anything else by zero returns a runtime error.
SELECT 1/0;
ERROR: division by zero
Add, subtract, and multiply ignore overflow. Sum and average use 128-bit arithmetic internally.
Sum reports an error if the final result overflows, suggesting the use of sum_float(int), which
converts the 128-bit sum to a float8. For example:
CREATE TEMP TABLE t (i INT);
INSERT INTO t VALUES (1<<62);
INSERT INTO t VALUES (1<<62);
INSERT INTO t VALUES (1<<62);

-103-
SQL Reference Manual

INSERT INTO t VALUES (1<<62);


INSERT INTO t VALUES (1<<62);
SELECT SUM(i) FROM t;
ERROR: sum() overflower
HINT: try sum_float() instead
SELECT SUM_FLOAT(i) FROM t;
sum_float
---------------------
2.30584300921369e+19

-104-
SQL Data Types

DOUBLE PRECISION (FLOAT)


Vertica supports the numeric data type DOUBLE PRECISION, which is the IEEE-754 8-byte
floating point type, along with most of the usual floating point operations.

Syntax
[ DOUBLE PRECISION | FLOAT | FLOAT8 ]

Parameters
Note: On a machine whose floating-point arithmetic does not follow IEEE 754, these values
probably do not work as expected.
Double precision is an inexact, variable-precision numeric type. In other words, some values
cannot be represented exactly and are stored as approximations. Thus, input and output
operations involving double precision might show slight discrepancies.
• For exact numeric storage and calculations (money for example), use INTEGER.
• Floating point calculations depend on the behavior of the underlying processor, operating
system, and compiler.
• Comparing two floating-point values for equality might not work as expected.

Values
COPY (page 323) accepts floating-point data in the following format:
1 Optional leading white space
2 An optional plus ("+") or minus sign ("-")
3 A decimal number, a hexadecimal number, an infinity, a NAN, or a null value
A decimal number consists of a non-empty sequence of decimal digits possibly containing a
radix character (decimal point "."), optionally followed by a decimal exponent. A decimal
exponent consists of an "E" or "e", followed by an optional plus or minus sign, followed by a
non-empty sequence of decimal digits, and indicates multiplication by a power of 10.
A hexadecimal number consists of a "0x" or "0X" followed by a non-empty sequence of
hexadecimal digits possibly containing a radix character, optionally followed by a binary
exponent. A binary exponent consists of a "P" or "p", followed by an optional plus or minus sign,
followed by a non-empty sequence of decimal digits, and indicates multiplication by a power of 2.
At least one of radix character and binary exponent must be present.
An infinity is either "INF" or "INFINITY", disregarding case.
A NaN (Not A Number) is "NAN" (disregarding case) optionally followed by a sequence of
characters enclosed in parentheses. The character string specifies the value of NAN in an
implementation-dependent manner. (The Vertica internal representation of NAN is
0xfff8000000000000LL on x86 machines.)
When writing infinity or NAN values as constants in a SQL statement, enclose them in single
quotes. For example:
UPDATE table SET x = 'Infinity'

-105-
SQL Reference Manual

Note: Vertica follows the IEEE definition of NaNs (IEEE 754). The SQL standards do not
specify how floating point works in detail.
IEEE defines NaNs as a set of floating point values where each one is not equal to anything,
even to itself. A NaN is not greater than and at the same time not less than anything, even
itself. In other words, comparisons always return false whenever a NaN is involved.
However, for the purpose of sorting data, NaN values must be placed somewhere in the result.
The value generated 'NaN' appears in the context of a floating point number matches the NaN
value generated by the hardware. For example, Intel hardware generates
(0xfff8000000000000LL), which is technically a Negative, Quiet, Non-signaling NaN.
Vertica uses a different NaN value to represent floating point NULL (0x7ffffffffffffffeLL). This is a
Positive, Quiet, Non-signaling NaN and is reserved by Vertica

The load file format of a null value is user defined, as described in the COPY (page 323)
command. The Vertica internal representation of a null value is 0x7fffffffffffffffLL. The interactive
format is controlled by the vsql printing option null. For example:
\pset null '(null)'
The default option is not to print anything.

Rules
• -0 == +0
• 1/0 = Infinity
• 0/0 == Nan
• NaN != anything (even NaN)
To search for NaN column values, use the following predicate:
... WHERE column != column
This is necessary because WHERE column = 'Nan' cannot be true by definition.

Sort Order (Ascending)


• NaN
• -Inf
• numbers
• +Inf
• Null

Notes
• Vertica does not support REAL (FLOAT4) or NUMERIC.
• NULL appears last (largest) in ascending order.
• All overflows in floats generate +/-infinity or NaN, per the IEEE floating point standard.

-106-
SQL Data Types

INTEGER
A signed 8-byte (64-bit) data type.

Syntax
[ INTEGER | INT | BIGINT | INT8 ]

Parameters
INT, INTEGER, INT8, and BIGINT are all synonyms for the same signed 64-bit integer data
type. Integer data types of other lengths are not supported at this time. Automatic compression
techniques are used to conserve disk space in cases where the full 64 bits are not required.

Notes
• The range of values is -2^63+1 to 2^63-1.
• 2^63 = 9,223,372,036,854,775,808 (19 digits).
• The value -2^63 is reserved to represent NULL.
• NULL appears first (smallest) in ascending order.
• Vertica does not have an explicit 4-byte (32-bit integer) type. Vertica's encoding and
compression automatically eliminate extra spaces.

Restrictions
• The JDBC type INTEGER is 4 bytes and is not supported by Vertica. Use BIGINT instead.
• Vertica does not support the SQL/JDBC types NUMERIC, SMALLINT, or TINYINT.
• Vertica does not check for overflow (positive or negative) except in the aggregate function
SUM (page 122)(). If you encounter overflow when using SUM, use SUM_FLOAT (page
123)() which converts to floating point.

NUMERIC
Numeric data types store numeric data. For example, a money value of $123.45 could be stored
in a NUMERIC(5,2) field.

Syntax
NUMERIC | DECIMAL | NUMBER | MONEY [ ( precision [ , scale ] ) ]

Parameters
precision The number of significant decimal digits, or the number of
digits that the data type stores. Precision p must be positive
and <= 1024.
scale Expressed in decimal digits and can be any integer
representable in a 16-bit field. The default scale s is 0 <=scale
<= precision; omitting scale is the same as s=0.

-107-
SQL Reference Manual

Notes
• NUMERIC, DECIMAL, NUMBER, and MONEY are all synonyms that return NUMERIC types.
Note, however, that the default values for NUMBER and MONEY are implemented a bit
differently:
type | precision | scale
---------+-----------+-------
NUMERIC 37 15
DECIMAL 37 15
NUMBER 38 0
MONEY 18 4
• Numeric data types support exact representations of numbers that can be expressed with a
number of digits before and after a decimal point. This contrasts slightly with existing Vertica
data types:
DOUBLE PRECISION (page 105) (FLOAT) types support ~15 digits, variable exponent, and
represent numeric values approximately.
INTEGER (page 107) (and similar types) support ~18 digits, whole numbers only.
• Numeric data types are generally called exact numeric data types because they store
numbers of a specified precision and scale. By contrast, approximate numeric data types
(DOUBLE PRECISION) use floating points and are less precise.
• Supported operations include the following:
§ Basic math (+, -, *)
§ Aggregation (SUM, MIN, MAX, COUNT)
§ Comparison operators (<, <=, =, <=>, <>, >, >=)
• LZO, RLE, and BLOCK_DICT are supported encoding types. Anything that can be used on
an INTEGER can also be used on a NUMERIC, as long as the precision is <= 18.
• NUMERIC is now preferred for non-INT constants, which should improve precision. For
example:
SELECT 1.1 + 2.2 = 3.3;
?column?
----------
t
(1 row)
SELECT 1.1::float + 2.2::float = 3.3::float;
?column?
----------
f
(1 row)

Restrictions and Cautions


• If you use a NUMERIC data type with a precision greater than 18, performance could be
affected.

-108-
SQL Data Types

• Some of the more complex operations used with NUMERIC data types result in an implicit
cast to FLOAT. For example, when doing division operations, including SQRT, LOG, AVG
and other aggregates, transcendental functions, and TO_CHAR/TO_NUMBER formatting,
the result is always FLOAT. Anything beyond SUM/COUNT produces a FLOAT result.

Input Values
COPY (page 323) accepts DECIMAL number with a decimal point ('.'), prefixed by - or
+(optional).

Examples
The following command creates a table with two columns, one with INTEGER data type and the
other with a NUMERIC data type:
CREATE TABLE num1 (id INTEGER, amount NUMERIC(8,2));
CREATE TABLE
Issue the following command, which creates a temporary physical schema design:
SELECT IMPLEMENT_TEMP_DESIGN('');
IMPLEMENT_TEMP_DESIGN
-----------------------
4
(1 row)
Now insert some values into your table:
INSERT INTO num1 VALUES (1, 123456.78);
OUTPUT
--------
1
(1 row)
And do a simple SELECT on the table:
SELECT * FROM num1;
id | amount
------+-----------
1 | 123456.78
(1 row)
The following example returns the numeric column, amount, from table num1:
SELECT amount FROM num1;
amount
-----------
123456.78
(1 row)

The following syntax adds one (1) to the amount:


SELECT amount+1 AS 'amount' FROM num1;
amount
-----------
123457.78
(1 row)

-109-
SQL Reference Manual

The following syntax multiplies the amount column by 2:


SELECT amount*2 AS 'amount' FROM num1;
amount
-----------
246913.56
(1 row)
The following syntax returns a negative number for the amount column:
SELECT -amount FROM num1;
?column?
------------
-123456.78
(1 row)
The following syntax returns the absolute value of the amount argument:
SELECT ABS(amount) FROM num1;
ABS
-----------
123456.78
(1 row)
The following syntax casts the NUMERIC amount as a FLOAT:
SELECT amount::float FROM num1;
amount
-----------
123456.78
(1 row)

See Also
Mathematical Functions (page 176)

-110-
SQL Functions
Functions return information from the database and are allowed anywhere an expression is
allowed.
This chapter describes the functions that Vertica supports.

-111-
112

Aggregate Functions
Aggregate functions summarize data over groups of rows from a query result set. The groups
are specified using the GROUP BY (page 388) clause. They are allowed only in the select list
and in the HAVING (see "HAVING Clause" on page 390) and ORDER BY (see "ORDER BY
Clause" on page 391) clauses of a SELECT (page 382) statement (as described in Aggregate
Expressions (page 72)).

Notes
• Except for COUNT, these functions return a null value when no rows are selected. In
particular, SUM of no rows returns NULL, not zero.
• In some cases you can replace an expression that includes multiple aggregates with an
single aggregate of an expression. For example SUM(x) + SUM(y) can be expressed as as
SUM(x+y) (where x and y are NOT NULL).
• Vertica does not support nested aggregate functions.

AVG
Computes the average (arithmetic mean) of an expression over a group of rows. It returns a
DOUBLE PRECISION value for a floating-point expression. Otherwise, the return value is the
same as the expression data type.

Syntax
AVG ( [ ALL | DISTINCT ] expression )

Parameters
ALL Invokes the aggregate function for all rows in the group (default)
DISTINCT Invokes the aggregate function for all distinct non-null values of the expression
found in the group
expression (INTEGER, BIGINT, DOUBLE PRECISION, or INTERVAL) contains at least
one column reference (see "Column References" on page 74)

Examples
SELECT AVG(annual_income) FROM customer_dimension;
avg
--------------
2104270.6485
(1 row)
SELECT AVG(pos_transaction_number) FROM store.store_sales_fact;
avg
-----------
2500000.5
(1 row)

-112-
SQL Functions

BIT_AND
Performs a bitwise logical AND operation on two BINARY data type columns.

Syntax
BIT_AND( column1, column2 )

Parameters
column Are the two input BINARY data type columns.

Usage
• The function returns the value of the bitwise logical AND operation.
• If the columns are different lengths, the return values are treated as though they are all equal
in length and are right-extended with zero bytes. For example, given a group containing the
hex values 'ff', null, and 'f', the function ignores the null value and extends the value
'f' to 'f0'.
• BIT_AND operates on VARBINARY types explicitly and operates on BINARY types implicitly
through casts.
• If either parameter is NULL, the return value is also NULL.

Example
First create a sample table and projections with binary columns:
CREATE TABLE t (c VARBINARY(2));
SELECT IMPLEMENT_TEMP_DESIGN('');
INSERT INTO t values(HEX_TO_BINARY('0xFF'));
Note that the aggregate functions treat '0xff' like '0xFF00':
INSERT INTO t values(HEX_TO_BINARY('0xFFFF'));
INSERT INTO t values(HEX_TO_BINARY('0xF00F'));
Query table t to see column c output:
SELECT TO_HEX(c) FROM t;
to_hex
--------
ff
ffff
f00f
(3 rows)

Finally, issue a bitwise AND operation on the first two binary columns, 'ff00' and 'ffff':
SELECT TO_HEX(BIT_AND(c)) FROM t;
to_hex
--------

-113-
SQL Reference Manual

f000
(1 row)

BIT_OR
Performs a bitwise logical OR operation on two BINARY data type columns.

Syntax
BIT_OR( column1, column2 )

Parameters
column Are the two input BINARY data type columns.

Usage
• The function returns the value of the bitwise logical OR operation.
• If the columns are different lengths, the return values are treated as though they are all equal
in length and are right-extended with zero bytes. For example, given a group containing the
hex values 'ff', null, and 'f', the function ignores the null value and extends the value
'f' to 'f0'.
• BIT_OR operates on VARBINARY types explicitly and operates on BINARY types implicitly
through casts.
• If either parameter is NULL, the return value is also NULL.

Example
First create a sample table and projections with binary columns:
CREATE TABLE t (c VARBINARY(2));
SELECT IMPLEMENT_TEMP_DESIGN('');
INSERT INTO t values(HEX_TO_BINARY('0xFF'));
Note that the aggregate functions treat '0xff' like '0xFF00':
INSERT INTO t values(HEX_TO_BINARY('0xFFFF'));
INSERT INTO t values(HEX_TO_BINARY('0xF00F'));
Query table t to see column c output:
SELECT TO_HEX(c) FROM t;
to_hex
--------
ff
ffff
f00f
(3 rows)

Finally, issue a bitwise OR operation:


SELECT TO_HEX(BIT_OR(c)) FROM t;
to_hex

-114-
SQL Functions

--------
ffff
(1 row)

BIT_XOR
Performs a bitwise logical XOR operation on two BINARY data type columns.

Syntax
BIT_XOR( column1, column2 )

Parameters
column Are the two input BINARY data type columns.

Usage
• The function returns the value of the bitwise logical XOR operation.
• If the columns are different lengths, the return values are treated as though they are all equal
in length and are right-extended with zero bytes. For example, given a group containing the
hex values 'ff', null, and 'f', the function ignores the null value and extends the value
'f' to 'f0'.
• BIT_XOR operates on VARBINARY types explicitly and operates on BINARY types implicitly
through casts.
• If either parameter is NULL, the return value is also NULL.

Example
First create a sample table and projections with binary columns:
CREATE TABLE t (c VARBINARY(2));
SELECT IMPLEMENT_TEMP_DESIGN('');
INSERT INTO t values(HEX_TO_BINARY('0xFF'));
Note that the aggregate functions treat '0xff' like '0xFF00':
INSERT INTO t values(HEX_TO_BINARY('0xFFFF'));
INSERT INTO t values(HEX_TO_BINARY('0xF00F'));
Query table t to see column c output:
SELECT TO_HEX(c) FROM t;
to_hex
--------
ff
ffff
f00f
(3 rows)

Finally, issue a bitwise XOR operation:


SELECT TO_HEX(BIT_XOR(c)) FROM t;
to_hex

-115-
SQL Reference Manual

--------
f0f0
(1 row)

COUNT
Returns the number of rows in each group of the result set for which the expression is not null.
The return value is a BIGINT.

Syntax
COUNT ( [ ALL | DISTINCT ] expression )

Parameters
ALL Invokes the aggregate function for all rows in the group (default)
DISTINCT Invokes the aggregate function for all distinct non-null values of the expression
found in the group
expression (Any data type) contains at least one column reference (see "Column
References" on page 74)

Examples
The following query returns the number of distinct values in the primary_key column of the
date_dimension table:
SELECT COUNT (DISTINCT date_key)
FROM date_dimension;
count
-------
1826
(1 row)
The next example returns all distinct values of evaluating the expression x+y for all records of
fact.
SELECT COUNT (DISTINCT date_key + product_key)
FROM inventory_fact;
count
-------
21560
(1 row)
An equivalent query is as follows (we use the LIMIT key to cut back on the number of rows
returned):
SELECT COUNT(date_key + product_key)
FROM inventory_fact
GROUP BY date_key
LIMIT 10;
count
-------
173
31

-116-
SQL Functions

321
113
286
84
244
238
145
202
(10 rows)
Each distinct product_key value in table inventory_fact and returns the number of distinct values
of date_key in all records with the specific distinct product_key value.
SELECT product_key, COUNT (DISTINCT date_key)
FROM inventory_fact
GROUP BY product_key
LIMIT 10;
product_key | count
-------------+-------
1 | 12
2 | 18
3 | 13
4 | 17
5 | 11
6 | 14
7 | 13
8 | 17
9 | 15
10 | 12
(10 rows)
This query counts each distinct product_key value in table inventory_fact with the constant "1".
Note: The DISTINCT keyword is redundant if all members of the SELECT list are present in
the GROUP BY list as well.
SELECT product_key, COUNT (DISTINCT product_key)
FROM inventory_fact
GROUP BY product_key
LIMIT 10;
product_key | count
-------------+-------
1 | 1
2 | 1
3 | 1
4 | 1
5 | 1
6 | 1
7 | 1
8 | 1
9 | 1
10 | 1
(10 rows)

-117-
SQL Reference Manual

This query selects each distinct date_key value and counts the number of distinct product_key
values for all records with the specific product_key value. It then sums of all the qty_in_stock
values in all records with the specific product_key value and groups the results by date_key.
SELECT date_key, COUNT (DISTINCT product_key),
SUM(qty_in_stock)
FROM inventory_fact
GROUP BY date_key
LIMIT 10;
date_key | count | sum
----------+-------+--------
1 | 173 | 88953
2 | 31 | 16315
3 | 318 | 156003
4 | 113 | 53341
5 | 285 | 148380
6 | 84 | 42421
7 | 241 | 119315
8 | 238 | 122380
9 | 142 | 70151
10 | 202 | 95274
(10 rows)
This query selects each distinct product_key value and then counts the number of distinct
date_key values for all records with the specific product_key value and counts the number of
distinct warehouse_key values in all records with the specific product_key value.
SELECT product_key,
COUNT (DISTINCT date_key),
COUNT (DISTINCT warehouse_key)
FROM inventory_fact
GROUP BY product_key
LIMIT 15;
product_key | count | count
-------------+-------+-------
1 | 12 | 12
2 | 18 | 18
3 | 13 | 12
4 | 17 | 18
5 | 11 | 9
6 | 14 | 13
7 | 13 | 13
8 | 17 | 15
9 | 15 | 14
10 | 12 | 12
11 | 11 | 11
12 | 13 | 12
13 | 9 | 7
14 | 13 | 13
15 | 18 | 17
(15 rows)
This query selects each distinct product_key value, counts the number of distinct date_key and
warehouse_key values for all records with the specific product_key value, and then sums all qty_in_stock

-118-
SQL Functions

values in records with the specific product_key value. It then returns the number of product_version values
in records with the specific product_key value.
SELECT product_key,
COUNT (DISTINCT date_key),
COUNT (DISTINCT warehouse_key),
SUM (qty_in_stock),
COUNT (product_version)
FROM inventory_fact
GROUP BY product_key
LIMIT 15;
product_key | count | count | sum | count
-------------+-------+-------+-------+-------
1 | 12 | 12 | 5530 | 12
2 | 18 | 18 | 9605 | 18
3 | 13 | 12 | 8404 | 13
4 | 17 | 18 | 10006 | 18
5 | 11 | 9 | 4794 | 11
6 | 14 | 13 | 7359 | 14
7 | 13 | 13 | 7828 | 13
8 | 17 | 15 | 9074 | 17
9 | 15 | 14 | 7032 | 15
10 | 12 | 12 | 5359 | 12
11 | 11 | 11 | 6049 | 11
12 | 13 | 12 | 6075 | 13
13 | 9 | 7 | 3470 | 9
14 | 13 | 13 | 5125 | 13
15 | 18 | 17 | 9277 | 18
(15 rows)

COUNT(*)
Returns the number of rows in each group of the result set. The return value is a BIGINT.

Syntax
COUNT(*)

Parameters
* Indicates that the count does not apply to any specific column or expression in the select list

Notes
COUNT(*) requires a FROM Clause (page 384).

Examples
The following example returns the number of warehouses from the warehouse dimension table:
SELECT COUNT(warehouse_name) FROM warehouse_dimension;
count
-------
100
(1 row)
The next example returns the total number of vendors:

-119-
SQL Reference Manual

SELECT COUNT(*) FROM vendor_dimension;


count
-------
50
(1 row)

MAX
Returns the greatest value of an expression over a group of rows. The return value is the same
as the expression data type.

Syntax
MAX ( [ ALL | DISTINCT ] expression )

Parameters
ALL | DISTINCT Are meaningless in this context
expression (Any numeric, string, binary, or date/time type) contains at least one
column reference (see "Column References" on page 74)

Example
This example returns the largest value (dollar amount) of the sales_dollar_amount column.
SELECT MAX(sales_dollar_amount) AS Highest_Sale
FROM store.store_sales_fact;
highest_sale
--------------
600
(1 row)

MIN
Returns the smallest value of an expression over a group of rows. The return value is the same
as the expression data type.

Syntax
MIN ( [ ALL | DISTINCT ] expression )

Parameters
ALL | DISTINCT Are meaningless in this context
expression (Any numeric, string, binary, or date/time type) contains at least one
column reference (see "Column References" on page 74)

Example
This example returns the lowest salary.

-120-
SQL Functions

SELECT MIN(annual_salary) AS Lowest_Paid


FROM employee_dimension;
lowest_paid
-------------
1200
(1 row)

STDDEV
The non-standard function STDDEV is provided for compatibility with other databases. It is
semantically identical to STDDEV_SAMP.
Evaluates the statistical sample standard deviation for each member of the group.
STDDEV_SAMP(expression) = SQRT(VAR_SAMP(expression))

Syntax
STDDEV_SAMP( expression )

Parameters
expression (INTEGER, BIGINT, DOUBLE PRECISION, or INTERVAL) contains at least
one column reference (see "Column References" on page 74)

Examples

SELECT STDDEV_SAMP(household_id)
FROM customer_dimension;
stddev_samp
------------------
8651.50842400771
(1 row)

See Also
STDDEV_SAMP (page 122)

STDDEV_POP
Evaluates the statistical population standard deviation for each member of the group.
STDDEV_POP(expression) = SQRT(VAR_POP(expression))

Syntax
STDDEV_POP ( expression )

Parameters
expression (INTEGER, BIGINT, DOUBLE PRECISION, or INTERVAL) contains at least
one column reference (see "Column References" on page 74)

-121-
SQL Reference Manual

Examples

SELECT STDDEV_POP(household_id)
FROM customer_dimension;
stddev_samp
------------------
8651.41895973367
(1 row)

STDDEV_SAMP
Evaluates the statistical sample standard deviation for each member of the group.
STDDEV_SAMP(expression) = SQRT(VAR_SAMP(expression))

Syntax
STDDEV_SAMP( expression )

Parameters
expression (INTEGER, BIGINT, DOUBLE PRECISION, or INTERVAL) contains at least
one column reference (see "Column References" on page 74)

Examples

SELECT STDDEV_SAMP(household_id)
FROM customer_dimension;
stddev_samp
------------------
8651.50842400771
(1 row)

SUM
Computes the sum of an expression over a group of rows. It returns a DOUBLE PRECISION
value for a floating-point expression. Otherwise, the return value is the same as the expression
data type.

Syntax
SUM ( [ ALL | DISTINCT ] expression )

Parameters
ALL Invokes the aggregate function for all rows in the group (default)
DISTINCT Invokes the aggregate function for all distinct non-null values of the expression
found in the group
expression (INTEGER, BIGINT, DOUBLE PRECISION, or INTERVAL) contains at least
one column reference (see "Column References" on page 74)

-122-
SQL Functions

Notes
If you encounter overflow when using SUM, use SUM_FLOAT() (page 123) which converts to
floating point.

Example
This example returns the total sum of the product_cost column.
SELECT SUM(product_cost) AS cost
FROM product_dimension;
cost
---------
9042850
(1 row)

SUM_FLOAT
Computes the sum of an expression over a group of rows. It returns a DOUBLE PRECISION
value for the expression, regardless of the expression type.

Syntax
SUM_FLOAT ( [ ALL | DISTINCT ] expression )

Parameters
ALL Invokes the aggregate function for all rows in the group (default)
DISTINCT Invokes the aggregate function for all distinct non-null values of the expression
found in the group
expression (INTEGER, BIGINT, DOUBLE PRECISION, or INTERVAL) contains at least
one column reference (see "Column References" on page 74)

Example
SELECT SUM_FLOAT(average_competitor_price) AS cost
FROM product_dimension;
cost
----------
18181102
(1 row)

VAR_POP
Evaluates the statistical population variance for each member of the group. This is defined as the
sum of squares of the difference of expression from the mean of expression, divided by the
number of rows remaining.

Syntax
VAR_POP ( expression )

-123-
SQL Reference Manual

Parameters
expression (INTEGER, BIGINT, DOUBLE PRECISION, or INTERVAL) contains at least
one column reference (see "Column References" on page 74)

Examples

SELECT VAR_POP(household_id)
FROM customer_dimension;
var_pop
------------------
74847050.0168393
(1 row)

VAR_SAMP
Evaluates the sample variance for each row of the group. This is defined as the sum of squares
of the difference of expression from the mean of expression, divided by the number of rows
remaining minus 1 (one).

Syntax
VAR_SAMP ( expression )

Parameters
expression (INTEGER, BIGINT, DOUBLE PRECISION, or INTERVAL) contains at least
one column reference (see "Column References" on page 74)

Examples

SELECT VAR_SAMP(household_id)
FROM customer_dimension;
var_samp
------------------
74848598.0106764
(1 row)

VARIANCE
The non-standard function VARIANCE is provided for compatibility with other databases. It is
semantically identical to VAR_SAMP.
Evaluates the sample variance for each row of the group. This is defined as the sum of squares
of the difference of expression from the mean of expression, divided by the number of rows
remaining minus 1 (one).

Syntax
VAR_SAMP ( expression )

-124-
SQL Functions

Parameters
expression (INTEGER, BIGINT, DOUBLE PRECISION, or INTERVAL) contains at least
one column reference (see "Column References" on page 74)

Examples

SELECT VAR_SAMP(household_id)
FROM customer_dimension;
var_samp
------------------
74848598.0106764
(1 row)

See Also
VAR_SAMP (page 124)

Analytic Functions
The ANSI SQL 99 standard introduced a set of functionality to handle complex analysis and
reporting. Called Online Analytical Processing (OLAP), these functions provide special syntax to
show, for example, a moving average of retail volume over a discrete period of time. Using a
unique construct called a sliding window, analytic functions differ from aggregate functions
(page 112) in that they return multiple rows for each group.
This group of rows is called a window and is defined by the analytic clause OVER(). Each
record contains a sliding window that determines the range of input rows used to perform
calculations on the current row, and the size of the window is based on either logical intervals
(such as time) or on a physical number of records. Before OLAP, these functions required
elaborate subqueries and many self-joins, which were complex and slow.
Note: Vertica does not currently support window customization using the OVER() clause,
except for FIRST_VALUE/LAST_VALUE (page 128).
Analytic functions:
• Require an OVER() clause.
• Occur only in the SELECT and ORDER BY clauses.
• Are not allowed in the WHERE clause.
• Cannot be nested; that is, they cannot be used in place of an expression; however, analytic
functions can be used in a subquery or in the parent query.

Analytic Function Syntax


analytic_function( [ arguments ] )
OVER
[ PARTITION BY
{ expr [, expr ]...
| ( expr [, expr ]... )

-125-
SQL Reference Manual

}
]
[ ORDER BY
{ expr }
[ ASC | DESC ]
[ NULLS FIRST | NULLS LAST ]
[, { expr }
[ ASC | DESC ]
[ NULLS FIRST | NULLS LAST ]
]...
[ windowing_clause ]
]
Vertica supports the following analytic and reporting functions:
• FIRST_VALUE/LAST_VALUE (page 128)
• LEAD/LAG (page 131)
• RANK/DENSE_RANK (page 135)
• ROW_NUMBER (page 137)
• The OVER() clause with PARTITION BY and ORDER BY

Analytic Syntactic Construct


OVER() Is required for analytic functions and indicates that the
function operates on a query result set. The result set is the
rows that are returned after the FROM, WHERE, GROUP
BY, and HAVING clauses have been evaluated. OVER() can
contain an optional partition clause, an optional ordering
clause and — in the case of some analytic functions — a
windowing clause.
Note: Vertica provides limited windowing support for
FIRST_VALUE/LAST_VALUE (page 128).
partition_clause Divides the rows in the input relation by a given list of
columns (or expressions). If the partition_clause is omitted,
all input rows are treated as a single partition.
expr Is the expression to evaluate; for example, a constant,
column, nonanalytic function, function expression, or
expressions involving any of these.
order_clause Sorts the rows in the partition and generates an ordered set
of rows that is then used as input to the windowing clause (if
present), to the analytical function, or to both.
The analytic order_clause specifies whether data is returned
in ascending or descending order and specifies where null
values appear in the sorted result as either first or last.
ASC | DESC Specifies the ordering sequence as ascending (default) or
descending.

-126-
SQL Functions

NULLS FIRST | LAST Indicates the position of nulls in the ordered sequence as
either first or last. The order makes nulls compare either high
or low with respect to non-null values.
If the sequence is specified as ascending order, ASC NULLS
FIRST implies that nulls are smaller than other non-null
values. ASC NULLS LAST implies that nulls are larger than
non-null values. The opposite is true for descending order.
The default is ASC NULLS LAST and DESC NULLS FIRST.
Note: Analytic functions operate on rows in the order
specified by the function's order_clause. However, the
analytic order_clause does not guarantee the order of the
SQL result. Use the SQL ORDER BY clause to guarantee the
ordering of the final result set.

Performance Optimization for Analytic Sort Computation


Vertica stores data in projections sorted in a specific way. For columns of data type NUMERIC,
INTEGER, DATE, TIME, TIMESTAMP, and INTERVAL, null values are placed at the beginning
of sorted projections (NULLS FIRST), while for columns of data type FLOAT, STRING, and
BOOLEAN, null values are placed at the end (NULLS LAST).
In the following example, Vertica sorts inputs from table t on column x, as specified in the
OVER(ORDER BY) clause. Then it evaluates RANK():
CREATE TABLE t (
x FLOAT,
y FLOAT
);
CREATE PROJECTION t_p (x,y) AS SELECT * FROM t
ORDER BY x,y UNSEGMENTED ALL NODES;
SELECT x, RANK() OVER (ORDER BY x) FROM t;
The projection created for table t is also sorted on column x so Vertica can eliminate the ORDER
BY clause and execute the query quickly. Because column x is a FLOAT data type and the
projection sort order matches the default ordering (ASC + NULLS LAST), Vertica can eliminate
the sort.
Assume, however, that column a had been defined as INTEGER. Vertica cannot eliminate the
sort because the projection sort order for integers (ASC + NULLS FIRST) does not match the
default ordering (ASC + NULLS LAST).
To modify the above query using (x INT), specify the placement of nulls to match default
ordering:
SELECT x, RANK() OVER (ORDER BY x NULLS FIRST) FROM t;
If column x is a STRING data type, the following query would eliminate the sort:
SELECT x, RANK() OVER (ORDER BY x NULLS LAST) FROM t;
Note, however, that omitting NULLS LAST from the (x STRING) query still eliminates the sort
because ASC + NULLS LAST is the default sort specification for both the analytical ORDER BY
clause and for string-related columns in Vertica.

-127-
SQL Reference Manual

So when formulating a SQL query involving analytics computation, if you do not care about the
placement of your null values, of if you know the column or columns contain no null values, you
can carefully formulate the SQL query to eliminate the sort and, thereby, choose the faster
query.

Summary
The analytic ORDER BY clause orders the data used by the analytic function as
either ascending (ASC) or descending (DESC) and specifies where null values
appear in the sorted result as either NULLS FIRST or NULLS LAST.
The SQL ORDER BY clause, on the other hand, specifies only ascending or
descending order.
When using analytic ORDER BY operations, the default sort order is as follows:
• With ASC + NULLS LAST, null values are placed at the end of the sorted
result.
• With DESC + NULLS FIRST, null values are placed at the beginning of the
sorted result.
In Vertica:
• For NUMERIC, INTEGER, DATE, TIME, TIMESTAMP, and INTERVAL-related
columns, null values are placed at the beginning of a sorted projection (NULLS
FIRST).
• For FLOAT, STRING, and BOOLEAN-related columns, null values are placed
at the end of a sorted projection (NULLS LAST).

FIRST_VALUE / LAST_VALUE
FIRST_VALUE returns values from the first row of a window.
LAST_VALUE returns values from the last row of a window.

Syntax
{ FIRST_VALUE | LAST_VALUE }( expr [ IGNORE NULLS ] )
OVER ( [ partition_clause ] [ order_clause ] )

Parameters
expr Is the expression to evaluate; for example, a constant, column,
nonanalytic function, function expression, or expressions involving any
of these.
IGNORE NULLS Returns the first or last non-null value in the set, or NULL if all values
are NULL.

-128-
SQL Functions

OVER() Is required. Indicates that the function operates on a query result set
(the rows that are returned after the FROM, WHERE, GROUP BY, and
HAVING clauses have been evaluated).
partition_clause Divides the rows in the input relation by a given list of columns (or
expressions). If the partition_clause is omitted, all input rows are
treated as a single partition.
order_clause Sorts the rows in the partition and generates an ordered set of rows
that is then used as input to the windowing clause (if present), to the
analytical function, or to both.
The analytic order_clause specifies whether data is returned in
ascending or descending order and specifies where null values appear
in the sorted result as either first or last.

Notes
• The FIRST_VALUE and LAST_VALUE functions allow you to select a table's first and last
value (according to the ORDER BY clause), without having to use a self-join.
These functions are useful when you want to use the first or last value as a baseline in
calculations.
• The FIRST_VALUE function takes the first record from the window, while the LAST_VALUE
function takes the record from the partition after the analytic ORDER BY clause. The
expression is then computed against the first or last record, and results are returned.
• Vertica recommends that you use FIRST_VALUE and LAST_VALUE with the analytic
ORDER BY clause to produce deterministic results.
• Due to default window semantics, LAST_VALUE does not always return the last value of a
partition. Unlike most other aggregate functions (page 112), FIRST_VALUE and
LAST_VALUE can be used only with a window function whose default window is RANGE
BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW. If the windowing clause is
omitted from the analytic clause, LAST_VALUE operates on this default window. Results,
therefore, can seem non-intuitive because the function does not return the bottom of the
current partition. It returns the bottom of the window, which continues to change along with
the current input row being processed.
• If IGNORE NULLs is not specified, the first or last value is returned whether or not it is NULL.

Examples
The following query, which asks for the first value in the partitioned day of week, illustrates the
potential nondeterministic nature of the FIRST_VALUE function:
SELECT calendar_year, date_key, day_of_week, full_date_description,
FIRST_VALUE(full_date_description) OVER
(PARTITION BY calendar_month_number_in_year ORDER BY day_of_week) AS "first_value"
FROM date_dimension
WHERE calendar_year=2003 AND calendar_month_number_in_year=1;

-129-
SQL Reference Manual

The first value returned is January 1, 2003; however, the next time the same query is run, the
first value could be January 24 or January 3, or the 10th or 17th. The reason is because the
analytic ORDER BY column (day_of_week) returns rows that contain ties (multiple Fridays).
These repeated values make the ORDER BY evaluation result nondeterministic, because rows
that contain ties can be ordered in any way, and any one of those rows qualifies as being the first
value of day_of_week.
calendar_year | date_key | day_of_week | full_date_description | first_value
---------------+----------+-------------+-----------------------+------------------
2003 | 31 | Friday | January 31, 2003 | January 31, 2003
2003 | 24 | Friday | January 24, 2003 | January 31, 2003
2003 | 3 | Friday | January 3, 2003 | January 31, 2003
2003 | 10 | Friday | January 10, 2003 | January 31, 2003
2003 | 17 | Friday | January 17, 2003 | January 31, 2003
2003 | 6 | Monday | January 6, 2003 | January 31, 2003
2003 | 27 | Monday | January 27, 2003 | January 31, 2003
2003 | 13 | Monday | January 13, 2003 | January 31, 2003
2003 | 20 | Monday | January 20, 2003 | January 31, 2003
2003 | 11 | Saturday | January 11, 2003 | January 31, 2003
2003 | 18 | Saturday | January 18, 2003 | January 31, 2003
2003 | 25 | Saturday | January 25, 2003 | January 31, 2003
2003 | 4 | Saturday | January 4, 2003 | January 31, 2003
2003 | 12 | Sunday | January 12, 2003 | January 31, 2003
2003 | 26 | Sunday | January 26, 2003 | January 31, 2003
2003 | 5 | Sunday | January 5, 2003 | January 31, 2003
2003 | 19 | Sunday | January 19, 2003 | January 31, 2003
2003 | 23 | Thursday | January 23, 2003 | January 31, 2003
2003 | 2 | Thursday | January 2, 2003 | January 31, 2003
2003 | 9 | Thursday | January 9, 2003 | January 31, 2003
2003 | 16 | Thursday | January 16, 2003 | January 31, 2003
2003 | 30 | Thursday | January 30, 2003 | January 31, 2003
2003 | 21 | Tuesday | January 21, 2003 | January 31, 2003
2003 | 14 | Tuesday | January 14, 2003 | January 31, 2003
2003 | 7 | Tuesday | January 7, 2003 | January 31, 2003
2003 | 28 | Tuesday | January 28, 2003 | January 31, 2003
2003 | 22 | Wednesday | January 22, 2003 | January 31, 2003
2003 | 29 | Wednesday | January 29, 2003 | January 31, 2003
2003 | 15 | Wednesday | January 15, 2003 | January 31, 2003
2003 | 1 | Wednesday | January 1, 2003 | January 31, 2003
2003 | 8 | Wednesday | January 8, 2003 | January 31, 2003
(31 rows)

Note: The day_of_week results are returned in alphabetical order because of lexical rules.
The fact that each day does not appear ordered by the 7-day week cycle (for example,
starting with Sunday followed by Monday, Tuesday, and so on) has no affect on results.
To return deterministic results, modify the query so that it performs its analytic ORDER BY
operations on a unique field, such as date_key:
SELECT calendar_year, date_key, day_of_week, full_date_description,
FIRST_VALUE(full_date_description) OVER
(PARTITION BY calendar_month_number_in_year ORDER BY date_key) AS "first_value"
FROM date_dimension
WHERE calendar_year=2003;

Notice that the results return a first value of January 1 for the January partition and the first value
of February 1 for the February partition. Also, there are no ties in the
full_date_description column:
calendar_year | date_key | day_of_week | full_date_description | first_value
---------------+----------+-------------+-----------------------+-------------------

-130-
SQL Functions

2003 | 1 | Wednesday | January 1, 2003 | January 1, 2003


2003 | 2 | Thursday | January 2, 2003 | January 1, 2003
2003 | 3 | Friday | January 3, 2003 | January 1, 2003
2003 | 4 | Saturday | January 4, 2003 | January 1, 2003
2003 | 5 | Sunday | January 5, 2003 | January 1, 2003
2003 | 6 | Monday | January 6, 2003 | January 1, 2003
2003 | 7 | Tuesday | January 7, 2003 | January 1, 2003
2003 | 8 | Wednesday | January 8, 2003 | January 1, 2003
2003 | 9 | Thursday | January 9, 2003 | January 1, 2003
2003 | 10 | Friday | January 10, 2003 | January 1, 2003
2003 | 11 | Saturday | January 11, 2003 | January 1, 2003
2003 | 12 | Sunday | January 12, 2003 | January 1, 2003
2003 | 13 | Monday | January 13, 2003 | January 1, 2003
2003 | 14 | Tuesday | January 14, 2003 | January 1, 2003
2003 | 15 | Wednesday | January 15, 2003 | January 1, 2003
2003 | 16 | Thursday | January 16, 2003 | January 1, 2003
2003 | 17 | Friday | January 17, 2003 | January 1, 2003
2003 | 18 | Saturday | January 18, 2003 | January 1, 2003
2003 | 19 | Sunday | January 19, 2003 | January 1, 2003
2003 | 20 | Monday | January 20, 2003 | January 1, 2003
2003 | 21 | Tuesday | January 21, 2003 | January 1, 2003
2003 | 22 | Wednesday | January 22, 2003 | January 1, 2003
2003 | 23 | Thursday | January 23, 2003 | January 1, 2003
2003 | 24 | Friday | January 24, 2003 | January 1, 2003
2003 | 25 | Saturday | January 25, 2003 | January 1, 2003
2003 | 26 | Sunday | January 26, 2003 | January 1, 2003
2003 | 27 | Monday | January 27, 2003 | January 1, 2003
2003 | 28 | Tuesday | January 28, 2003 | January 1, 2003
2003 | 29 | Wednesday | January 29, 2003 | January 1, 2003
2003 | 30 | Thursday | January 30, 2003 | January 1, 2003
2003 | 31 | Friday | January 31, 2003 | January 1, 2003
2003 | 32 | Saturday | February 1, 2003 | February 1, 2003
2003 | 33 | Sunday | February 2, 2003 | February 1, 2003
...
(365 rows)

See Also
TIME_SLICE (page 161)

LEAD / LAG
LEAD and LAG return values from the row after and before the current row, respectively. These
functions allow you to access more than one row in a table at the same time and are useful for
comparing values when the relative positions of rows can be reliably known.
LEAD provides access to a row at a given offset after the current row.
LAG provides access to a row at a given offset before the current row.

Syntax
{ LEAD | LAG } ( expr [, offset] [, default] )
OVER ( [ partition_clause ] order_clause [ ASC | DESC ] [NULLS FIRST | LAST ]
)

Parameters
expr Is the expression to evaluate; for example, a constant, column,
nonanalytic function, function expression, or expressions involving
any of these.

-131-
SQL Reference Manual

offset Is an optional parameter that defaults to 1. The offset parameter


must be (or can be evaluated to) a constant positive integer.
default Is NULL. This optional parameter is the value returned if offset falls
outside the bounds of the table or partition.
Note: The third input argument must be a constant value or an
expression that can be evaluated to a constant; its data type should
be coercible to that of the first argument.
OVER() Is required. Indicates that the function operates on a query result set
(the rows that are returned after the FROM, WHERE, GROUP BY,
and HAVING clauses have been evaluated).
partition_clause Divides the rows in the input relation by a given list of columns (or
expressions). If the partition_clause is omitted, all input rows are
treated as a single partition.
order_clause Sorts the rows in the partition and generates an ordered set of rows
that is then used as input to the windowing clause (if present), to the
analytical function, or to both.
The analytic order_clause specifies whether data is returned in
ascending or descending order and specifies where null values
appear in the sorted result as either first or last.
ASC | DESC Specifies the ordering sequence as ascending (default) or
descending.
NULLS FIRST | LAST Indicates the position of nulls in the ordered sequence as either first
or last. The order makes nulls compare either high or low with
respect to non-null values.
If the sequence is specified as ascending order, ASC NULLS FIRST
implies that nulls are smaller than other non-null values. ASC
NULLS LAST implies that nulls are larger than non-null values. The
opposite is true for descending order.
The default is ASC NULLS LAST and DESC NULLS FIRST.
Note: Analytic functions operate on rows in the order specified by
the function's order_clause. However, the analytic order_clause
does not guarantee the order of the SQL result. Use the SQL
ORDER BY clause to guarantee the ordering of the final result set.

Notes
• Because you can use LEAD and LAG to access more than one row in a table at the same
time, you can avoid using the more costly self join, thereby enhancing query processing
speed.
• Analytic functions, such as LAG and LEAD, cannot be nested within aggregate functions.

Examples
Say you want to sum the current balance by date in a table and also sum the previous balance
from the last day. Given the inputs that follow, the data should satisfy the following conditions:

-132-
SQL Functions

• For each some_id, there is exactly 1 row for each date represented by month_date.
• For each some_id, the set of dates is consecutive; that is, if there is a row for February 24
and a row for February 26, there should also be a row for February 25.
• Each some_id has the same set of dates.
CREATE TABLE balances (
month_date DATE,
current_bal INT,
some_id INT
);
CREATE PROJECTION bal_p1 (
month_date,
current_bal,
some_id)
AS SELECT * FROM balances UNSEGMENTED ALL NODES;
INSERT INTO balances values ('2009-02-24', 10, 1);
INSERT INTO balances values ('2009-02-25', 10, 1);
INSERT INTO balances values ('2009-02-26', 10, 1);
INSERT INTO balances values ('2009-02-24', 20, 2);
INSERT INTO balances values ('2009-02-25', 20, 2);
INSERT INTO balances values ('2009-02-26', 20, 2);
INSERT INTO balances values ('2009-02-24', 30, 3);
INSERT INTO balances values ('2009-02-25', 20, 3);
INSERT INTO balances values ('2009-02-26', 30, 3);
Now execute the LAG() function to sum the current balance for each date and sum the previous
balance from the last day:
SELECT month_date,
SUM(current_bal) as current_bal_sum,
SUM(previous_bal) as previous_bal_sum FROM
(SELECT month_date, current_bal,
LAG(current_bal, 1, 0) OVER
(PARTITION BY some_id ORDER BY month_date)
AS previous_bal FROM balances) AS subQ
GROUP BY month_date
ORDER BY month_date;
month_date | current_bal_sum | previous_bal_sum
------------+-----------------+------------------
2009-02-24 | 60 | 0
2009-02-25 | 50 | 60
2009-02-26 | 60 | 50
(3 rows)
Using the same example data, the following query would not be allowed because LAG is nested
inside an aggregate function:
SELECT month_date,
SUM(current_bal) as current_bal_sum,
SUM(LAG(current_bal,1,0) OVER (PARTITION BY some_id ORDER BY month_date)) AS previous_bal_sum
FROM some_table
GROUP BY month_date
ORDER BY month_date;

-133-
SQL Reference Manual

In the next example, the LAG function first returns the annual income from the previous row, and
then it calculates the difference between the income in the current row from the income in the
previous row. Note: The vmart example database returns over 50,000 rows, so we'll limit the
results to 20 records:
SELECT occupation, customer_key, customer_name, annual_income,
LAG(annual_income, 1, 0) OVER (PARTITION BY occupation ORDER BY annual_income) AS prev_income,
annual_income -
LAG(annual_income, 1, 0) OVER (PARTITION BY occupation ORDER BY annual_income) AS difference
FROM customer_dimension
ORDER BY occupation, customer_key
LIMIT 20;

occupation | customer_key | customer_name | annual_income | prev_income | difference


------------+--------------+----------------------+---------------+-------------+------------
Accountant | 15 | Midori V. Peterson | 692610 | 692535 | 75
Accountant | 43 | Midori S. Rodriguez | 282359 | 280976 | 1383
Accountant | 93 | Robert P. Campbell | 471722 | 471355 | 367
Accountant | 102 | Sam T. McNulty | 901636 | 901561 | 75
Accountant | 134 | Martha B. Overstreet | 705146 | 704335 | 811
Accountant | 165 | James C. Kramer | 376841 | 376474 | 367
Accountant | 225 | Ben W. Farmer | 70574 | 70449 | 125
Accountant | 270 | Jessica S. Lang | 684204 | 682274 | 1930
Accountant | 273 | Mark X. Lampert | 723294 | 722737 | 557
Accountant | 295 | Sharon K. Gauthier | 29033 | 28412 | 621
Accountant | 338 | Anna S. Jackson | 816858 | 815557 | 1301
Accountant | 377 | William I. Jones | 915149 | 914872 | 277
Accountant | 438 | Joanna A. McCabe | 147396 | 144482 | 2914
Accountant | 452 | Kim P. Brown | 126023 | 124797 | 1226
Accountant | 467 | Meghan K. Carcetti | 810528 | 810284 | 244
Accountant | 478 | Tanya E. Greenwood | 639649 | 639029 | 620
Accountant | 511 | Midori P. Vogel | 187246 | 185539 | 1707
Accountant | 525 | Alexander K. Moore | 677433 | 677050 | 383
Accountant | 550 | Sam P. Reyes | 735691 | 735355 | 336
Accountant | 577 | Robert U. Vu | 616101 | 615439 | 662
(20 rows)

In this example, the LEAD function finds the hire date of the employee hired just after the current
row:
SELECT employee_region, hire_date, employee_key, employee_last_name,
LEAD(hire_date, 1) OVER (PARTITION BY employee_region ORDER BY hire_date) AS "next_hired"
FROM employee_dimension
ORDER BY employee_region, hire_date, employee_key;

employee_region | hire_date | employee_key | employee_last_name | next_hired


-------------------+------------+--------------+--------------------+---------
---
East | 1956-04-08 | 9218 | Harris | 1957-02-06
East | 1957-02-06 | 7799 | Stein | 1957-05-25
East | 1957-05-25 | 3687 | Farmer | 1957-06-26
East | 1957-06-26 | 9474 | Bauer | 1957-08-18
East | 1957-08-18 | 570 | Jefferson | 1957-08-24
East | 1957-08-24 | 4363 | Wilson | 1958-02-17
East | 1958-02-17 | 6457 | McCabe | 1958-06-26
East | 1958-06-26 | 6196 | Li | 1958-07-16
East | 1958-07-16 | 7749 | Harris | 1958-09-18
East | 1958-09-18 | 9678 | Sanchez | 1958-11-10
(10 rows)
The next example uses both LEAD and LAG to return the third row after the salary in the current
row and fifth salary before the salary in the current row.

-134-
SQL Functions

SELECT hire_date, employee_key, employee_last_name,


LEAD(hire_date, 1) OVER (ORDER BY hire_date) AS "next_hired" ,
LAG(hire_date, 1) OVER (ORDER BY hire_date) AS "last_hired"
FROM employee_dimension
ORDER BY hire_date, employee_key;

hire_date | employee_key | employee_last_name | next_hired | last_hired


------------+--------------+--------------------+------------+------------
1956-04-11 | 2694 | Farmer | 1956-05-12 |
1956-05-12 | 5486 | Winkler | 1956-09-18 | 1956-04-11
1956-09-18 | 5525 | McCabe | 1957-01-15 | 1956-05-12
1957-01-15 | 560 | Greenwood | 1957-02-06 | 1956-09-18
1957-02-06 | 9781 | Bauer | 1957-05-25 | 1957-01-15
1957-05-25 | 9506 | Webber | 1957-07-04 | 1957-02-06
1957-07-04 | 6723 | Kramer | 1957-07-07 | 1957-05-25
1957-07-07 | 5827 | Garnett | 1957-11-11 | 1957-07-04
1957-11-11 | 373 | Reyes | 1957-11-21 | 1957-07-07
1957-11-21 | 3874 | Martin | 1958-02-06 | 1957-11-11
(10 rows)

The following example specifies arguments that use different data types; for example
annual_income(INT) and occupation(VARCHAR). The query returns an error:
SELECT customer_key, customer_name, occupation, annual_income,
LAG (annual_income, 1, occupation) OVER (PARTITION BY occupation ORDER BY
customer_key) LAG1
FROM customer_dimension
ORDER BY 3, 1;
ERROR: Third argument of lag could not be converted from type character varying
to type int8
HINT: You may need to add explicit type cast.

RANK / DENSE_RANK
RANK computes the rank of a value in a group of values. The return type is NUMBER. Rank
values are skipped in the event of a tie — when more than one row has the same rank.
DENSE_RANK computes the rank of a row in an ordered group of rows and returns the rank as
a NUMBER. The ranks are consecutive integers beginning with 1. The largest rank value is the
number of unique values returned by the query. No rows are skipped if more than one row has
same rank.

Syntax
{ RANK | DENSE_RANK } ( )
OVER ( [ partition_clause ] order_clause [ ASC | DESC ] [NULLS FIRST | LAST ]
)

Parameters
OVER() Is required. Indicates that the function operates on a query result
set (the rows that are returned after the FROM, WHERE, GROUP
BY, and HAVING clauses have been evaluated).
In ranking functions, this clause specifies the measures expr on
which ranking is done and defines the order in which rows are
sorted in each group (or partition). Once the data is sorted within
each partition, ranks are given to each row starting from 1.

-135-
SQL Reference Manual

partition_clause Divides the rows in the input relation by a given list of columns (or
expressions). If the partition_clause is omitted, all input rows are
treated as a single partition.
order_clause Sorts the rows in the partition and generates an ordered set of rows
that is then used as input to the windowing clause (if present), to
the analytical function, or to both.
The analytic order_clause specifies whether data is returned in
ascending or descending order and specifies where null values
appear in the sorted result as either first or last.
ASC | DESC Specifies the ordering sequence as ascending (default) or
descending.
NULLS FIRST | LAST Indicates the position of nulls in the ordered sequence as either first
or last. The order makes nulls compare either high or low with
respect to non-null values.
If the sequence is specified as ascending order, ASC NULLS
FIRST implies that nulls are smaller than other non-null values.
ASC NULLS LAST implies that nulls are larger than non-null
values. The opposite is true for descending order.
The default is ASC NULLS LAST and DESC NULLS FIRST.
Note: Analytic functions operate on rows in the order specified by
the function's order_clause. However, the analytic order_clause
does not guarantee the order of the SQL result. Use the SQL
ORDER BY clause to guarantee the ordering of the final result set.

Notes
• The primary difference between RANK and DENSE_RANK is that RANK leaves gaps when
ranking records; DENSE_RANK leaves no gaps.
For example, if more than one record occupies a particular position (a tie), RANK places all
those records in that position and it places the next record after a gap of the additional
records (it skips one). DENSE_RANK places all the records in that position only — it does not
leave a gap for the next rank.
If there is a tie at the third position with two records having the same value, both RANK and
DENSE_RANK place both the records in the third position only, but RANK has the next
record at the fifth position — leaving a gap of 1 position — while DENSE_RANK places the
next record at the forth position (no gap).
• If you omit NULLS FIRST | LAST, the ordering of the null values depends on the ASC or
DESC arguments. Null values are considered larger than any other values. If the ordering
sequence is ASC, then nulls appear last; nulls appear first otherwise. Nulls are considered
equal to other nulls and, therefore, the order in which nulls are presented is
non-deterministic.

Examples
This example ranks the longest-standing customers in Massachusetts. The query first computes
the customer_since column by region, and then partitions the results by customers with
businesses in MA. Then within each region, the query ranks customers over the age of 70.

-136-
SQL Functions

SELECT customer_type, customer_name,


RANK() OVER (PARTITION BY customer_region ORDER BY customer_since) as rank
FROM customer_dimension
WHERE customer_state = 'MA'
AND customer_age > '70';

customer_type | customer_name | rank


---------------+---------------+------
Company | Virtadata | 1
Company | Evergen | 2
Company | Infocore | 3
Company | Goldtech | 4
Company | Veritech | 5
Company | Inishop | 6
Company | Intracom | 7
Company | Virtacom | 8
Company | Goldcom | 9
Company | Infostar | 10
Company | Golddata | 11
Company | Everdata | 12
Company | Goldcorp | 13
(13 rows)
The following example shows the difference between RANK and DENSE_RANK when ranking
customers by their annual income. Notice that RANK has a tie at 10 and skips 11, while DENSE
RANK leaves no gaps in the ranking sequence:
SELECT customer_name, SUM(annual_income),
RANK () OVER (ORDER BY TO_CHAR(SUM(annual_income),'100000') DESC) rank,
DENSE_RANK () OVER (ORDER BY TO_CHAR(SUM(annual_income),'100000') DESC) dense_rank
FROM customer_dimension
GROUP BY customer_name
LIMIT 15;
customer_name | sum | rank | dense_rank
---------------------+-------+------+------------
Brian M. Garnett | 99838 | 1 | 1
Tanya A. Brown | 99834 | 2 | 2
Tiffany P. Farmer | 99826 | 3 | 3
Jose V. Sanchez | 99673 | 4 | 4
Marcus D. Rodriguez | 99631 | 5 | 5
Alexander T. Nguyen | 99604 | 6 | 6
Sarah G. Lewis | 99556 | 7 | 7
Ruth Q. Vu | 99542 | 8 | 8
Theodore T. Farmer | 99532 | 9 | 9
Daniel P. Li | 99497 | 10 | 10
Seth E. Brown | 99497 | 10 | 10
Matt X. Gauthier | 99402 | 12 | 11
Rebecca W. Lewis | 99296 | 13 | 12
Dean L. Wilson | 99276 | 14 | 13
Tiffany A. Smith | 99257 | 15 | 14
(15 rows)

ROW_NUMBER
Assigns a unique number, sequentially, starting from 1, to each row within a partition.

-137-
SQL Reference Manual

Syntax
ROW_NUMBER( )
OVER ( [ partition_clause ] [ order_clause ] [ ASC | DESC ] [NULLS FIRST | LAST
] ] )

Parameters
OVER() Is required. Indicates that the function operates on a query result
set (the rows that are returned after the FROM, WHERE, GROUP
BY, and HAVING clauses have been evaluated).
partition_clause Divides the rows in the input relation by a given list of columns (or
expressions). If the partition_clause is omitted, all input rows are
treated as a single partition.
order_clause Sorts the rows in the partition and generates an ordered set of
rows that is then used as input to the windowing clause (if
present), to the analytical function, or to both.Results are defined
by order_clause, if provided.
ASC | DESC Specifies the ordering sequence as ascending (default) or
descending.
NULLS FIRST | LAST Indicates the position of nulls in the ordered sequence as either
first or last. The order makes nulls compare either high or low
with respect to non-null values.
If the sequence is specified as ascending order, ASC NULLS
FIRST implies that nulls are smaller than other non-null values.
ASC NULLS LAST implies that nulls are larger than non-null
values. The opposite is true for descending order.
The default is ASC NULLS LAST and DESC NULLS FIRST.
Note: Analytic functions operate on rows in the order specified
by the function's order_clause. However, the analytic
order_clause does not guarantee the order of the SQL result.
Use the SQL ORDER BY clause to guarantee the ordering of the
final result set.

Notes
• You can use the optional partition clause to group data into partitions before operating on it;
for example:
SUM OVER (PARTITION BY col1, col2, ...)
• The OVER() clause is required, even if it does not contain the optional order or partition
clauses.
• You can substitute any RANK example for ROW_NUMBER. The difference is that
ROW_NUMBER assigns a unique ordinal number, starting with 1, to each row in the ordered
set.

Examples
The following query first partitions customers in the customer_dimension table by occupation and
then ranks those customers based on the ordered set specified by the analytic partition_clause.

-138-
SQL Functions

SELECT occupation, customer_key, customer_since, annual_income,


ROW_NUMBER() OVER (PARTITION BY occupation) AS customer_since_row_num
FROM public.customer_dimension
ORDER BY occupation, customer_since_row_num;
occupation | customer_key | customer_since | annual_income | customer_since_row_num
--------------------+--------------+----------------+---------------+------------------------
Accountant | 19453 | 1973-11-06 | 602460 | 1
Accountant | 42989 | 1967-07-09 | 850814 | 2
Accountant | 24587 | 1995-05-18 | 180295 | 3
Accountant | 26421 | 2001-10-08 | 126490 | 4
Accountant | 37783 | 1993-03-16 | 790282 | 5
Accountant | 39170 | 1980-12-21 | 823917 | 6
Banker | 13882 | 1998-04-10 | 15134 | 1
Banker | 14054 | 1989-03-16 | 961850 | 2
Banker | 15850 | 1996-01-19 | 262267 | 3
Banker | 29611 | 2004-07-14 | 739016 | 4
Doctor | 261 | 1969-05-11 | 933692 | 1
Doctor | 1264 | 1981-07-19 | 593656 | 2
Psychologist | 5189 | 1999-05-04 | 397431 | 1
Psychologist | 5729 | 1965-03-26 | 339319 | 2
Software Developer | 2513 | 1996-09-22 | 920003 | 1
Software Developer | 5927 | 2001-03-12 | 633294 | 2
Software Developer | 9125 | 1971-10-06 | 198953 | 3
Software Developer | 16097 | 1968-09-02 | 748371 | 4
Software Developer | 23137 | 1988-12-07 | 92578 | 5
Software Developer | 24495 | 1989-04-16 | 149371 | 6
Software Developer | 24548 | 1994-09-21 | 743788 | 7
Software Developer | 33744 | 2005-12-07 | 735003 | 8
Software Developer | 9684 | 1970-05-20 | 246000 | 9
Software Developer | 24278 | 2001-11-14 | 122882 | 10
Software Developer | 27122 | 1994-02-05 | 810044 | 11
Stock Broker | 5950 | 1965-01-20 | 752120 | 1
Stock Broker | 12517 | 2003-06-13 | 380102 | 2
Stock Broker | 33010 | 1984-05-07 | 384463 | 3
Stock Broker | 46196 | 1972-11-28 | 497049 | 4
Stock Broker | 8710 | 2005-02-11 | 79387 | 5
Writer | 3149 | 1998-11-17 | 643972 | 1
Writer | 17124 | 1965-01-18 | 444747 | 2
Writer | 20100 | 1994-08-13 | 106097 | 3
Writer | 23317 | 2003-05-27 | 511750 | 4
Writer | 42845 | 1967-10-23 | 433483 | 5
Writer | 47560 | 1997-04-23 | 515647 | 6
(39 rows)

Date/Time Functions
Date and time functions perform conversion, extraction, or manipulation operations on date and
time data types and can return date and time information.

Usage
Functions that take TIME or TIMESTAMP inputs come in two variants:
• TIME WITH TIME ZONE or TIMESTAMP WITH TIME ZONE
• TIME WITHOUT TIME ZONE or TIMESTAMP WITHOUT TIME ZONE
For brevity, these variants are not shown separately.
The + and * operators come in commutative pairs; for example, both DATE + INTEGER and
INTEGER + DATE. We show only one of each such pair.

-139-
SQL Reference Manual

Daylight Savings Time Considerations


When adding an INTERVAL value to (or subtracting an INTERVAL value from) a TIMESTAMP
WITH TIME ZONE value, the days component advances (or decrements) the date of the
TIMESTAMP WITH TIME ZONE by the indicated number of days. Across daylight saving time
changes (with the session time zone set to a time zone that recognizes DST), this means
INTERVAL '1 day' does not necessarily equal INTERVAL '24 hours'.
For example, with the session time zone set to CST7CDT:
TIMESTAMP WITH TIME ZONE '2005-04-02 12:00-07' + INTERVAL '1 day'
produces
TIMESTAMP WITH TIME ZONE '2005-04-03 12:00-06'
Adding INTERVAL '24 hours' to the same initial TIMESTAMP WITH TIME ZONE produces
TIMESTAMP WITH TIME ZONE '2005-04-03 13:00-06',
as there is a change in daylight saving time at 2005-04-03 02:00 in time zone CST7CDT.

Date/Time Functions in Transactions


CURRENT_TIMESTAMP and related functions return the start time of the current transaction; their
values do not change during the transaction. The intent is to allow a single transaction to have a
consistent notion of the "current" time, so that multiple modifications within the same transaction
bear the same timestamp. However, TIMEOFDAY() returns the wall-clock time and advances
during transactions.

See Also
Template Patterns for Date/Time Formatting (page 171)

ADD_MONTHS
Takes a DATE, TIMESTAMP, or TIMESTAMPTZ argument and a number of months and returns
a date. TIMESTAMPTZ arguments are implicitly cast to TIMESTAMP.

Syntax
SELECT ADD_MONTHS( d , n );

Parameters
d Is the incoming DATE, TIMESTAMP, or TIMESTAMPZ. If the start
date falls on the last day of the month, or if the resulting month has
fewer days than the given day of the month, then the result is the
last day of the resulting month. Otherwise, the result has the same
start day.
n Can be any integer.

-140-
SQL Functions

Notes
ADD_MONTHS() is an invariant function if called with DATE or TIMESTAMP but is stable with
TIMESTAMPTZ in that its results can change based on TIME ZONE settings.

Examples
The following example's results include a leap year:
SELECT ADD_MONTHS('31-Jan-08', 1) "Months";
Months
------------
2008-02-29
(1 row)
The next example adds four months to January and returns a date in May:
SELECT ADD_MONTHS('31-Jan-08', 4) "Months";
Months
------------
2008-05-31
(1 row)
This example subtracts 4 months from January, returning a date in September:
SELECT ADD_MONTHS('31-Jan-08', -4) "Months";
Months
------------
2007-09-30
(1 row)
Because the following example specifies NULL, the result set is empty:
SELECT ADD_MONTHS('31-Jan-03', NULL) "Months";
Months
--------

(1 row)
This example provides no date argument, so even though the number of months specified is 1,
the result set is empty:
SELECT ADD_MONTHS(NULL, 1) "Months";
Months
--------

(1 row)
In this example, the date field defaults to a timestamp, so the PST is ignored. Notice that even
though it is already the next day in Pacific time, the result falls on the same date in New York
(two years later):
SET TIME ZONE 'America/New_York';
SELECT ADD_MONTHS('2008-02-29 23:30 PST', 24);
add_months
------------
2010-02-28
(1 row)

-141-
SQL Reference Manual

This example specifies a timestamp with time zone, so the PST is taken into account:
SET TIME ZONE 'America/New_York';
SELECT ADD_MONTHS('2008-02-29 23:30 PST'::TIMESTAMPTZ, 24);
add_months
------------
2010-03-01
(1 row)

AGE
Returns an INTERVAL value representing the difference between two TIMESTAMP values.

Syntax
AGE ( expression1 [ , expression2 ] )

Parameters
expression1 (TIMESTAMP) specifies the beginning of the INTERVAL
expression2 (TIMESTAMP) specifies the end of the INTERVAL. The default is
the CURRENT_DATE (page 143)

Examples
The following example returns the age of a person born on March 2, 1972 on the date June 21,
1990, with a time elapse of 18 years, 3 months, and 19 days:
SELECT AGE(TIMESTAMP '1990-06-21', TIMESTAMP '1972-03-02');
age
-------------------------
18 years 3 mons 19 days
(1 row)
The next example shows the age of the same person (born March 2, 1972) as of February 24,
2008:
SELECT AGE(TIMESTAMP '2009-02-24', TIMESTAMP '1972-03-02');
age
--------------------------
36 years 11 mons 22 days
(1 row)
This example returns the age of a person born on November 21, 1939:
SELECT AGE(TIMESTAMP '1939-11-21');
age
------------------------
69 years 3 mons 3 days
(1 row)

-142-
SQL Functions

CLOCK_TIMESTAMP
Returns a value of type TIMESTAMP WITH TIME ZONE representing the current system-clock
time.

Syntax
CLOCK_TIMESTAMP()

Notes
This function uses the date and time supplied by the operating system on the server to which
you are connected, which should be the same across all servers. The value changes each time
you call it.

Examples
The following command returns the current time on your system:
SELECT CLOCK_TIMESTAMP() "Current Time";
Current Time
-------------------------------
2009-02-24 12:35:39.697199-05
(1 row)
Each time you call the function, you get a different result. The difference in this example is in
microseconds:
SELECT CLOCK_TIMESTAMP() "Time 1", CLOCK_TIMESTAMP() "Time 2";
Time 1 | Time 2
-------------------------------+-------------------------------
2009-03-05 14:42:58.809879-05 | 2009-03-05 14:42:58.80988-05
(1 row)

See Also
STATEMENT_TIMESTAMP (page 160) and TRANSACTION_TIMESTAMP (page 165)

CURRENT_DATE
Returns the date (date-type value) on which the current transaction started.

Syntax
CURRENT_DATE

Notes
The CURRENT_DATE function does not require parentheses.

Examples
SELECT CURRENT_DATE;
date
------------
2009-02-23
(1 row)

-143-
SQL Reference Manual

CURRENT_TIME
Returns a value of type TIME WITH TIME ZONE representing the time of day.

Syntax
CURRENT_TIME [ ( precision ) ]

Parameters
precision (INTEGER) causes the result to be rounded to the specified
number of fractional digits in the seconds field.

Notes
This function returns the start time of the current transaction; the value does not change during
the transaction. The intent is to allow a single transaction to have a consistent notion of the
current time, so that multiple modifications within the same transaction bear the same timestamp.

Examples
SELECT CURRENT_TIME "Current Time";
Current Time
--------------------
12:45:12.186089-05
(1 row)

CURRENT_TIMESTAMP
Returns a value of type TIMESTAMP WITH TIME ZONE representing the start of the current
transaction.

Syntax
CURRENT_TIMESTAMP [ ( precision ) ]

Parameters
precision (INTEGER) causes the result to be rounded to the specified
number of fractional digits in the seconds field. Range of INTEGER
is 0-6.

Notes
This function returns the start time of the current transaction; the value does not change during
the transaction. The intent is to allow a single transaction to have a consistent notion of the
"current" time, so that multiple modifications within the same transaction bear the same
timestamp.

Examples
SELECT CURRENT_TIMESTAMP;
timestamptz
-------------------------------
2009-02-24 12:49:33.649951-05

-144-
SQL Functions

(1 row)
SELECT CURRENT_TIMESTAMP(2);
timestamptz
---------------------------
2009-03-16 15:41:46.72-04
(1 row)

DATE_PART
Is modeled on the traditional Ingres equivalent to the SQL-standard function EXTRACT.

Syntax
DATE_PART( field , source )

Parameters
field Is a single-quoted string value that specifies the field to extract.
Note: The field parameter values are the same as EXTRACT (page 153).
source Is a date/time (page 96) expression

Examples
The following example extracts the day value from the input parameters:
SELECT DATE_PART('day', TIMESTAMP '2009-02-24 20:38:40') "Day";
Day
-----
24
(1 row)
The following example extracts the month value from the input parameters:
SELECT DATE_PART('month', TIMESTAMP '2009-02-24 20:38:40') "Month";
Month
-------
2
(1 row)
The following example extracts the year value from the input parameters:
SELECT DATE_PART('year', TIMESTAMP '2009-02-24 20:38:40') "Year";
Year
------
2009
(1 row)
The following example extracts the hours from the input parameters:
SELECT DATE_PART('hour', TIMESTAMP '2009-02-24 20:38:40') "Hour";
Hour
------
20
(1 row)

-145-
SQL Reference Manual

The following example extracts the minutes from the input parameters:
SELECT DATE_PART('minutes', TIMESTAMP '2009-02-24 20:38:40') "Minutes";
Minutes
---------
38
(1 row)
The following example extracts the minutes from the input parameters:
SELECT DATE_PART('seconds', TIMESTAMP '2009-02-24 20:38:40') "Seconds";
Seconds
---------
40
(1 row)

SELECT DATE_PART('day', INTERVAL '29 days 23 hours');


date_part
-----------
29
(1 row)
Notice what happens to the above query if you add an hour:
SELECT DATE_PART('day', INTERVAL '29 days 24 hours');
date_part
-----------
0
(1 row)
Similarly, the following example returns 0 because an interval in hours is up to 24 only:
SELECT DATE_PART('hour', INTERVAL '24 hours 45 minutes');
date_part
-----------
0
(1 row)

DATE_TRUNC
Is conceptually similar to the TRUNC (page 188) function for numbers. The return value is of
type TIMESTAMP or INTERVAL with all fields that are less significant than the selected one set
to zero (or one, for day and month).

Syntax
DATE_TRUNC( field , source )

Parameters
field Is a string constant that selects the precision to which truncate the input value.
Valid values for field are:
century milliseconds
day minute

-146-
SQL Functions

decade month
hour second
microsecond week
s
millennium year
source Is a value expression of type TIMESTAMP or INTERVAL.
Values of type DATE and TIME are cast automatically, to TIMESTAMP or
INTERVAL, respectively.

Examples
The following example returns the hour and truncates the minutes and seconds:
SELECT DATE_TRUNC('hour', TIMESTAMP '2009-02-24 13:38:40') AS hour;
hour
---------------------
2009-02-24 13:00:00
(1 row)
The following example returns the year and defaults month and day to January 1, truncating the
rest of the string:
SELECT DATE_TRUNC('year', TIMESTAMP '2009-02-24 13:38:40') AS year;
year
---------------------
2009-01-01 00:00:00
(1 row)
The following example returns the year and month and defaults day of month to 1, truncating the
rest of the string:
SELECT DATE_TRUNC('month', TIMESTAMP '2009-02-24 13:38:40') AS year;
year
---------------------
2009-02-01 00:00:00
(1 row)

DATEDIFF
Returns the difference between two date or time values, based on the specified start and end
arguments.

Syntax 1
SELECT DATEDIFF ( datepart , startdate , enddate );

Syntax 2
SELECT DATEDIFF ( datepart , starttime , endtime );

-147-
SQL Reference Manual

Parameters
datepart Returns the number of specified datepart boundaries between the
specified startdate and enddate.
Can be an unquoted identifier, a quoted string, or an expression in
parentheses, which evaluates to the datepart as a character string.
The following table lists the valid datepart arguments.

datepart abbreviation
--------- ------------
year yy, yyyy
quarter qq, q
month mm, m
day dd, d, dy, dayofyear, y
week wk, ww
hour hh
minute mi, n
second ss, s
millisecond ms
microsecond mcs, us
startdate Is the start date for the calculation and is an expression that
returns a TIMESTAMP (page 99), DATE (page 96), or
TIMESTAMPTZ value.
The startdate value is not included in the count.
enddate Is the end date for the calculation and is an expression that returns
a TIMESTAMP (page 99), DATE (page 96), or TIMESTAMPTZ
value.
The enddate value is included in the count.
starttime Is the start time for the calculation and is an expression that
returns an INTERVAL (page 102) or TIME (page 97) data type.
The starttime value is not included in the count.
Year, quarter, or month dateparts are not allowed.
endtime Is the end time for the calculation and is an expression that returns
an INTERVAL (page 102) or TIME (page 97) data type.
The endtime value is included in the count.
Year, quarter, or month dateparts are not allowed.

-148-
SQL Functions

Notes
• DATEDIFF() is an immutable1 function with a default type of TIMESTAMP. It also takes
DATE. If TIMESTAMPTZ is specified, the function is stable.
• Vertica accepts statements written in any of the following forms:
DATEDIFF(year, s, e);
DATEDIFF('year', s, e);
If you use an expression, the expression must be enclosed in parentheses:
DATEDIFF((expr), s, e);
• Starting arguments are not included in the count, but end arguments are included.

The datepart boundaries


DATEDIFF calculates results according to ticks—or boundaries—within the date range or time
range. Results are calculated based on the specified datepart. Let's examine the following
statement and its results:
SELECT DATEDIFF('year', TO_DATE('01-01-2005','MM-DD-YYYY'),
TO_DATE('12-31-2008','MM-DD-YYYY'));
datediff
----------
3
(1 row)
In the above example, we specified a datepart of year, a startdate of January 1, 2005 and an
enddate of December 31, 2008. DATEDIFF returns 3 by counting the year intervals as follows:
[1] January 1, 2006 + [2] January 1, 2007 + [3] January 1, 2008 = 3
The function returns 3, and not 4, because startdate (January 1, 2005) is not counted in the
calculation. DATEDIFF also ignores the months between January 1, 2008 and December 31,
2008 because the datepart specified is year and only the start of each year is counted.
Sometimes the enddate occurs earlier in the ending year than the startdate in the starting year.
For example, assume a datepart of year, a startdate of August 15, 2005, and an enddate of
January 1, 2009. In this scenario, less than three years have elapsed, but DATEDIFF counts the
same way it did in the previous example, returning 3 because it returns the number of January
1s between the limits:
[1] January 1, 2006 + [2] January 1, 2007 + [3] January 1, 2008 = 3
In the following query, Vertica recognizes the full year 2005 as the starting year and 2009 as the
ending year.
SELECT DATEDIFF('year', TO_DATE('08-15-2005','MM-DD-YYYY'),
TO_DATE('01-01-2009','MM-DD-YYYY'));
The count occurs as follows:
[1] January 1, 2006 + [2] January 1, 2007 + [3] January 1, 2008 + [4] January 1,
2009 = 4

1
Immutable functions return the same answers when provided the same inputs. For example, 2+2 always
equals 4.

-149-
SQL Reference Manual

Even though August 15 has not yet occurred in the enddate, the function counts the entire
enddate year as one tick or boundary because of the year datepart.

Examples
Year: In this example, the startdate and enddate are adjacent. The difference between the dates
is one time boundary (second) of its datepart, so the result set is 1.
SELECT DATEDIFF('year', TIMESTAMP '2008-12-31 23:59:59', '2009-01-01 00:00:00');
datediff
----------
1
(1 row)

Quarters start on January, April, July, and October.


In the following example, the result is 0 because the difference from January to February in the
same calendar year does not span a quarter:
SELECT DATEDIFF('qq', TO_DATE('01-01-1995','MM-DD-YYYY'),
TO_DATE('02-02-1995','MM-DD-YYYY'));
datediff
----------
0
(1 row)
The next example, however, returns 8 quarters because the difference spans two full years. The
extra month is ignored:
SELECT DATEDIFF('quarter', TO_DATE('01-01-1993','MM-DD-YYYY'),
TO_DATE('02-02-1995','MM-DD-YYYY'));
datediff
----------
8
(1 row)
Months are based on real calendar months.
The following statement returns 1 because there is month difference between January and
February in the same calendar year:
SELECT DATEDIFF('mm', TO_DATE('01-01-2005','MM-DD-YYYY'),
TO_DATE('02-02-2005','MM-DD-YYYY'));
datediff
----------
1
(1 row)
The next example returns a negative value of 1:
SELECT DATEDIFF('month', TO_DATE('02-02-1995','MM-DD-YYYY'),
TO_DATE('01-01-1995','MM-DD-YYYY'));
datediff
----------
-1
(1 row)
And this third example returns 23 because there are 23 months difference between

-150-
SQL Functions

SELECT DATEDIFF('m', TO_DATE('02-02-1993','MM-DD-YYYY'),


TO_DATE('01-01-1995','MM-DD-YYYY'));
datediff
----------
23
(1 row)
Weeks start on Sunday at midnight.
The first example returns 0 because, even though the week starts on a Sunday, it is not a full
calendar week:
SELECT DATEDIFF('ww', TO_DATE('02-22-2009','MM-DD-YYYY'),
TO_DATE('02-28-2009','MM-DD-YYYY'));
datediff
----------
0
(1 row)
The following example returns 1 (week); January 1, 2000 fell on a Saturday.
SELECT DATEDIFF('week', TO_DATE('01-01-2000','MM-DD-YYYY'),
TO_DATE('01-02-2000','MM-DD-YYYY'));
datediff
----------
1
(1 row)
In the next example, DATEDIFF() counts the weeks between January 1, 1995 and February 2,
1995 and returns 4 (weeks):
SELECT DATEDIFF('wk', TO_DATE('01-01-1995','MM-DD-YYYY'),
TO_DATE('02-02-1995','MM-DD-YYYY'));
datediff
----------
4
(1 row)
The next example returns a difference of 100 weeks:
SELECT DATEDIFF('ww', TO_DATE('02-02-2006','MM-DD-YYYY'),
TO_DATE('01-01-2008','MM-DD-YYYY'));
datediff
----------
100
(1 row)
Days are based on real calendar days.
The first example returns 31, the full number of days in the month of July 2008.
SELECT DATEDIFF('day', 'July 1, 2008', 'Aug 1, 2008'::date);
datediff
----------
31
(1 row)
Just over two years of days:

-151-
SQL Reference Manual

SELECT DATEDIFF('d', TO_TIMESTAMP('01-01-1993','MM-DD-YYYY'),


TO_TIMESTAMP('02-02-1995','MM-DD-YYYY'));
datediff
----------
762
(1 row)
Hours, minutes, and seconds are based on clock time.
The first example counts backwards from March 2 to February 14 and returns -384 hours:
SELECT DATEDIFF('hour', TO_DATE('03-02-2009','MM-DD-YYYY'),
TO_DATE('02-14-2009','MM-DD-YYYY'));
datediff
----------
-384
(1 row)
Another hours example:
SELECT DATEDIFF('hh', TO_TIMESTAMP('01-01-1993','MM-DD-YYYY'),
TO_TIMESTAMP('02-02-1995','MM-DD-YYYY'));
datediff
----------
18288
(1 row)
This example counts the minutes backwards:
SELECT DATEDIFF('mi', TO_TIMESTAMP('01-01-1993 03:00:45','MM-DD-YYYY HH:MI:SS'),
TO_TIMESTAMP('01-01-1993 01:30:21',' MM-DD-YYYY HH:MI:SS'));
datediff
----------
-90
(1 row)
And this example counts the minutes forward:
SELECT DATEDIFF('minute', TO_DATE('01-01-1993','MM-DD-YYYY'),
TO_DATE('02-02-1995','MM-DD-YYYY'));
datediff
----------
1097280
(1 row)
In the following example, the query counts the difference in seconds, beginning at a start time of
4:44 and ending at 5:55 with an interval of 2 days:
SELECT DATEDIFF('ss', TIME '04:44:42.315786', INTERVAL '2 05:55:52.963558');
datediff
----------
177070
(1 row)

See Also
Date/Time Expressions (page 76)

-152-
SQL Functions

EXTRACT
Retrieves subfields such as year or hour from date/time values and returns values of type
DOUBLE PRECISION (page 105). EXTRACT is primarily intended for computational processing,
rather than formatting date/time values for display.

Syntax
EXTRACT ( field FROM source )

Parameters
field Is an identifier or string that selects what field to extract from the source value.
source Is an expression of type DATE, TIMESTAMP, TIME, or INTERVAL.
Note: Expressions of type DATE are cast to TIMESTAMP.

Examples
SELECT EXTRACT (DAY FROM DATE '2008-12-25');
date_part
-----------
25
(1 row)
SELECT EXTRACT (MONTH FROM DATE '2008-12-25');
date_part
-----------
12
(1 row

GETDATE
Returns the current system date and time as a TIMESTAMP value.

Syntax
SELECT GETDATE();

Notes
• GETDATE is a volatile function that requires parentheses but accepts no arguments.
• This function uses the date and time supplied by the operating system on the server to which
you are connected, which should be the same across all servers.
• GETDATE internally converts CLOCK_TIMESTAMP() from TIMESTAMPTZ to TIMESTAMP.
• This function is identical to SYSDATE (page 161)().

-153-
SQL Reference Manual

Example
SELECT GETDATE();
getdate
----------------------------
2009-02-18 16:39:58.628483
(1 row)

See Also
Date/Time Expressions (page 76)

GETUTCDATE
Returns the current system date and time as a TIMESTAMP value relative to UTC.

Syntax
SELECT GETUTCDATE();

Notes
• GETUTCDATE is a volatile function that requires parentheses but accepts no arguments.
• This function uses the date and time supplied by the operating system on the server to which
you are connected, which should be the same across all servers.
• GETUTCDATE is internally converted to CLOCK_TIMESTAMP (page 143)() at TIME ZONE
'UTC'.

Example
SELECT GETUTCDATE();
getutcdate
----------------------------
2009-02-18 16:39:58.628483
(1 row)

See Also
Date/Time Expressions (page 76)

ISFINITE
Tests for the special TIMESTAMP constant INFINITY and returns a value of type BOOLEAN.

Syntax
ISFINITE( timestamp )

Parameters
timestamp Is an expression of type TIMESTAMP

-154-
SQL Functions

Examples
SELECT ISFINITE(TIMESTAMP '2009-02-16 21:28:30');
isfinite
----------
t
(1 row)
SELECT ISFINITE(TIMESTAMP 'INFINITY');
isfinite
----------
f
(1 row)

LAST_DAY
Returns the last day of the month based on a TIMESTAMP. The TIMESTAMP can be supplied
as a DATE or a TIMESTAMPTZ data type.

Syntax
SELECT LAST_DAY ( date );

Notes
The LAST_DAY() function is invariant unless it is called with a TIMESTAMPTZ, in which case it
is stable.

Examples
The following example returns the last day of the month, February, as 29 because 2008 was a
leap year:
SELECT LAST_DAY('2008-02-28 23:30 PST') "Last";
Last
------------
2008-02-29
(1 row)
The following example returns the last day of the month in March, after converting the string
value to the specified DATE type:
SELECT LAST_DAY('2003/03/15') "Last";
Last
------------
2003-03-31
(1 row)
The following example returns the last day of February in the specified year (not a leap year):
SELECT LAST_DAY('2003/02/03') "Last";
Last
------------
2003-02-28
(1 row)

-155-
SQL Reference Manual

LOCALTIME
Returns a value of type TIME representing the time of day.

Syntax
LOCALTIME [ ( precision ) ]

Parameters
precision Causes the result to be rounded to the specified number of
fractional digits in the seconds field.

Notes
This function returns the start time of the current transaction; the value does not change during
the transaction. The intent is to allow a single transaction to have a consistent notion of the
"current" time, so that multiple modifications within the same transaction bear the same
timestamp.

Examples
SELECT LOCALTIME;
time
-----------------
16:16:06.790771
(1 row)

LOCALTIMESTAMP
Returns a value of type TIMESTAMP representing today's date and time of day.

Syntax
LOCALTIMESTAMP [ ( precision ) ]

Parameters
precision Causes the result to be rounded to the specified number of
fractional digits in the seconds field.

-156-
SQL Functions

Notes
This function returns the start time of the current transaction; the value does not change during
the transaction. The intent is to allow a single transaction to have a consistent notion of the
"current" time, so that multiple modifications within the same transaction bear the same
timestamp.

Examples
SELECT LOCALTIMESTAMP;
timestamp
--------------------------
2009-02-24 14:47:48.5951
(1 row)

MONTHS_BETWEEN
Returns the number of months between date1 and date2 as a FLOAT8.

Syntax
SELECT MONTHS_BETWEEN ( x , y );

Parameters
x , y Takes TIMESTAMP, DATE, or TIMESTAMPTZ arguments.
If date1 is later than date2, then the result is positive. If date1 is
earlier than date2, then the result is negative.
If date1 and date2 are either the same days of the month or both
are the last days of their respective month, then the result is always
an integer. Otherwise MONTHS_BETWEEN returns a FLOAT8
result based on a 31-day month, which considers the difference
between date1 and date2.

Notes
MONTHS_BETWEEN() is invariant for TIMESTAMP and DATE but stable for TIMESTAMPTZ.

Examples
Note the following result is an integral number of days because the dates are on the same day of
the month:
SELECT MONTHS_BETWEEN('2009-03-07 16:00'::TIMESTAMP, '2009-04-07
15:00'::TIMESTAMP);
months_between
----------------
-1
(1 row)
The result from the following example returns an integral number of days because the days fall
on the last day of their respective months:

-157-
SQL Reference Manual

SELECT MONTHS_BETWEEN('29Feb2000', '30Sep2000') "Months";


Months
----------------
-7
(1 row)
In this example, and in the example that immediately follows it, MONTHS_BETWEEN() returns the
number of months between date1 and date2 as a fraction because the days do not fall on the
same day or on the last day of their respective months:
SELECT MONTHS_BETWEEN(TO_DATE('02-02-1995','MM-DD-YYYY'),
TO_DATE('01-01-1995','MM-DD-YYYY') ) "Months";
Months
------------------
1.03225806451613
(1 row)
SELECT MONTHS_BETWEEN(TO_DATE ('2003/01/01', 'yyyy/mm/dd'),
TO_DATE ('2003/03/14', 'yyyy/mm/dd') ) "Months";
Months
-------------------
-2.41935483870968
(1 row)
The following two examples use the same date1 and date2 strings, but they are cast to a
different data types (TIMESTAMP and TIMESTAMPTZ). The result set is the same for both
statements:
SELECT MONTHS_BETWEEN('2008-04-01'::timestamp, '2008-02-29'::timestamp);
months_between
------------------
1.09677419354839
(1 row)
SELECT MONTHS_BETWEEN('2008-04-01'::timestamptz, '2008-02-29'::timestamptz);
months_between
------------------
1.09677419354839
(1 row)
The following two examples show alternate inputs:
SELECT MONTHS_BETWEEN('2008-04-01'::date, '2008-02-29'::timestamp);
months_between
------------------
1.09677419354839
(1 row)
SELECT MONTHS_BETWEEN('2008-02-29'::timestamptz, '2008-04-01'::date);
months_between
-------------------
-1.09677419354839
(1 row)

-158-
SQL Functions

NOW
Is equivalent to CURRENT_TIMESTAMP (page 144) except that it does not accept a precision
parameter. It returns a value of type TIMESTAMP WITH TIME ZONE representing the start of
the current transaction.

Syntax
NOW()

Notes
This function returns the start time of the current transaction; the value does not change during
the transaction. The intent is to allow a single transaction to have a consistent notion of the
"current" time, so that multiple modifications within the same transaction bear the same
timestamp.

Examples
SELECT NOW();
now
------------------------------
2009-02-18 16:34:22.11775-05
(1 row)

See Also
CURRENT_TIMESTAMP (page 144)

OVERLAPS
Returns true when two time periods overlap, false when they do not overlap.

Syntax
( start, end ) OVERLAPS ( start, end )
( start, interval ) OVERLAPS ( start, interval )

Parameters
start Is a DATE, TIME, or TIME STAMP value that specifies the
beginning of a time period.
end Is a DATE, TIME, or TIME STAMP value that specifies the end of
a time period.
interval Is a value that specifies the length of the time period.

Examples
The first command returns true for an overlap in date range of 2007-02-16 – 2007-12-21 with
2007-10-30 – 2008-10-30.

-159-
SQL Reference Manual

SELECT (DATE '2007-02-16', DATE '2007-12-21')


OVERLAPS (DATE '2007-10-30', DATE '2008-10-30');
overlaps
----------
t
(1 row)
The next command returns false for an overlap in date range of 2007-02-16 – 2007-12-21 with
2008-10-30 – 2008-10-30.
SELECT (DATE '2007-02-16', DATE '2007-12-21')
OVERLAPS (DATE '2008-10-30', DATE '2008-10-30');
overlaps
----------
f
(1 row)
The next command returns false for an overlap in date range of 2007-02-16, 22 hours ago with
2007-10-30, 22 hours ago.
SELECT (DATE '2007-02-16', INTERVAL '1 12:59:10')
OVERLAPS (DATE '2007-10-30', INTERVAL '1 12:59:10');
overlaps
----------
f
(1 row)

STATEMENT_TIMESTAMP
Is similar to TRANSACTION_TIMESTAMP (page 165). It returns a value of type TIMESTAMP
WITH TIME ZONE representing the start of the current statement.

Syntax
STATEMENT_TIMESTAMP()

Notes
This function returns the start time of the current statement; the value does not change during
the statement. The intent is to allow a single statement to have a consistent notion of the
"current" time, so that multiple modifications within the same statement bear the same
timestamp.

Examples
SELECT STATEMENT_TIMESTAMP();
statement_timestamp
-------------------------------
2009-04-07 17:21:17.686493-04
(1 row)

See Also
CLOCK_TIMESTAMP (page 143) and TRANSACTION_TIMESTAMP (page 165)

-160-
SQL Functions

SYSDATE
Returns the current system date and time as a TIMESTAMP value.

Syntax
SELECT SYSDATE();

Notes
• SYSDATE is a stable function (called once per statement) that requires no arguments.
Parentheses are optional.
• This function uses the date and time supplied by the operating system on the server to which
you are connected, which should be the same across all servers.
• In implementation, SYSDATE converts CLOCK_TIMESTAMP (page 143) from
TIMESTAMPTZ to TIMESTAMP.
• This function is identical to GETDATE (page 153).

Examples
SELECT SYSDATE();
sysdate
----------------------------
2009-02-18 16:39:58.628483
(1 row)
SELECT SYSDATE;
sysdate
----------------------------
2009-03-16 15:37:56.917191
(1 row)

See Also
Date/Time Expressions (page 76)

TIME_SLICE
Aggregates data by different fixed-time intervals and returns a rounded-up input TIMESTAMP
value to a value corresponding to the start or end of the time slice interval.
Given an input TIMESTAMP value, such as '2000-10-28 00:00:01', the start time of a 3-second
time slice interval is '2000-10-28 00:00:00', and the end time of the same time slice is
'2000-10-28 00:00:03'.

Syntax
TIME_SLICE(expr, slice_length,
[ time_unit = 'SECOND' ], [ start_or_end = 'START' ] )

Return
Rounded-up TIMESTAMP value. Its data type is the same as the first input argument of expr.

-161-
SQL Reference Manual

Parameters
expr Is evaluated on each row.
Can be either a column of type TIMESTAMP or a (string) constant, which
can be parsed into a TIMESTAMP value, such as '2004-10-19 10:23:54'.
slice_length Is the length of the slice specified in integers. Input must be a positive
integer.

time_unit [Optional] Is the time unit of the slice with a default of SECOND.
Domain of possible values: { HOUR, MINUTE, SECOND, MILLISECOND,
MICROSECOND }
start_or_end [Optional] Indicates whether the returned value corresponds to the start or
end time of the time slice interval. The default is START.
Domain of possible values: { START, END }

Notes
• 'ms' is a synonym of millisecond and 'us' is a synonym of microsecond.
• The corresponding SQL data type for TIMESTAMP is TIMESTAMP WITHOUT TIME ZONE.
Vertica supports TIMESTAMP for TIME_SLICE instead of DATE and TIME data types.
• TIME_SLICE exhibits the following behaviors regarding NULLs:
§ The system returns an error when any one of slice_length, time_unit, or start_or_end
parameters is NULL.
§ When slice_length, time_unit, and start_or_end contain legal values, and expr is NULL,
the system returns a NULL value, instead of an error.

Examples
The following example shows the (default) start time of a 3-second time slice:
SELECT TIME_SLICE('2004-10-19 00:00:01', 3);
time_slice
---------------------
2004-10-19 00:00:00
(1 row)
This example shows the end time of a 3-second time slice:
SELECT TIME_SLICE('2004-10-19 00:00:01', 3, 'SECOND', 'END');
time_slice
---------------------
2004-10-19 00:00:03
(1 row)
This example returns results in milliseconds, using a 3-second time slice:
SELECT TIME_SLICE('2004-10-19 00:00:01', 3, 'ms');
time_slice
-------------------------
2004-10-19 00:00:00.999
(1 row)

-162-
SQL Functions

This example returns results in microseconds, using a 3-second time slice:


SELECT TIME_SLICE('2004-10-19 00:00:01', 3, 'us');
time_slice
----------------------------
2004-10-19 00:00:00.999999
(1 row)
This example uses a 3-second interval with an input value of '00:00:01'. To focus specifically on
seconds, the example omits date, hours and minutes, though all values are implied as being part
of the timestamp.
Give the input of '00:00:01':
• '00:00:00' is the start of the 3-second time slice
• '00:00:03' is the end of the 3-second time slice.
• '00:00:03' is also the start of the second 3-second time slice because of time slice
boundaries, in which the end value of a time slice does not belong to that time slice; it starts
the next one.

When the time slice interval is not a factor of 60 seconds, such as in the given slice length of 9 in
the following example, the slice will not always start or end on 00 seconds:
SELECT TIME_SLICE('2009-02-14 20:13:01', 9);
time_slice
---------------------
2009-02-14 20:12:54
(1 row)
This is expected behavior, as the following properties are true for all time slices:
• Equal in length
• Consecutive (no gaps between them)
• Non-overlapping

-163-
SQL Reference Manual

To force the above example containing '2009-02-14 20:13:01' to start at '2009-02-14 20:13:00',
adjust the output timestamp values so that the remainder of 54 counts up to 60:
SELECT TIME_SLICE('2009-02-14 20:13:01', 9 )+'6 seconds'::INTERVAL AS time;
time
---------------------
2009-02-14 20:13:00
(1 row)
Alternatively, you could use a different slice length, divisible by 60, such as 5:
SELECT TIME_SLICE('2009-02-14 20:13:01', 5);
time_slice
---------------------
2009-02-14 20:13:00
(1 row)
Within the TIME_SLICE group (set of rows belonging to the same time slice) you can use
analytic functions such as FIRST_VALUE/LAST_VALUE (page 128) to find, for example, the
first/last price value within each time slice group. This could be useful if you want to sample input
data by choosing one row from each time slice group.
SELECT date_key, transaction_time, sales_dollar_amount,
TIME_SLICE(DATE '2000-01-01' + date_key + transaction_time, 3),
FIRST_VALUE(sales_dollar_amount)
OVER (PARTITION BY TIME_SLICE(DATE '2000-01-01' + date_key + transaction_time, 3)
ORDER BY DATE '2000-01-01' + date_key + transaction_time) AS first_value
FROM store.store_sales_fact
LIMIT 20;
date_key | transaction_time | sales_dollar_amount | time_slice | first_value
----------+------------------+---------------------+---------------------+-------------
1 | 00:41:16 | 164 | 2000-01-02 00:41:15 | 164
1 | 00:41:33 | 310 | 2000-01-02 00:41:33 | 310
1 | 15:32:51 | 271 | 2000-01-02 15:32:51 | 271
1 | 15:33:15 | 419 | 2000-01-02 15:33:15 | 419
1 | 15:33:44 | 193 | 2000-01-02 15:33:42 | 193
1 | 16:36:29 | 466 | 2000-01-02 16:36:27 | 466
1 | 16:36:44 | 250 | 2000-01-02 16:36:42 | 250
2 | 03:11:28 | 39 | 2000-01-03 03:11:27 | 39
3 | 03:55:15 | 375 | 2000-01-04 03:55:15 | 375
3 | 11:58:05 | 369 | 2000-01-04 11:58:03 | 369
3 | 11:58:24 | 174 | 2000-01-04 11:58:24 | 174
3 | 11:58:52 | 449 | 2000-01-04 11:58:51 | 449
3 | 19:01:21 | 201 | 2000-01-04 19:01:21 | 201
3 | 22:15:05 | 156 | 2000-01-04 22:15:03 | 156
4 | 13:36:57 | -125 | 2000-01-05 13:36:57 | -125
4 | 13:37:24 | -251 | 2000-01-05 13:37:24 | -251
4 | 13:37:54 | 353 | 2000-01-05 13:37:54 | 353
4 | 13:38:04 | 426 | 2000-01-05 13:38:03 | 426
4 | 13:38:31 | 209 | 2000-01-05 13:38:30 | 209
5 | 10:21:24 | 488 | 2000-01-06 10:21:24 | 488
(20 rows)

Notice how TIME_SLICE rounds the transaction time down to the 3-second slice length.

See Also
FIRST_VALUE/LAST_VALUE (page 128)

-164-
SQL Functions

TIMEOFDAY
Returns a text string representing the time of day.

Syntax
TIMEOFDAY()

Notes
TIMEOFDAY() returns the wall-clock time and advances during transactions.

Examples
SELECT TIMEOFDAY();
timeofday
-------------------------------------
Tue Apr 07 17:22:01.190445 2009 EDT
(1 row)

TRANSACTION_TIMESTAMP
Is equivalent to CURRENT_TIMESTAMP (page 144) except that it does not accept a precision
parameter. It returns a value of type TIMESTAMP WITH TIME ZONE representing the start of
the current transaction.

Syntax
TRANSACTION_TIMESTAMP()

Notes
This function returns the start time of the current transaction; the value does not change during
the transaction. The intent is to allow a single transaction to have a consistent notion of the
"current" time, so that multiple modifications within the same transaction bear the same
timestamp.

Examples
SELECT TRANSACTION_TIMESTAMP();
transaction_timestamp
------------------------------
2009-02-18 16:34:22.11775-05
(1 row)

See Also
CLOCK_TIMESTAMP (page 143) and STATEMENT_TIMESTAMP (page 160)

-165-
166

Formatting Functions
The formatting functions in this section provide a powerful tool set for converting various data
types (DATE/TIME, INTEGER, FLOATING POINT) to formatted strings and for converting from
formatted strings to specific data types.
These functions all follow a common calling convention:
• The first argument is the value to be formatted.
• The second argument is a template that defines the output or input format.
Exception: The TO_TIMESTAMP function can take a single double precision argument.

TO_CHAR
Converts various date/time and numeric values into text strings.

Syntax
TO_CHAR ( expression [, pattern ] )

Parameters
expression (TIMESTAMP, INTERVAL, INTEGER, DOUBLE PRECISION)
specifies the value to convert.
pattern [Optional] (CHAR or VARCHAR) specifies an output pattern
string using the Template Patterns for Date/Time Formatting
(page 171) and and/or Template Patterns for Numeric
Formatting (page 174).

Notes
• TO_CHAR(any) casts any type, except BINARY/VARBINARY, to VARCHAR.
The following example returns the error you receive if you attempt to cast TO_CHAR to a
binary data type:
SELECT TO_CHAR('abc'::VARBINARY);
ERROR: cannot cast type binary varying to character varying
• Ordinary text is allowed in TO_CHAR templates and is output literally. You can put a
substring in double quotes to force it to be interpreted as literal text even if it contains pattern
key words. For example, in '"Hello Year "YYYY', the YYYY is replaced by the year data,
but the single Y in Year is not.
• The TO_CHAR function's day-of-the-week numbering (see the 'D' template pattern (page
171)) is different from that of the EXTRACT (page 153) function.
• Given an INTERVAL type, TO_CHAR formats HH and HH12 as hours in a single day, while
HH24 can output hours exceeding a single day, for example, >24.
• To use a double quote character in the output, precede it with a double backslash. This is
necessary because the backslash already has a special meaning in a string constant. For
example: '\\"YYYY Month\\"'

-166-
SQL Functions

• TO_CHAR does not support the use of V combined with a decimal point. For example:
99.9V99 is not allowed.

Examples

Expression Result
SELECT TO_CHAR(CURRENT_TIMESTAMP, 'Tuesday , 06 05:39:18'
'Day, DD HH12:MI:SS');
SELECT TO_CHAR(CURRENT_TIMESTAMP, 'Tuesday, 6 05:39:18'
'FMDay, FMDD HH12:MI:SS');
SELECT TO_CHAR(-0.1, '99.99'); ' -.10'
SELECT TO_CHAR(-0.1, 'FM9.99'); '-.1'
SELECT TO_CHAR(0.1, '0.9'); ' 0.1'
SELECT TO_CHAR(12, '9990999.9'); ' 0012.0'
SELECT TO_CHAR(12, 'FM9990999.9'); '0012.'
SELECT TO_CHAR(485, '999'); ' 485'
SELECT TO_CHAR(-485, '999'); '-485'
SELECT TO_CHAR(485, '9 9 9'); ' 4 8 5'
SELECT TO_CHAR(1485, '9,999'); ' 1,485'
SELECT TO_CHAR(1485, '9G999'); ' 1 485'
SELECT TO_CHAR(148.5, '999.999'); ' 148.500'
SELECT TO_CHAR(148.5, 'FM999.999'); '148.5'
SELECT TO_CHAR(148.5, 'FM999.990'); '148.500'
SELECT TO_CHAR(148.5, '999D999'); ' 148,500'
SELECT TO_CHAR(3148.5, '9G999D999'); ' 3 148,500'
SELECT TO_CHAR(-485, '999S'); '485-'
SELECT TO_CHAR(-485, '999MI'); '485-'
SELECT TO_CHAR(485, '999MI'); '485 '
SELECT TO_CHAR(485, 'FM999MI'); '485'
SELECT TO_CHAR(485, 'PL999'); '+485'
SELECT TO_CHAR(485, 'SG999'); '+485'
SELECT TO_CHAR(-485, 'SG999'); '-485'
SELECT TO_CHAR(-485, '9SG99'); '4-85'
SELECT TO_CHAR(-485, '999PR'); '<485>'
SELECT TO_CHAR(485, 'L999'); 'DM 485
SELECT TO_CHAR(485, 'RN'); ' CDLXXXV'
SELECT TO_CHAR(485, 'FMRN'); 'CDLXXXV'

-167-
SQL Reference Manual

SELECT TO_CHAR(5.2, 'FMRN'); 'V'


SELECT TO_CHAR(482, '999th'); ' 482nd'
SELECT TO_CHAR(485, '"Good number:"999'); 'Good number: 485'
SELECT TO_CHAR(485.8, '"Pre:"999" Post:" .999'); 'Pre: 485 Post: .800'
SELECT TO_CHAR(12, '99V999'); ' 12000'
SELECT TO_CHAR(12.4, '99V999'); ' 12400'
SELECT TO_CHAR(12.45, '99V9'); ' 125'
SELECT TO_CHAR(-1234.567); -1234.567
SELECT TO_CHAR('1999-12-25'::DATE); 1999-12-25
SELECT TO_CHAR('1999-12-25 11:31'::TIMESTAMP); 1999-12-25 11:31:00
SELECT TO_CHAR('1999-12-25 11:31 1999-12-25 11:31:00-05
EST'::TIMESTAMPTZ);
SELECT TO_CHAR('3 days 1000.333 secs'::INTERVAL); 3 days 00:16:40.333

TO_DATE
Converts a string value to a DATE type.

Syntax
TO_DATE ( expression , pattern )

Parameters
expression (CHAR or VARCHAR) specifies the value to convert
pattern (CHAR or VARCHAR) specifies an output pattern string using the
Template Patterns for Date/Time Formatting (page 171) and/or
Template Patterns for Numeric Formatting (page 174).

Notes
• To use a double quote character in the output, precede it with a double backslash. This is
necessary because the backslash already has a special meaning in a string constant. For
example: '\\"YYYY Month\\"'
• TO_TIMESTAMP and TO_DATE skip multiple blank spaces in the input string if the FX
option is not used. FX must be specified as the first item in the template. For example:
§ For example TO_TIMESTAMP('2000 JUN', 'YYYY MON') is correct.
§ TO_TIMESTAMP('2000 JUN', 'FXYYYY MON') returns an error, because
TO_TIMESTAMP expects one space only.

-168-
SQL Functions

• The YYYY conversion from string to TIMESTAMP or DATE has a restriction if you use a year
with more than four digits. You must use a non-digit character or template after YYYY,
otherwise the year is always interpreted as four digits. For example (with the year 20000):
TO_DATE('200001131', 'YYYYMMDD') is interpreted as a four-digit year
Instead, use a non-digit separator after the year, such as TO_DATE('20000-1131',
'YYYY-MMDD') or TO_DATE('20000Nov31', 'YYYYMonDD').
• In conversions from string to TIMESTAMP or DATE, the CC field is ignored if there is a YYY,
YYYY or Y,YYY field. If CC is used with YY or Y then the year is computed as
(CC-1)*100+YY.
• Examples

SELECT TO_DATE('13 Feb 2000', 'DD Mon YYYY');


to_date
------------
2000-02-13
(1 row)

See Also
Template Pattern Modifiers for Date/Time Formatting (page 174)

TO_TIMESTAMP
Converts a string value or a UNIX/POSIX epoch value to a TIMESTAMP WITH TIME ZONE type.

Syntax
TO_TIMESTAMP ( expression, pattern )
TO_TIMESTAMP ( unix-epoch )

Parameters
expression (CHAR or VARCHAR) is the string to convert
pattern (CHAR or VARCHAR) specifies an output pattern string using
the Template Patterns for Date/Time Formatting (page 171)
and/or Template Patterns for Numeric Formatting (page 174).
unix-epoch (DOUBLE PRECISION) specifies some number of seconds
elapsed since midnight UTC of January 1, 1970, not counting
leap seconds. INTEGER values are implicitly cast to DOUBLE
PRECISION.

Notes
• For more information about UNIX/POSIX time, see Wikipedia
http://en.wikipedia.org/wiki/Unix_time.

-169-
SQL Reference Manual

• Millisecond (MS) and microsecond (US) values in a conversion from string to TIMESTAMP
are used as part of the seconds after the decimal point. For example TO_TIMESTAMP('12:3',
'SS:MS') is not 3 milliseconds, but 300, because the conversion counts it as 12 + 0.3
seconds. This means for the format SS:MS, the input values 12:3, 12:30, and 12:300 specify
the same number of milliseconds. To get three milliseconds, use 12:003, which the
conversion counts as 12 + 0.003 = 12.003 seconds.
Here is a more complex example: TO_TIMESTAMP('15:12:02.020.001230',
'HH:MI:SS.MS.US') is 15 hours, 12 minutes, and 2 seconds + 20 milliseconds + 1230
microseconds = 2.021230 seconds.
• To use a double quote character in the output, precede it with a double backslash. This is
necessary because the backslash already has a special meaning in a string constant. For
example: '\\"YYYY Month\\"'
• TO_TIMESTAMP and TO_DATE skip multiple blank spaces in the input string if the FX
option is not used. FX must be specified as the first item in the template. For example:
§ For example TO_TIMESTAMP('2000 JUN', 'YYYY MON') is correct.
§ TO_TIMESTAMP('2000 JUN', 'FXYYYY MON') returns an error, because
TO_TIMESTAMP expects one space only.
• The YYYY conversion from string to TIMESTAMP or DATE has a restriction if you use a year
with more than four digits. You must use a non-digit character or template after YYYY,
otherwise the year is always interpreted as four digits. For example (with the year 20000):
TO_DATE('200001131', 'YYYYMMDD') is interpreted as a four-digit year
Instead, use a non-digit separator after the year, such as TO_DATE('20000-1131',
'YYYY-MMDD') or TO_DATE('20000Nov31', 'YYYYMonDD').
• In conversions from string to TIMESTAMP or DATE, the CC field is ignored if there is a YYY,
YYYY or Y,YYY field. If CC is used with YY or Y then the year is computed as
(CC-1)*100+YY.
• Examples

SELECT TO_TIMESTAMP('13 Feb 2009', 'DD Mon YYYY');


to_timestamp
------------------------
2009-02-13 00:00:00-05
(1 row)
SELECT TO_TIMESTAMP(200120400);
to_timestamp
------------------------
1976-05-05 01:00:00-04
(1 row)

See Also
Template Pattern Modifiers for Date/Time Formatting (page 174)

TO_NUMBER
Converts a string value to DOUBLE PRECISION.

-170-
SQL Functions

Syntax
TO_NUMBER ( expression, [ pattern ] )

Parameters
expression (CHAR or VARCHAR) specifies the string to convert.
pattern (CHAR or VARCHAR) Optional parameter specifies an output
pattern string using the Template Patterns for Date/Time
Formatting (page 171) and/or Template Patterns for Numeric
Formatting (page 174). If omitted, function returns a floating
point.

Notes
To use a double quote character in the output, precede it with a double backslash. This is
necessary because the backslash already has a special meaning in a string constant. For
example: '\\"YYYY Month\\"'

Examples
SELECT TO_CHAR(2009, 'rn'), TO_NUMBER('mmix', 'rn');
to_char | to_number
-----------------+-----------
mmix | 2009
(1 row)
It the pattern parameter is omitted, the function returns a floating point.
SELECT TO_NUMBER('-123.456e-01');
to_number
-----------
-12.3456

Template Patterns for Date/Time Formatting


In an output template string (for TO_CHAR), there are certain patterns that are recognized and
replaced with appropriately-formatted data from the value to be formatted. Any text that is not a
template pattern is simply copied verbatim. Similarly, in an input template string (for anything
other than TO_CHAR), template patterns identify the parts of the input data string to be looked at
and the values to be found there.
Note: Vertica uses the ISO 8601:2004 style for date/time fields in Vertica *.log files. For
example,
2008-09-16 14:40:59.123 TM Moveout:0x2aaaac002180 [Txn] <INFO>
Certain modifiers can be applied to any template pattern to alter its behavior as described in
Template Pattern Modifiers for Date/Time Formatting (page 174).

Pattern Description

HH Hour of day (01-12)

-171-
SQL Reference Manual

HH12 Hour of day (01-12)

HH24 Hour of day (00-23)

MI Minute (00-59)

SS Second (00-59)

MS Millisecond (000-999)

US Microsecond (000000-999999)

SSSS Seconds past midnight (0-86399)

AM or A.M. or PM or P.M. Meridian indicator (uppercase)

am or a.m. or pm or p.m. Meridian indicator (lowercase)

Y,YYY Year (4 and more digits) with comma

YYYY Year (4 and more digits)

YYY Last 3 digits of year

YY Last 2 digits of year

Y Last digit of year

IYYY ISO year (4 and more digits)

IYY Last 3 digits of ISO year

IY Last 2 digits of ISO year

I Last digits of ISO year

BC or B.C. or AD or A.D. Era indicator (uppercase)

bc or b.c. or ad or a.d. Era indicator (lowercase)

MONTH Full uppercase month name (blank-padded to 9 chars)

Month Full mixed-case month name (blank-padded to 9 chars)

month Full lowercase month name (blank-padded to 9 chars)

MON Abbreviated uppercase month name (3 chars)

-172-
SQL Functions

Mon Abbreviated mixed-case month name (3 chars)

mon Abbreviated lowercase month name (3 chars)

MM Month number (01-12)

DAY Full uppercase day name (blank-padded to 9 chars)

Day Full mixed-case day name (blank-padded to 9 chars)

day full lowercase day name (blank-padded to 9 chars)

DY Abbreviated uppercase day name (3 chars)

Dy Abbreviated mixed-case day name (3 chars)

dy Abbreviated lowercase day name (3 chars)

DDD Day of year (001-366)

DD Day of month (01-31)

D Day of week (1-7; Sunday is 1)

W Week of month (1-5) (The first week starts on the first day
of the month.)

WW Week number of year (1-53) (The first week starts on the


first day of the year.)

IW ISO week number of year (The first Thursday of the new


year is in week 1.)

CC Century (2 digits)

J Julian Day (days since January 1, 4712 BC)

Q Quarter

RM Month in Roman numerals (I-XII; I=January) (uppercase)

rm Month in Roman numerals (i-xii; i=January) (lowercase)

TZ Time-zone name (uppercase)

tz Time-zone name (lowercase)

-173-
SQL Reference Manual

Template Pattern Modifiers for Date/Time Formatting


Certain modifiers can be applied to any template pattern to alter its behavior. For example,
FMMonth is the Month pattern with the FM modifier.

Modifier Description
AM Time is before 12:00
AT Ignored
JULIAN, JD, J Next field is Julian Day
FM prefix fill mode (suppress padding blanks and zeroes)
For example: FMMonth
FX prefix Fixed format global option (see usage notes)
For example: FX Month DD Day
ON Ignored
PM Time is on or after 12:00
T Next field is time
TH suffix Uppercase ordinal number suffix
For example: DDTH
th suffix Lowercase ordinal number suffix
For example: DDth
TM prefix Translation mode (print localized day and month names based on
lc_messages). For example: TMMonth

Notes
FM suppresses leading zeroes and trailing blanks that would otherwise be added to make the
output of a pattern be fixed width.

Template Patterns for Numeric Formatting


Pattern Description

9 Value with the specified number of digits

0 Value with leading zeros

. (period) Decimal point

, (comma) Group (thousand) separator

-174-
SQL Functions

PR Negative value in angle brackets

S Sign anchored to number (uses locale)

L Currency symbol (uses locale)

D Decimal point (uses locale)

G Group separator (uses locale)

MI Minus sign in specified position (if number < 0)

PL Plus sign in specified position (if number > 0)

SG Plus/minus sign in specified position

RN Roman numeral (input between 1 and 3999)

TH or th Ordinal number suffix

V Shift specified number of digits (see notes)

EEEE Scientific notation (not implemented yet)

Usage
• A sign formatted using SG, PL, or MI is not anchored to the number; for example:
§ TO_CHAR(-12, 'S9999') produces ' -12'
§ TO_CHAR(-12, 'MI9999') produces '- 12'
• 9 results in a value with the same number of digits as there are 9s. If a digit is not available it
outputs a space.
• TH does not convert values less than zero and does not convert fractional numbers.
• V effectively multiplies the input values by 10^n, where n is the number of digits following V.
TO_CHAR does not support the use of V combined with a decimal point. For example:
99.9V99 is not allowed.

-175-
SQL Reference Manual

Mathematical Functions
Some of these functions are provided in multiple forms with different argument types. Except
where noted, any given form of a function returns the same data type as its argument. The
functions working with DOUBLE PRECISION (page 105) data could vary in accuracy and
behavior in boundary cases depending on the host system.

See Also
Template Patterns for Numeric Formatting (page 174)

ABS
Returns the absolute value of the argument. The return value has the same data type as the
argument..

Syntax
ABS ( expression )

Parameters
expression Is a value of type INTEGER or DOUBLE PRECISION

Examples
SELECT ABS(-28.7);
abs
------
28.7
(1 row)

ACOS
Returns a DOUBLE PRECISION value representing the trigonometric inverse cosine of the
argument.

Syntax
ACOS ( expression )

Parameters
expression Is a value of type DOUBLE PRECISION

Example
SELECT ACOS (1);
acos
------
0
(1 row)

-176-
SQL Functions

ASIN
Returns a DOUBLE PRECISION value representing the trigonometric inverse sine of the
argument.

Syntax
ASIN ( expression )

Parameters
expression Is a value of type DOUBLE PRECISION

Example
SELECT ASIN(1);
asin
-----------------
1.5707963267949
(1 row)

ATAN
Returns a DOUBLE PRECISION value representing the trigonometric inverse tangent of the
argument.

Syntax
ATAN ( expression )

Parameters
expression Is a value of type DOUBLE PRECISION

Example
SELECT ATAN(1);
atan
-------------------
0.785398163397448
(1 row)

ATAN2
Returns a DOUBLE PRECISION value representing the trigonometric inverse tangent of the
arithmetic dividend of the arguments.

Syntax
ATAN2 ( quotient, divisor )

Parameters
quotient Is an expression of type DOUBLE PRECISION representing the

-177-
SQL Reference Manual

quotient
divisor Is an expression of type DOUBLE PRECISION representing the divisor

Example
SELECT ATAN2(2,1);
atan2
------------------
1.10714871779409
(1 row)

CBRT
Returns the cube root of the argument. The return value has the type DOUBLE PRECISION.

Syntax
CBRT ( expression )

Parameters
expression Is a value of type DOUBLE PRECISION

Examples
SELECT CBRT(27.0);
cbrt
------
3
(1 row)

CEILING (CEIL)
Returns the smallest floating point not less than then argument.

Syntax
CEILING ( expression )
CEIL ( expression )

Parameters
expression Is a value of type INTEGER or DOUBLE PRECISION

Examples
SELECT CEIL(-42.8);
ceil
------
-42
(1 row)

COS
Returns a DOUBLE PRECISION value representing the trigonometric cosine of the argument.

-178-
SQL Functions

Syntax
COS ( expression )

Parameters
expression Is a value of type DOUBLE PRECISION

Example
SELECT COS(-1);
cos
------------------
0.54030230586814
(1 row)

COT
Returns a DOUBLE PRECISION value representing the trigonometric cotangent of the
argument.

Syntax
COT ( expression )

Parameters
expression Is a value of type DOUBLE PRECISION

Example
SELECT COT(1);
cot
-------------------
0.642092615934331
(1 row)

DEGREES
Converts an expression from radians to degrees. The return value has the type DOUBLE
PRECISION.

Syntax
DEGREES ( expression )

Parameters
expression Is a value of type DOUBLE PRECISION

Examples
SELECT DEGREES(0.5);
degrees

-179-
SQL Reference Manual

------------------
28.6478897565412
(1 row)

EXP
Returns the exponential function, e to the power of a number. The return value has the same
data type as the argument.

Syntax
EXP ( exponent )

Parameters
exponent Is an expression of type INTEGER or DOUBLE PRECISION

Example
SELECT EXP(1.0);
exp
------------------
2.71828182845905
(1 row)

FLOOR
Returns a floating point value representing the largest INTEGER not greater than the argument.

Syntax
FLOOR ( expression )

Parameters
expression Is an expression of type INTEGER or DOUBLE PRECISION.

Examples
Although the following example looks like an INTEGER, the number on the left is 2^49 as an
INTEGER, but the number on the right is a FLOAT:
SELECT 1<<49, FLOOR(1 << 49);
?column? | floor
-----------------+-----------------
562949953421312 | 562949953421312
(1 row)
Compare the above example to:
SELECT 1<<50, FLOOR(1 << 50);
?column? | floor
------------------+----------------------
1125899906842624 | 1.12589990684262e+15

-180-
SQL Functions

(1 row)

HASH
Calculates a hash value over its arguments, producing a value in the range 0 <= x < 263 (two to
the sixty-third power or 2^63).

Syntax
HASH ( expression [ ,... ] )

Parameters
expression Is an expression of any data type. For the purpose of hash segmentation,
each expression is a column reference (see "Column References" on
page 74).

Notes
• The HASH() function is used to provide projection segmentation over a set of nodes in a
cluster and takes up to 32 arguments, usually column names, and selects a specific node for
each row based on the values of the columns for that row. HASH (Col1, Col2).
• If your data is fairly regular and you want more even distribution than you get with HASH,
consider using MODULARHASH (page 183)() for project segmentation.

Examples
SELECT HASH(product_price, product_cost)
FROM product_dimension
WHERE product_price = '11';
hash
---------------------
4157497907121511878
1799398249227328285
3250220637492749639
(3 rows)

See Also
MODULARHASH (page 183)

LN
Returns the natural logarithm of the argument. The return data type is the same as the
argument.

-181-
SQL Reference Manual

Syntax
LN ( expression )

Parameters
expression Is an expression of type INTEGER or DOUBLE PRECISION

Examples
SELECT LN(2);
ln
-------------------
0.693147180559945
(1 row)

LOG
Returns the logarithm to the specified base of the argument. The return data type is the same as
the argument.

Syntax
LOG ( [ base, ] expression )

Parameters
base Specifies the base (default is base 10)
expression Is an expression of type INTEGER or DOUBLE PRECISION

Examples
SELECT LOG(2.0, 64);
log
-----
6
(1 row)
SELECT LOG(100);
log
-----
2
(1 row)

MOD
MOD (modulo) returns the remainder of a division operation. The return data type is the same as
the arguments.

Syntax
MOD ( expression1, expression2 )

-182-
SQL Functions

Parameters
expression1 Specifies the dividend (INTEGER or DOUBLE PRECISION)
expression2 Specifies the divisor (type same as dividend)

Notes
The dividend is the quantity to be divided. For example:
6/2 = 3
The dividend is 6. The divisor is 2.

Examples
SELECT MOD(9,4);
mod
-----
1
(1 row)

MODULARHASH
Calculates a hash value over its arguments for the purpose of projection segmentation. In all
other uses, returns 0.
If you can hash segment your data using a column with a regular pattern, such as a sequential
unique identifier, MODULARHASH distributes the data more evenly than HASH, which
distributes data using a normal statistical distribution.

Syntax
MODULARHASH ( expression [ ,... ] )

Parameters
expression Is a column reference (see "Column References" on page 74) of
any data type.

Notes
The MODULARHASH() function takes up to 32 arguments, usually column names, and selects a
specific node for each row based on the values of the columns for that row.

Examples
CREATE PROJECTION fact_ts_2 (f_price, f_cid, f_tid, f_cost, f_date)
AS (SELECT price, cid, tid, cost, dwdate
FROM fact)
SEGMENTED BY MODULARHASH(dwdate)
ALL NODES OFFSET 2;

See Also
HASH (page 181)

-183-
SQL Reference Manual

PI
Returns the constant pi (Π), the ratio of any circle's circumference to its diameter in Euclidean
geometry The return type is DOUBLE PRECISION.

Syntax
PI()

Examples
SELECT PI();
pi
------------------
3.14159265358979
(1 row)

POWER
Returns a DOUBLE PRECISION value representing one number raised to the power of another
number.

Syntax
POWER ( expression1, expression2 )

Parameters
expression1 Is an expression of type DOUBLE PRECISION that represents the
base
expression2 Is an expression of type DOUBLE PRECISION that represents the
exponent

Examples
SELECT POWER(9.0, 3.0);
power
-------
729
(1 row)

RADIANS
Returns a DOUBLE PRECISION value representing an angle expressed in degrees converted to
radians.

Syntax
RADIANS ( expression )

Parameters
expression Is an expression of type DOUBLE PRECISION representing

-184-
SQL Functions

degrees

Examples

SELECT RADIANS(45);
radians
-------------------
0.785398163397448
(1 row)

RANDOM
Returns a uniformly-distributed random number x, where 0 <= x < 1.

Syntax
RANDOM()

Parameters
RANDOM has no arguments. Its result is a FLOAT8 data type (also called DOUBLE
PRECISION (page 105)).

Notes
Typical pseudo-random generators accept a seed, which is set to generate a reproducible
pseudo-random sequence. Vertica, however, distributes SQL processing over a cluster of nodes,
where each node generates its own independent random sequence.
Results depending on RANDOM are not reproducible because the work might be divided
differently across nodes. Therefore, Vertica automatically generates truly random seeds for each
node each time a request is executed and does not provide a mechanism for forcing a specific
seed.

Examples
In the following example, the result is a float, which is >= 0 and < 1.0:
SELECT RANDOM();
random
-------------------
0.211625560652465
(1 row)

RANDOMINT
Returns a uniformly-distributed integer I, where 0 <= I < N, where N <= MAX_INT8. That is,
RANDOMINT(N) returns one of the N integers from 0 through N-1.

Syntax
RANDOMINT(N)

-185-
SQL Reference Manual

Example
In the following example, the result is an INT8, which is >= 0 and < N. In this case, INT8 is
randomly chosen from the set {0,1,2,3,4}.
SELECT RANDOMINT(5);
randomint
----------
3
(1 row)

ROUND
Rounds off a value to a specified number of decimal places. Fractions greater than or equal to .5
are rounded up. Fractions less than .5 are rounded down (truncated).

Syntax
ROUND ( expression [ , decimal-places ] )

Parameters
expression Is an expression of type DOUBLE PRECISION
decimal-places If positive, specifies the number of decimal places to display to the right of the
decimal point; if negative, specifies the number of decimal places to display to
the left of the decimal point.

Notes
The ROUND function rounds off except in the case of a decimal constant with more than 15
decimal places. For example:
SELECT ROUND(3.499999999999999); -- 15 decimal places
round
-------
3
(1 row)
The internal integer representation used to compute the ROUND function causes the fraction to
be evaluated precisely and it is thus rounded down. However:
SELECT ROUND(3.4999999999999999); -- 16 decimal places
round
-------
4
(1 row)
The internal floating point representation used to compute the ROUND function causes the
fraction to be evaluated as 3.5, which is rounded up.

Examples
SELECT ROUND(3.14159, 3);
round
-------

-186-
SQL Functions

3.142
(1 row)
SELECT ROUND(1234567, -3);
round
---------
1235000
(1 row)
SELECT ROUND(3.4999, -1);
round
-------
0
(1 row)

SIGN
Returns a DOUBLE PRECISION value of -1, 0, or 1 representing the arithmetic sign of the
argument.

Syntax
SIGN ( expression )

Parameters
expression Is an expression of type DOUBLE PRECISION

Examples
SELECT SIGN(-8.4);
sign
------
-1
(1 row)

SIN
Returns a DOUBLE PRECISION value representing the trigonometric sine of the argument.

Syntax
SIN ( expression )

Parameters
expression Is an expression of type DOUBLE PRECISION

Example
SELECT SIN(30 * 2 * 3.14159 / 360);
sin
-------------------
0.499999616987256

-187-
SQL Reference Manual

(1 row)

SQRT
Returns a DOUBLE PRECISION value representing the arithmetic square root of the argument.

Syntax
SQRT ( expression )

Parameters
expression Is an expression of type DOUBLE PRECISION

Examples
SELECT SQRT(2);
sqrt
-----------------
1.4142135623731
(1 row)

TAN
Returns a DOUBLE PRECISION value representing the trigonometric tangent of the argument.

Syntax
TAN ( expression )

Parameters
expression Is an expression of type DOUBLE PRECISION

Example
SELECT TAN(30);
tan
-------------------
-6.40533119664628
(1 row)

TRUNC
Returns a value representing the argument fully truncated (toward zero) or truncated to a specific
number of decimal places.

Syntax
TRUNC ( expression [ , places ]

Parameters
expression Is an expression of type INTEGER or DOUBLE PRECISION that

-188-
SQL Functions

represents the number to truncate


places Is an expression of type INTEGER specifies the number of
decimal places to return

Examples
SELECT TRUNC(42.8);
trunc
-------
42
(1 row)

SELECT TRUNC(42.4382, 2);


trunc
-------
42.43
(1 row)

WIDTH_BUCKET
Lets you construct equiwidth histograms, in which the histogram range is divided into intervals
(buckets) of identical sizes. In addition, values below the low bucket return 0, and values above
the high bucket return bucket_count +1. Returns an integer value.

Syntax
WIDTH_BUCKET( expr, hist_min, hist_max, bucket_count )

Parameters
expr Is the expression for which the histogram is
created. This expression must evaluate to a
numeric or datetime value or to a value that can
be implicitly converted to a numeric or datetime
value. If expr evaluates to null, then the
expression returns null.
hist_min Is an expression that resolves to the low
boundary of bucket 1. Must also evaluate to
numeric or datetime values and cannot evaluate
to null.
hist_max Is an expression that resolves to the high
boundary of bucket bucket_count. Must also
evaluate to a numeric or datetime value and
cannot evaluate to null.
bucket_count Is an expression that resolves to a constant,
indicating the number of buckets. This
expression always evaluates to a positive
INTEGER.

-189-
SQL Reference Manual

Notes
• WIDTH_BUCKET divides a data set into buckets of equal width. For example, Age = 0-20,
20-40, 40-60, 60-80. This is known as an equiwidth histogram.
• When using WIDTH_BUCKET pay attention to the minimum and maximum boundary values.
Each bucket contains values equal to or greater than the base value of that bucket, so that
age ranges of 0-20, 20-40, and so on, are actually 0-19.99 and 20-39.999.
• WIDTH_BUCKET accepts the following data types: (float and/or int), (timestamp and/or date
and/or timestamptz), or (interval and/or time).

Examples
The following example returns five possible values and has three buckets: 0 [Up to 100), 1
[100-300), 2 [300-500), 3 [500-700), and 4 [700 and up):
SELECT product_description, product_cost,
WIDTH_BUCKET(product_cost, 100, 700, 3);
The following example creates a nine-bucket histogram on the annual_income column for
customers in Connecticut who are female doctors. The results return the bucket number to an
“Income” column, divided into eleven buckets, including an underflow and an overflow. Note that
if customers had an annual incomes greater than the maximum value, they would be assigned to
an overflow bucket, 10:
SELECT customer_name, annual_income,
WIDTH_BUCKET (annual_income, 100000, 1000000, 9) AS "Income"
FROM public.customer_dimension WHERE customer_state='CT'
AND title='Dr.' AND customer_gender='Female' AND household_id < '1000'
ORDER BY "Income";
In the following result set, the reason there is a bucket 0 is because buckets are numbered from
1 to bucket_count. Anything less than the given value of hist_min goes in bucket 0, and
anything greater than the given value of hist_max goes in the bucket bucket_count+1. In
this example, bucket 9 is empty, and there is no overflow. The value 12,283 is less than 100,000,
so it goes into the underflow bucket.
customer_name | annual_income | Income
--------------------+---------------+--------
Joanna A. Nguyen | 12283 | 0
Amy I. Nguyen | 109806 | 1
Juanita L. Taylor | 219002 | 2
Carla E. Brown | 240872 | 2
Kim U. Overstreet | 284011 | 2
Tiffany N. Reyes | 323213 | 3
Rebecca V. Martin | 324493 | 3
Betty . Roy | 476055 | 4
Midori B. Young | 462587 | 4
Martha T. Brown | 687810 | 6
Julie D. Miller | 616509 | 6
Julie Y. Nielson | 894910 | 8
Sarah B. Weaver | 896260 | 8
Jessica C. Nielson | 861066 | 8
(14 rows)

-190-
SQL Functions

NULL-handling Functions
NULL-handling functions take arguments of any type, and their return type is based on their
argument types.

COALESCE
Returns the value of the first non-null expression in the list. If all expressions evaluate to null,
then the COALESCE function returns null.

Syntax
SELECT COALESCE ( expr1, expr2 );
SELECT COALESCE ( expr1, expr2, ... exprn );

Parameters
• COALESCE (expr1, expr2) is equivalent to the following CASE expression:
CASE WHEN expr1 IS NOT NULL THEN expr1 ELSE expr2 END;
• COALESCE (expr1, expr2, ... exprn), for n >= 3, is equivalent to the following CASE
expression:
CASE WHEN expr1 IS NOT NULL THEN expr1
ELSE COALESCE (expr2, . . . , exprn) END;

Notes
COALESCE is an ANSI standard function (SQL92).

Example
SELECT product_description, COALESCE(lowest_competitor_price,
highest_competitor_price, average_competitor_price) AS price
FROM product_dimension;
product_description | price
------------------------------------+-------
Brand #54109 kidney beans | 264
Brand #53364 veal | 139
Brand #50720 ice cream sandwhiches | 127
Brand #48820 coffee cake | 174
Brand #48151 halibut | 353
Brand #47165 canned olives | 250
Brand #39509 lamb | 306
Brand #36228 tuna | 245
Brand #34156 blueberry muffins | 183
Brand #31207 clams | 163
(10 rows)

See Also
Case Expressions (page 73)
ISNULL (page 192)

-191-
SQL Reference Manual

ISNULL
Returns the value of the first non-null expression in the list.
ISNULL is an alias of NVL (page 193).

Syntax
SELECT ISNULL ( expr1 , expr2 );

Parameters
• If expr1 is null, then ISNULL returns expr2.
• If expr1 is not null, then ISNULL returns expr1.

Notes
• COALESCE (page 191) is the more standard, more general function.
• ISNULL is equivalent to COALESCE except that ISNULL is called with only two arguments.
• ISNULL(a,b) is different from x IS NULL.
• The arguments can have any data type consistent with SQL92 clause 9.3, “Set operation
result data types.”
• Implementation is equivalent to the CASE expression. For example:
CASE WHEN expr1 IS NULL THEN expr2 ELSE expr1 END;
• The following statement returns the value 140:
SELECT ISNULL(NULL, 140) FROM employee_dimension;
• The following statement returns the value 60:
SELECT ISNULL(60, 90) FROM employee_dimension;

Examples
SELECT product_description, product_price, ISNULL(product_cost, 0.0) AS cost FROM
product_dimension;
product_description | product_price | cost
--------------------------------+---------------+------
Brand #59957 wheat bread | 405 | 207
Brand #59052 blueberry muffins | 211 | 140
Brand #59004 english muffins | 399 | 240
Brand #53222 wheat bread | 323 | 94
Brand #52951 croissants | 367 | 121
Brand #50658 croissants | 100 | 94
Brand #49398 white bread | 318 | 25
Brand #46099 wheat bread | 242 | 3
Brand #45283 wheat bread | 111 | 105
Brand #43503 jelly donuts | 259 | 19
(10 rows)

See Also
Case Expressions (page 73)
COALESCE (page 191)

-192-
SQL Functions

NVL (page 193)

NULLIF
Compares expr1 and expr2. If the expressions are not equal, the function returns expr1. If the
expressions are equal, the function returns null.

Syntax
NULLIF( expr1, expr2 )

Parameters
expr1 Is a value of any data type.
expr2 Must have the same data type as expr1 or a type that can be
implicitly cast to match expr1. The result has the same type as
expr1.

Examples
The following series of statements illustrates one use of the NULLIF function.
Creates a single-column table t:
CREATE TABLE t (x TIMESTAMPTZ);
Create temporary projections:
SELECT IMPLEMENT_TEMP_DESIGN('')
Insert some values into table t:
INSERT INTO t VALUES('2009-09-04 09:14:00-04');
INSERT INTO t VALUES('2010-09-04 09:14:00-04');
Issue a select statement:
SELECT x, nullif(x, '2009-09-04 09:14:00 EDT') FROM t;
x | nullif
------------------------+------------------------
2009-09-04 09:14:00-04 |
2010-09-04 09:14:00-04 | 2010-09-04 09:14:00-04

NVL
Returns the value of the first non-null expression in the list.

Syntax
SELECT NVL( expr1 , expr2 );

Parameters
• If expr1 is null, then NVL returns expr2.

-193-
SQL Reference Manual

• If expr1 is not null, then NVL returns expr1.

Notes
• COALESCE (page 191) is the more standard, more general function.
• NVL is equivalent to COALESCE except that NVL is called with only two arguments.
• The arguments can have any data type consistent with SQL92 clause 9.3, “Set operation
result data types.”
• Implementation is equivalent to the CASE expression:
CASE WHEN expr1 IS NULL THEN expr2 ELSE expr1 END;

Examples
expr1 is not null, so NVL returns expr1:
SELECT NVL('fast', 'database');
nvl
------
fast
(1 row)
expr1 is null, so NVL returns expr2:
SELECT NVL(null, 'database');
nvl
----------
database
(1 row)
expr2 is null, so NVL returns expr1:
SELECT NVL('fast', null);
nvl
------
fast
(1 row)
In the following example, expr1 (title) contains nulls, so NVL returns expr2 and substitutes
'Withheld' for the unknown values:
SELECT customer_name,
NVL(title, 'Withheld') as title
FROM customer_dimension
ORDER BY title;
customer_name | title
------------------------+-------
Alexander I. Lang | Dr.
Steve S. Harris | Dr.
Daniel R. King | Dr.
Luigi I. Sanchez | Dr.
Duncan U. Carcetti | Dr.
Meghan K. Li | Dr.
Laura B. Perkins | Dr.
Samantha V. Robinson | Dr.
Joseph P. Wilson | Mr.
Kevin R. Miller | Mr.

-194-
SQL Functions

Lauren D. Nguyen | Mrs.


Emily E. Goldberg | Mrs.
Darlene K. Harris | Ms.
Meghan J. Farmer | Ms.
Bettercare | Withheld
Ameristar | Withheld
Initech | Withheld
(17 rows)

See Also
Case Expressions (page 73)
COALESCE (page 191)
ISNULL (page 192)
NVL2 (page 195)

NVL2
Takes three arguments. If the first argument is not NULL, it returns the second argument,
otherwise it returns the third argument. The data types of the second and third arguments are
implicitly cast to a common type if they don't agree, similar to COALESCE (page 191).

Syntax
SELECT NVL2 ( expr1 , expr2 , expr3 );

Parameters
• If expr1 is not null, then NVL2 returns expr2.
• If expr1 is null, then NVL2 returns expr3.

Notes
Arguments two and three can have any data type consistent with SQL92 clause 9.3, “Set
operation result data types.”
Implementation is equivalent to the CASE expression:
CASE WHEN expr1 IS NOT NULL THEN expr2 ELSE expr3 END;

Examples
In this example, expr1 is not null, so NVL2 returns expr2:
SELECT NVL2('very', 'fast', 'database');
nvl2
------
fast
(1 row)
In this example, expr1 is null, so NVL2 returns expr3:
SELECT NVL2(null, 'fast', 'database');
nvl2
----------

-195-
SQL Reference Manual

database
(1 row)
In the following example, expr1 (title) contains nulls, so NVL2 returns expr3 ('Withheld') and also
substitutes the non-null values with the expression 'Known':
SELECT customer_name,
NVL2(title, 'Known', 'Withheld') as title
FROM customer_dimension
ORDER BY title;
customer_name | title
------------------------+-------
Alexander I. Lang | Known
Steve S. Harris | Known
Daniel R. King | Known
Luigi I. Sanchez | Known
Duncan U. Carcetti | Known
Meghan K. Li | Known
Laura B. Perkins | Known
Samantha V. Robinson | Known
Joseph P. Wilson | Known
Kevin R. Miller | Known
Lauren D. Nguyen | Known
Emily E. Goldberg | Known
Darlene K. Harris | Known
Meghan J. Farmer | Known
Bettercare | Withheld
Ameristar | Withheld
Initech | Withheld
(17 rows)

See Also
Case Expressions (page 73)
COALESCE (page 191)
NVL (page 191)

-196-
197

String Functions
String functions perform conversion, extraction, or manipulation operations on strings, or return
information about strings.
This section describes functions and operators for examining and manipulating string values.
Strings in this context include values of the types CHAR, VARCHAR, BINARY, and
VARBINARY.
Unless otherwise noted, all of the functions listed below work on all of these types, but be wary
of potential effects of the automatic padding when using the CHARACTER type. Generally, the
functions described here also work on data of non-string types by converting that data to a string
representation first. Some functions also exist natively for the bit-string types.
Note: The string functions do not handle multibyte UTF-8 sequences correctly. They treat
each byte as a character.
BINARY implicitly converts to VARBINARY, so functions that take VARBINARY arguments work
with BINARY.

ASCII
Converts the first 8-bit byte of a VARCHAR to an INTEGER.

Syntax
ASCII ( expression )

Parameters
expression (VARCHAR) is the string to convert.

Notes
ASCII is the opposite of the CHR (page 201) function.

Examples

Expression Result
SELECT ASCII('A'); 65
SELECT ASCII('ab'); 97
SELECT ASCII(null);
SELECT ASCII('');

-197-
SQL Reference Manual

BIT_LENGTH
Returns the length of the string expression in bits (bytes * 8) as an INTEGER.

Syntax
BIT_LENGTH( expression )

Parameters
expression (CHAR or VARCHAR or BINARY or VARBINARY)
is the string to convert.

Notes
BIT_LENGTH applies to the contents of VARCHAR and VARBINARY fields.

Examples

Expression Result
SELECT BIT_LENGTH('abc'::varbinary); 24
SELECT BIT_LENGTH('abc'::binary); 8
SELECT BIT_LENGTH(''::varbinary); 0
SELECT BIT_LENGTH(''::binary); 8
SELECT BIT_LENGTH(null::varbinary);
SELECT BIT_LENGTH(null::binary);
SELECT BIT_LENGTH(VARCHAR 'abc'); 24
SELECT BIT_LENGTH(CHAR 'abc'); 24
SELECT BIT_LENGTH(CHAR(6) 'abc'); 48
SELECT BIT_LENGTH(VARCHAR(6) 'abc'); 24
SELECT BIT_LENGTH(BINARY(6) 'abc'); 48
SELECT BIT_LENGTH(BINARY 'abc'); 24
SELECT BIT_LENGTH(VARBINARY 'abc'); 24
SELECT BIT_LENGTH(VARBINARY(6) 'abc'); 24

See Also
CHARACTER_LENGTH (page 200), LENGTH (page 211), OCTET_LENGTH (page 214)

BITCOUNT
Returns the number of one-bits (sometimes referred to as set-bits) in the given VARBINARY
value. This is also referred to as the population count.

-198-
SQL Functions

Syntax
BITCOUNT ( expression )

Parameters
expression (BINARY or VARBINARY) is the string to return.

Examples
SELECT BITCOUNT(HEX_TO_BINARY('0x10'));
bitcount
----------
1
(1 row)
SELECT BITCOUNT(HEX_TO_BINARY('0xF0'));
bitcount
----------
4
(1 row)
SELECT BITCOUNT(HEX_TO_BINARY('0xAB'))
bitcount
----------
5
(1 row)

BITSTRING_TO_BINARY
Translates the given VARCHAR bitstring representation into a VARBINARY value.

Syntax
BITSTRING_TO_BINARY ( expression )

Parameters
expression (VARCHAR) is the string to return.

Notes
VARBINARY BITSTRING_TO_BINARY(VARCHAR) converts data from character type (in
bitstring format) to binary type. This function is the inverse of TO_BITSTRING.
BITSTRING_TO_BINARY(TO_BITSTRING(x)) = x)
TO_BITSTRING(BITSTRING_TO_BINARY(x)) = x)

Examples
If there are an odd number of characters in the hex value, then the first character is treated as
the low nibble of the first (furthest to the left) byte.

-199-
SQL Reference Manual

SELECT BITSTRING_TO_BINARY('0110000101100010');
bitstring_to_binary
---------------------
ab
(1 row)
If an invalid bitstring is supplied, the system returns an error:
SELECT BITSTRING_TO_BINARY('010102010');
ERROR: invalid bitstring "010102010"

BTRIM
Removes the longest string consisting only of specified characters from the start and end of a
string.

Syntax
BTRIM ( expression [ , characters-to-remove ] )

Parameters
expression (CHAR or VARCHAR) is the string to modify
characters-to-remove (CHAR or VARCHAR) specifies the characters to
remove. The default is the space character.

Examples
SELECT BTRIM('xyxtrimyyx', 'xy');
btrim
-------
trim
(1 row)

See Also
LTRIM (page 213), RTRIM (page 220), TRIM (page 226)

CHARACTER_LENGTH
Returns an INTEGER value representing the number of characters in a string. It strips the
padding from CHAR expressions but not from VARCHAR expressions.

Syntax
[ CHAR_LENGTH | CHARACTER_LENGTH ] ( expression )

Parameters
expression (CHAR or VARCHAR) is the string to measure

Notes
CHARACTER_LENGTH is identical to LENGTH (page 211). See BIT_LENGTH (page 198) and
OCTET_LENGTH (page 214) for similar functions.

-200-
SQL Functions

Examples
SELECT CHAR_LENGTH('1234 '::CHAR(10));
char_length
-------------
4
(1 row)
SELECT CHAR_LENGTH('1234 '::VARCHAR(10));
char_length
-------------
6
(1 row)
SELECT CHAR_LENGTH(NULL::CHAR(10)) IS NULL;
?column?
----------
t
(1 row)

CHR
Converts an INTEGER to a 1-byte VARCHAR.

Syntax
CHR( expression )

Parameters
expression (INTEGER) is masked to a single byte.

Notes
CHR is the opposite of the ASCII (page 197) function.

Examples

Expression Result
SELECT CHR(65); A
SELECT CHR(65+32); a
SELECT CHR(null);

CLIENT_ENCODING
Returns a VARCHAR value representing the character set encoding of the client system.

Syntax
CLIENT_ENCODING()

-201-
SQL Reference Manual

Notes
• Vertica supports the UTF-8 character set.
• CLIENT_ENCODING returns the same value as the vsql meta-command \encoding and
variable ENCODING

Examples
SELECT CLIENT_ENCODING();
client_encoding
-----------------
UTF-8
(1 row)

DECODE
Compares expr to each search value one by one. If expr is equal to a search, the function
returns the corresponding result. If no match is found, the function returns default. If default is
omitted, the function returns null.

Syntax
SELECT DECODE( expr, search, result[ , search, result ]...[, default ] );

Parameters
expression Is the value to compare.
search Is the value compared against expression.
result Is the value returned, if expression is equal to search.
default Is optional. If no matches are found, DECODE returns default. If
default is omitted, then DECODE returns NULL (if no matches are
found).

Usage
DECODE is similar to the IF-THEN-ELSE and CASE (page 73) expression:
CASE expr
WHEN search THEN result
[WHEN search THEN result]
[ELSE default];
Exact type conversion rules for determining the result type (of a CASE expression) are according
to SQL92, clause 9.3, “Set operation result data types.” The result types of individual results are
promoted to the least common type that can be used to represent all of them. This leads to a
character string type, an exact numeric type, an approximate numeric type, or a DATETIME type,
where all the various result arguments must be of the same type grouping.

Examples
The following example converts numeric values in the weight column from the
product_dimension table to descriptive values in the output.

-202-
SQL Functions

SELECT product_description, DECODE(weight,


2, 'Light',
50, 'Medium',
71, 'Heavy',
99, 'Call for help',
'N/A')
FROM product_dimension
WHERE category_description = 'Food'
AND department_description = 'Canned Goods'
AND sku_number BETWEEN 'SKU-#49750' AND 'SKU-#49999'
LIMIT 15;

product_description | case
-----------------------------------+---------------
Brand #499 canned corn | N/A
Brand #49900 fruit cocktail | Medium
Brand #49837 canned tomatoes | Heavy
Brand #49782 canned peaches | N/A
Brand #49805 chicken noodle soup | N/A
Brand #49944 canned chicken broth | N/A
Brand #49819 canned chili | N/A
Brand #49848 baked beans | N/A
Brand #49989 minestrone soup | N/A
Brand #49778 canned peaches | N/A
Brand #49770 canned peaches | N/A
Brand #4977 fruit cocktail | N/A
Brand #49933 canned olives | N/A
Brand #49750 canned olives | Call for help
Brand #49777 canned tomatoes | N/A
(15 rows)

GREATEST
Returns the largest value in a list of expressions.

Syntax
GREATEST( expr1, expr2, ... expr_n );

Parameters
expr1, expr2, and expr_n are the expressions to be evaluated.

Notes
• Works for all data types, and implicitly casts similar types. See Examples.
• A NULL value in any one of the expressions returns NULL.

Examples
This example returns 9 as the greatest in the list of expressions:
SELECT GREATEST(7, 5, 9);
greatest
----------
9

-203-
SQL Reference Manual

(1 row)
Note that putting quotes around the integer expressions returns the same result as the first
example:
SELECT GREATEST('7', '5', '9');
greatest
----------
9
(1 row)
The next example returns FLOAT 1.5 as the greatest because the integer is implicitly cast to
float:
SELECT GREATEST(1, 1.5);
greatest
----------
1.5
(1 row)
The following example returns 'vertica' as the greatest:
SELECT GREATEST('vertica', 'analytic', 'database');
greatest
----------
vertica
(1 row)
Notice this next command returns NULL:
SELECT GREATEST('vertica', 'analytic', 'database', null);
greatest
----------

(1 row)
And one more:
SELECT GREATEST('sit', 'site', 'sight');
greatest
----------
site
(1 row)

See Also
LEAST (page 209)

HEX_TO_BINARY
Translates the given VARCHAR hexadecimal representation into a VARBINARY value.

Syntax
HEX_TO_BINARY( [ 0x ] expression )

Parameters
expression (BINARY or VARBINARY) is the string to translate.

-204-
SQL Functions

0x Is optional prefix

Notes
VARBINARY HEX_TO_BINARY(VARCHAR) converts data from character type in hexadecimal
format to binary type. This function is the inverse of TO_HEX (page 225).
HEX_TO_BINARY(TO_HEX(x)) = x)
TO_HEX(HEX_TO_BINARY(x)) = x)
If there are an odd number of characters in the hexadecimal value, the first character is treated
as the low nibble of the first (furthest to the left) byte.

Examples
If the given string begins with "0x" the prefix is ignored. For example:
SELECT HEX_TO_BINARY('0x6162') AS hex1, HEX_TO_BINARY('6162') AS hex2;
hex1 | hex2
------+------
ab | ab
(1 row)
If an invalid hex value is given, Vertica returns an “invalid binary representation" error; for
example:
SELECT HEX_TO_BINARY('0xffgf');
ERROR: invalid hex string "0xffgf"

See Also
TO_HEX (page 225)

INET_ATON
Given the dotted-quad representation of a network address as a string, returns an integer that
represents the value of the address in host byte order.

Syntax
INET_ATON( expression )

Parameters
expression (VARCHAR) is the string to convert.

Usage
The following syntax converts an IPv4 address represented as the string A to an integer I.
INET_ATON trims any spaces from the right of A, calls the Linux function inet_pton
http://www.opengroup.org/onlinepubs/000095399/functions/inet_ntop.html, and converts the
result from network byte order to host byte order using ntohl
http://opengroup.org/onlinepubs/007908775/xns/ntohl.html.
INET_ATON(VARCHAR A) -> INT8 I

-205-
SQL Reference Manual

If A is NULL, too long, or inet_pton returns an error, the result is NULL.

Examples
The generated number is always in host byte order. In the following example, the number is
calculated as 209×256^3 + 207×256^2 + 224×256 + 40.
SELECT INET_ATON('209.207.224.40');
inet_aton
------------
3520061480
(1 row)
SELECT INET_ATON('1.2.3.4');
inet_aton
-----------
16909060
(1 row)
SELECT TO_HEX(INET_ATON('1.2.3.4'));
to_hex
---------
1020304
(1 row)

See Also
INET_NTOA (page 206)

INET_NTOA
Given a network address as an integer in network byte order, returns the dotted-quad
representation of the address as a VARCHAR.

Syntax
INET_NTOA( expression )

Parameters
expression (INTEGER) is the network address to convert.

Usage
The following syntax converts an IPv4 address represented as integer I to a string A.
INET_NTOA converts I from host byte order to network byte order using htonl
http://opengroup.org/onlinepubs/007908775/xns/htonl.html, and calls the Linux function
inet_ntop http://www.opengroup.org/onlinepubs/000095399/functions/inet_ntop.html.
INET_NTOA(INT8 I) -> VARCHAR A
If I is NULL, greater than 2^32 or negative, the result is NULL.

Examples
SELECT INET_NTOA(16909060);

-206-
SQL Functions

inet_ntoa
-----------
1.2.3.4
(1 row)
SELECT INET_NTOA(03021962);
inet_ntoa
-------------
0.46.28.138
(1 row)

See Also
INET_ATON (page 205)

INITCAP
Capitalizes first letter of each alphanumeric word and puts the rest in lowercase.

Syntax
INITCAP( expression )

Parameters
expression (VARCHAR) is the string to format.

Examples

Expression Result
SELECT INITCAP('high speed database'); High Speed Database
SELECT INITCAP('LINUX TUTORIAL'); Linux Tutorial
SELECT INITCAP('abc DEF 123aVC 124Btd,lAsT'); Abc Def 123avc
124btd,Last
SELECT INITCAP('');
SELECT INITCAP(null);

INSTR
Searches string for substring and returns an integer indicating the position of the character in
string that is the first character of this occurrence.

Syntax
SELECT INSTR( string , substring [, position [, occurrence ] ] )

Parameters
string Is the text expression to search.
substring Is the string to search for.

-207-
SQL Reference Manual

position Is a nonzero integer indicating the character of string where Vertica


begins the search. If position is negative, then Vertica counts
backward from the end of string and then searches backward from
the resulting position. The first character of string occupies the
default position 1, and position cannot be 0.
occurrence Is an integer indicating which occurrence of string Vertica should
search. The value of occurrence must be positive (greater than 0),
and the default is 1.

Usage
Both position and occurrence must be of types that can resolve to an integer. The default values
of both parameters are 1, meaning Vertica begins searching at the first character of string for the
first occurrence of substring. The return value is relative to the beginning of string, regardless of
the value of position, and is expressed in characters.
If the search is unsuccessful (that is, if substring does not appear occurrence times after the
position character of string, then the return value is 0.

Examples
The first example searches forward in string ‘abc’ for substring ‘b’. The search returns the
position in ‘abc’ where ‘b’ occurs, or position 2. Because no position parameters are given, the
default search starts at ‘a’, position 1.
SELECT INSTR('abc', 'b');
instr
-------
2
(1 row)
The following three examples use character position to search backward to find the position of a
substring.
Note: Although it seems intuitive that the result should be a negative integer, the position of
n occurrence is read left to right in the sting, even though the search happens in reverse
(from the end — or right side — of the string).
In the first example, the function counts backward one character from the end of the string,
starting with character ‘c’. The function then searches backward for the first occurrence of ‘a’,
which it finds it in the first position in the search string.
SELECT INSTR('abc', 'a', -1);
instr
-------
1
(1 row)
In the second example, the function counts backward one character from the end of the string,
starting with character ‘b’, and searches backward for substring ‘bc’, which it finds in the second
position of the search string.
SELECT INSTR('abcb', 'bc', -1);
instr
-------

-208-
SQL Functions

2
(1 row)
In the third example, the function counts backward one character from the end of the string,
starting with character ‘b’, and searches backward for substring ‘bcef’, which it does not find. The
result is 0.
SELECT INSTR('abcb', 'bcef', -1);
instr
-------
0
(1 row)

LEAST
Returns the smallest value in a list of expressions.

Syntax
LEAST( expr1, expr2, ... expr_n );

Parameters
expr1, expr2, and expr_n are the expressions to be evaluated.

Notes
• Works for all data types, and implicitly casts similar types. See Examples.
• A NULL value in any one of the expressions returns NULL.

Examples
This example returns 5 as the least:
SELECT LEAST(7, 5, 9);
least
-------
5
(1 row)
Note that putting quotes around the integer expressions returns the same result as the first
example:
SELECT LEAST('7', '5', '9');
least
-------
5
(1 row)
In the above example, the values are being compared as strings, so '10' would be less than '2'.
The next example returns 1.5, as INTEGER 2 is implicitly cast to FLOAT:
SELECT LEAST(2, 1.5);
least
-------
1.5

-209-
SQL Reference Manual

(1 row)
The following example returns 'analytic' as the least:
SELECT LEAST('vertica', 'analytic', 'database');
least
----------
analytic
(1 row)
Notice this next command returns NULL:
SELECT LEAST('vertica', 'analytic', 'database', null);
least
-------

(1 row)
And one more:
SELECT LEAST('sit', 'site', 'sight');
least
-------
sight
(1 row)

See Also
GREATEST (page 203)

LEFT
Returns the length leftmost character string or binary data type depending on the data type of the
given input.

Syntax
LEFT ( string , length )

Parameters
string (CHAR or VARCHAR or BINARY or VARBINARY) is the string
to return.
length Is an INTEGER value that specifies the count of characters or
bytes to return.

Examples
SELECT LEFT('vertica', 3);
left
------
ver
(1 row)

-210-
SQL Functions

SELECT LEFT(HEX_TO_BINARY('0x6162'), 2);


left
------
ab
(1 row)
SELECT LEFT(TO_BITSTRING(HEX_TO_BINARY('0x10')), 4);
left
------
0001
(1 row)

See Also
SUBSTR (page 222)

LENGTH
Takes one argument as an input and returns returns an INTEGER value representing the
number of characters in a string.

Syntax
LENGTH ( expression )

Parameters
expression (CHAR or VARCHAR or BINARY or VARBINARY) is the string
to measure

Notes
LENGTH strips the padding from CHAR expressions but not from VARCHAR expressions.
LENGTH is identical to CHARACTER_LENGTH (page 200). See BIT_LENGTH (page 198) and
OCTET_LENGTH (page 214) for similar functions.

Examples

Expression Result
SELECT LENGTH('1234 '::CHAR(10)); 4
SELECT LENGTH('1234 '::VARCHAR(10)); 6
SELECT LENGTH('1234 '::BINARY(10)); 10
SELECT LENGTH('1234 '::VARBINARY(10)); 6
SELECT LENGTH(NULL::CHAR(10)) IS NULL; t

LOWER
Returns a VARCHAR value containing the argument converted to lower case letters.

-211-
SQL Reference Manual

Syntax
LOWER ( expression )

Parameters
expression (CHAR or VARCHAR) is the string to convert

Examples
SELECT LOWER('AbCdEfG');
lower
----------
abcdefg
(1 row)
SELECT LOWER('The Cat In The Hat');
lower
--------------------
the cat in the hat
(1 row)

LPAD
Returns a VARCHAR value representing a string of a specific length filled on the left with specific
characters.

Syntax
LPAD( expression , length [ , fill ] )

Parameters
expression (CHAR OR VARCHAR) specifies the string to fill
length (INTEGER) specifies the number of characters to return
fill (CHAR OR VARCHAR) specifies the repeating string of characters with
which to fill the output string. The default is the space character.

Examples
SELECT LPAD('database', 15, 'xzy');
lpad
-----------------
xzyxzyxdatabase
(1 row)
If the string is already longer than the specified length it is truncated on the right:
SELECT LPAD('establishment', 10, 'abc');
lpad
------------
establishm
(1 row)

-212-
SQL Functions

LTRIM
Returns a VARCHAR value representing a string with leading blanks removed from the left side
(beginning).

Syntax
LTRIM ( expression [ , characters ] )

Parameters
expression (CHAR or VARCHAR) is the string to trim
characters (CHAR or VARCHAR) specifies the characters to remove from
the left side of expression. The default is the space character.

Examples

SELECT LTRIM('zzzyyyyyyxxxxxxxxtrim', 'xyz');


ltrim
-------
trim
(1 row)

See Also
BTRIM (page 200), RTRIM (page 220), TRIM (page 226)

MD5
Calculates the MD5 hash of string, returning the result as a VARCHAR string in hexadecimal.

Syntax
MD5( string )

Parameters
string Is the argument string.

Examples
SELECT MD5('123');
md5
----------------------------------
202cb962ac59075b964b07152d234b70
(1 row)

SELECT MD5('Vertica'::bytea);
md5
----------------------------------
fc45b815747d8236f9f6fdb9c2c3f676
(1 row)

-213-
SQL Reference Manual

OCTET_LENGTH
Returns an INTEGER value representing the maximum number of bytes in a string for CHAR
and BINARY; returns the current actual number of bytes for VARCHAR and VARBINARY.

Syntax
OCTET_LENGTH ( expression )

Parameters
expression (CHAR or VARCHAR or BINARY or VARBINARY) is the string
to measure

Examples

Expression Result
SELECT OCTET_LENGTH(CHAR(10) '1234 '); 10
SELECT OCTET_LENGTH(VARCHAR(10) '1234 '); 6
SELECT OCTET_LENGTH('abc'::VARBINARY); 3
SELECT OCTET_LENGTH(VARBINARY 'abc'); 3
SELECT OCTET_LENGTH(BINARY(6) 'abc'); 6
SELECT OCTET_LENGTH(VARBINARY ''); 0
SELECT OCTET_LENGTH(''::BINARY); 1
SELECT OCTET_LENGTH(null::VARBINARY);
SELECT OCTET_LENGTH(null::BINARY);

See Also
BIT_LENGTH (page 198), CHARACTER_LENGTH (page 200), LENGTH (page 211)

OVERLAY
Returns a VARCHAR value representing a string having had a substring replaced by another
string.

Syntax
OVERLAY ( expression1 PLACING expression2 FROM position [ FOR extent ] )

Parameters
expression1 (CHAR or VARCHAR) is the string to process
expression2 (CHAR or VARCHAR) is the substring to overlay
position (INTEGER) is the character position (counting from one) at which to

-214-
SQL Functions

begin the overlay


extent (INTEGER) specifies the number of characters to replace with the overlay

Examples
SELECT OVERLAY('123456789' PLACING 'xxx' FROM 2);
overlay
-----------
1xxx56789
(1 row)
SELECT OVERLAY('123456789' PLACING 'xxx' FROM 2 FOR 4);
overlay
----------
1xxx6789
(1 row)
SELECT OVERLAY('123456789' PLACING 'xxx' FROM 2 FOR 5);
overlay
---------
1xxx789
(1 row)
SELECT OVERLAY('123456789' PLACING 'xxx' FROM 2 FOR 6);
overlay
---------
1xxx89
(1 row)

POSITION
Returns an INTEGER values representing the location of a specified substring with a string
(counting from one).

Syntax
POSITION ( substring IN string )

Parameters
substring (CHAR or VARCHAR) is the substring to locate
string (CHAR or VARCHAR) is the string in which to locate the substring

Notes
POSITION is identical to STRPOS (page 222) except for the order of the arguments.

Examples
SELECT POSITION('3' IN '1234');
position
----------
3
(1 row)

-215-
SQL Reference Manual

QUOTE_IDENT
Returns the given string, suitably quoted, to be used as an identifier (page 55) in a SQL
statement string. Quotes are added only if necessary; that is, if the string contains non-identifier
characters, is a SQL keyword (page 51), such as "1time", "Next week" and "Select". Embedded
double quotes are doubled.

Syntax
QUOTE_IDENT( string )

Parameters
string Is the argument string.

Notes
• SQL identifiers, such as table and column names, are stored as created, and references to
them are resolved using case-insensitive compares. Thus, you do not need to double-quote
mixed-case identifiers.
• Vertica quotes all currently-reserved keywords, even those not currently being used.

Examples
Quoted identifiers are case-insensitive, and Vertica does not supply the quotes:
SELECT QUOTE_IDENT('VErtIcA');
QUOTE_IDENT
-------------
VErtIcA
(1 row)

SELECT QUOTE_IDENT('Vertica database');


QUOTE_IDENT
--------------------
"Vertica database"
(1 row)
Embedded double quotes are doubled:
SELECT QUOTE_IDENT('Vertica "!" database');
QUOTE_IDENT
-------------------------
"Vertica ""!"" database"
(1 row)
The following example uses the SQL keyword, SELECT; results are double quoted:
SELECT QUOTE_IDENT('select');
QUOTE_IDENT
-------------
"select"
(1 row)

-216-
SQL Functions

QUOTE_LITERAL
Returns the given string, suitably quoted, to be used as a string literal in a SQL statement string.
Embedded single quotes and backslashes are doubled.

Syntax
QUOTE_LITERAL( string )

Parameters
string Is the argument string.

Notes
Vertica's use of backslashes in this context is not SQL compliant and is subject to change.

Examples
mydb=> SELECT QUOTE_LITERAL('O''Reilly');
quote_literal
---------------
'O''Reilly'
(1 row)

REPEAT
Given a value and a count this function returns a VARCHAR or VARBINARY value that repeats
the given value COUNT times.
If the return value is truncated the given value might not be repeated count times, and the last
occurrence of the given value might be truncated.

Syntax
REPEAT ( string , repetitions )

Parameters
string (CHAR or VARCHAR or BINARY or VARBINARY) is the
string to repeat
repetitions (INTEGER) is the number of times to repeat the string

-217-
SQL Reference Manual

Notes
If the repetitions field depends on the contents of a column (is not a constant), then the repeat
operator maximum length is 65000 bytes. You can add a cast of the repeat to cast the result
down to a size big enough for your purposes (reflects the actual maximum size) so you can do
other things with the result.
If you run the following example, you get an error message:
SELECT '123456' || REPEAT('a', colx);
ERROR: Operator || may give a 65006-byte Varchar result; the limit is 65000 bytes.
If you know that colx can never be greater than 3, the solution is to add a cast (::VARCHAR(3)):
SELECT '123456' || REPEAT('a', colx)::VARCHAR(3);
If colx is greater than 3, the repeat is truncated to exactly three (3) a's.

Examples
SELECT REPEAT ('1234', 5);
repeat
----------------------
12341234123412341234
(1 row)
SELECT REPEAT ('vmart', 3);
repeat
-----------------
vmartvmartvmart
(1 row)

REPLACE
Replaces all occurrences of characters in a string with another set of characters.

Syntax
REPLACE ( string , target , replacement )

Parameters
string (CHAR OR VARCHAR) is the string to which to perform the replacement
target (CHAR OR VARCHAR) is the string to replace
replacement (CHAR OR VARCHAR) is the string with which to replace the target

Examples
SELECT REPLACE('Documentation%20Library', '%20', ' ');

-218-
SQL Functions

replace
-----------------------
Documentation Library
(1 row)
SELECT REPLACE('This & That', '&', 'and');
replace
---------------
This and That
(1 row)

RIGHT
Returns the length rightmost character string or binary data type depending on the data type of
the given input.

Syntax
RIGHT( string , length )

Parameters
string (CHAR or VARCHAR or BINARY or VARBINARY) is the string
to return.
length Is an INTEGER value that specifies the count of characters or
bytes to return.

Examples
The following command returns the last three characters of the string 'vertica':
SELECT RIGHT('vertica', 3);
right
-------
ica
(1 row)
SELECT RIGHT('ab'::binary(4), 2);
right
----------
\000\000
(1 row)
SELECT RIGHT(TO_BITSTRING(HEX_TO_BINARY('0x10')), 4);
right
-------
0000
(1 row)

See Also
SUBSTR (page 222)

-219-
SQL Reference Manual

RPAD
Returns a VARCHAR value representing a string of a specific length filled on the right with
specific characters.

Syntax
RPAD ( expression , length [ , fill ] )

Parameters
expression (CHAR OR VARCHAR) specifies the string to fill
length (INTEGER) specifies the number of characters to return
fill (CHAR OR VARCHAR) specifies the repeating string of characters with
which to fill the output string. The default is the space character.

Examples
SELECT RPAD('database', 15, 'xzy');
rpad
-----------------
databasexzyxzyx
(1 row)
If the string is already longer than the specified length it is truncated on the right:
SELECT RPAD('database', 6, 'xzy');
rpad
--------
databa
(1 row)

RTRIM
Returns a VARCHAR value representing a string with trailing blanks removed from the right side
(end).

Syntax
RTRIM ( expression [ , characters ] )

Parameters
expression (CHAR or VARCHAR) is the string to trim
characters (CHAR or VARCHAR) specifies the characters to remove from
the right side of expression. The default is the space
character.

Examples
SELECT RTRIM('trimzzzyyyyyyxxxxxxxx', 'xyz');
ltrim
-------
trim

-220-
SQL Functions

(1 row)

See Also
BTRIM (page 200), LTRIM (page 213), TRIM (page 226)

SPLIT_PART
Splits string on the delimiter and returns the given field (counting from one).

Syntax
SPLIT_PART( string , delimiter , field )

Parameters
string Is the argument string.
delimiter Is the given delimiter.
field (INTEGER) is the number of the part to return.

Examples
The specified integer of 2 returns the second string, or def.
SELECT SPLIT_PART('abc~@~def~@~ghi', '~@~', 2);
split_part
------------
def
(1 row)
Here, we specify 3, which returns the third string, or 789.
SELECT SPLIT_PART('123~|~456~|~789', '~|~', 3);
split_part
------------
789
(1 row)
Note that the tildas are for readability only. Omitting them returns the same results:
SELECT SPLIT_PART('123|456|789', '|', 3);
split_part
------------
789
(1 row)
See what happens if you specify an integer that exceeds the number of strings: No results.
SELECT SPLIT_PART('123|456|789', '|', 4);
split_part
------------

(1 row)
The above result is not null, it is the empty string.

-221-
SQL Reference Manual

SELECT SPLIT_PART('123|456|789', '|', 4) IS NULL;


?column?
----------
f
(1 row)
If SPLIT_PART had returned NULL, LENGTH would have returned null.
SELECT LENGTH (SPLIT_PART('123|456|789', '|', 4));
length
--------
0
(1 row)

STRPOS
Returns an INTEGER value representing the location of a specified substring within a string
(counting from one).

Syntax
STRPOS ( string , substring )

Parameters
string (CHAR or VARCHAR) is the string in which to locate the substring
substring (CHAR or VARCHAR) is the substring to locate

Notes
STRPOS is identical to POSITION (page 215) except for the order of the arguments.

Examples
SELECT STRPOS('1234','3');
strpos
--------
3
(1 row)

SUBSTR
SUBSTR returns a VARCHAR value representing a substring of a specified string.

Syntax
SUBSTR ( string , position [ , extent ] )

Parameters
string (CHAR or VARCHAR or BINARY or VARBINARY) is the string from
which to extract a substring.
position (INTEGER) is the starting position of the substring (counting from

-222-
SQL Functions

one).
extent (INTEGER) is the length of the substring to extract. The default is
the end of the string.

Notes
SUBSTR performs the same function as SUBSTRING (page 223). The only difference is the
syntax allowed.

Examples
SELECT SUBSTR('123456789', 3, 2);
substr
--------
34
(1 row)
SELECT SUBSTR('123456789', 3);
substr
---------
3456789
(1 row)
SELECT SUBSTR(TO_BITSTRING(HEX_TO_BINARY('0x10')), 2, 2);
substr
--------
00
(1 row)
SELECT SUBSTR(TO_HEX('10010'), 2, 2);
substr
--------
71
(1 row)

SUBSTRING
Given a value, a position, and an optional length, returns a value representing a substring of the
specified string at the given position.

Syntax
SUBSTRING ( string , position [ , length ] )
SUBSTRING ( string FROM position [ FOR length ] )

Parameters
string (CHAR or VARCHAR or BINARY or VARBINARY) is the string from
which to extract a substring
position (INTEGER) is the starting position of the substring (counting from
one). If position is greater than the length of the given value, an
empty value is returned.

-223-
SQL Reference Manual

length (INTEGER) is the length of the substring to extract. The default is


the end of the string.If a length is given the result is at most that
many bytes. The maximum length is the length of the given value
less the given position. If no length is given or if the given length is
greater than the maximum length then the length is set to the
maximum length.

Notes
SUBSTRING performs the same function as SUBSTR (page 222). The only difference is the
syntax allowed.
Neither length nor position can be negative, and the position cannot be zero because it is one
based. If these forms are violated, the system returns an error:
SELECT SUBSTRING('ab'::binary(2), -1, 2);
ERROR: negative or zero substring start position not allowed

Examples
SELECT SUBSTRING('123456789', 3, 2);
substring
-----------
34
(1 row)
SELECT SUBSTRING('123456789' FROM 3 FOR 2);
substring
-----------
34
(1 row)

TO_BITSTRING
Returns a VARCHAR that represents the given VARBINARY value in bitstring format

Syntax
TO_BITSTRING( expression )

Parameters
expression (VARCHAR) is the string to return.

Notes
VARCHAR TO_BITSTRING(VARBINARY) converts data from binary type to character type
(where the character representation is the bitstring format). This function is the inverse of
BITSTRING_TO_BINARY:
TO_BITSTRING(BITSTRING_TO_BINARY(x)) = x)
BITSTRING_TO_BINARY(TO_BITSTRING(x)) = x)

-224-
SQL Functions

Examples
SELECT TO_BITSTRING('ab'::BINARY(2));
to_bitstring
------------------
0110000101100010
(1 row)
SELECT TO_BITSTRING(HEX_TO_BINARY('0x10'));
to_bitstring
--------------
00010000
(1 row)
SELECT TO_BITSTRING(HEX_TO_BINARY('0xF0'));
to_bitstring
--------------
11110000
(1 row)

See Also
BITCOUNT (page 198) and BITSTRING_TO_BINARY (page 199)

TO_HEX
Returns a VARCHAR or VARBINARY representing the hexadecimal equivalent of a number.

Syntax
TO_HEX ( number )

Parameters
number (INTEGER) is the number to convert to hexadecimal

Notes
VARCHAR TO_HEX(INTEGER) and VARCHAR TO_HEX(VARBINARY) are similar. The
function converts data from binary type to character type (where the character representation is
in hexadecimal format). This function is the inverse of HEX_TO_BINARY.
TO_HEX(HEX_TO_BINARY(x)) = x).
HEX_TO_BINARY(TO_HEX(x)) = x).

Examples
SELECT TO_HEX(123456789);
to_hex
---------
75bcd15
(1 row)
For VARBINARY inputs, the returned value is not preceded by "0x". For example:
SELECT TO_HEX('ab'::binary(2));
to_hex

-225-
SQL Reference Manual

--------
6162
(1 row)

TRANSLATE
Replaces individual characters in string_to_replace with other characters.

Syntax
SELECT TRANSLATE ( string_to_replace , from_string , to_string );

Parameters
string_to_replace Is the string to be translated.
from_string Contains characters that should be replaced in
string_to_replace.
to_string Any character in string_to_replace that matches a
character in from_string is replaced by the corresponding
character in to_string.

Example
SELECT TRANSLATE('12345', '14', 'zq');
translate
-----------
z23q5
(1 row)
SELECT TRANSLATE('simple', 'i', 'a');
translate
-----------
sample
(1 row)

TRIM
Combines the BTRIM, LTRIM, and RTRIM functions into a single function.

Syntax
TRIM ( [ [ LEADING | TRAILING | BOTH ] characters FROM ] expression )

Parameters
LEADING Removes the specified characters from the left side of the string
TRAILING Removes the specified characters from the right side of the string
BOTH Removes the specified characters from both sides of the string (default)

-226-
SQL Functions

characters (CHAR or VARCHAR) specifies the characters to remove from


expression. The default is the space character.
expression (CHAR or VARCHAR) is the string to trim

Examples
SELECT '-' || TRIM(LEADING 'x' FROM 'xxdatabasexx') || '-';
?column?
--------------
-databasexx-
(1 row)
SELECT '-' || TRIM(TRAILING 'x' FROM 'xxdatabasexx') || '-';
?column?
--------------
-xxdatabase-
(1 row)
SELECT '-' || TRIM(BOTH 'x' FROM 'xxdatabasexx') || '-';
?column?
------------
-database-
(1 row)
SELECT '-' || TRIM('x' FROM 'xxdatabasexx') || '-';
?column?
------------
-database-
(1 row)
SELECT '-' || TRIM(LEADING FROM ' database ') || '-';
?column?
--------------
-database -
(1 row)
SELECT '-' || TRIM(' database ') || '-';
?column?
------------
-database-
(1 row)

See Also
BTRIM (page 200), LTRIM (page 213), RTRIM (page 220)

UPPER
Returns a VARCHAR value containing the argument converted to upper case letters.

Syntax
UPPER ( expression )

Parameters
expression (CHAR or VARCHAR) is the string to convert

-227-
SQL Reference Manual

Examples

SELECT UPPER('AbCdEfG');
upper
----------
ABCDEFG
(1 row)

V6_ATON
Converts an IPv6 address represented as a character string to a binary string.

Syntax
V6_ATON ( expression )

Parameters
expression (VARCHAR) is the string to convert.

Notes
The following syntax converts an IPv6 address represented as the character string A to a binary
string B.
V6_ATON trims any spaces from the right of A and calls the Linux function inet_pton
http://www.opengroup.org/onlinepubs/000095399/functions/inet_ntop.html.
V6_ATON(VARCHAR A) -> VARBINARY(16) B
If A has no colons it is prepended with '::ffff:'. If A is NULL, too long, or if inet_pton returns an
error, the result is NULL.

Examples
SELECT V6_ATON('2001:DB8::8:800:200C:417A');
v6_aton
------------------------------------------------------
\001\015\270\000\000\000\000\000\010\010\000 \014Az
(1 row)
SELECT TO_HEX(V6_ATON('2001:DB8::8:800:200C:417A'));
to_hex
----------------------------------
20010db80000000000080800200c417a
(1 row)
SELECT V6_ATON('1.2.3.4');
v6_aton
------------------------------------------------------------------
\000\000\000\000\000\000\000\000\000\000\377\377\001\002\003\004
(1 row)
SELECT V6_ATON('::1.2.3.4');
v6_aton
------------------------------------------------------------------
\000\000\000\000\000\000\000\000\000\000\000\000\001\002\003\004

-228-
SQL Functions

(1 row)

See Also
V6_NTOA (page 229)

V6_NTOA
Converts an IPv6 address represented as varbinary to a character string.

Syntax
V6_NTOA ( expression )

Parameters
expression (VARBINARY) is the binary string to convert.

Notes
The following syntax converts an IPv6 address represented as VARBINARY B to a string A.
V6_NTOA right-pads B to 16 bytes with zeros, if necessary, and calls the Linux function
inet_ntop http://www.opengroup.org/onlinepubs/000095399/functions/inet_ntop.html.
V6_NTOA(VARBINARY B) -> VARCHAR A
If B is NULL or longer than 16 bytes, the result is NULL.
Note: Vertica automatically converts the form '::ffff:1.2.3.4' to '1.2.3.4'.

Examples
SELECT V6_NTOA(' \001\015\270\000\000\000\000\000\010\010\000 \014Az');
v6_ntoa
---------------------------
2001:db8::8:800:200c:417a
(1 row)
SELECT V6_NTOA(V6_ATON('1.2.3.4'));
v6_ntoa
---------
1.2.3.4
(1 row)
SELECT V6_NTOA(V6_ATON('::1.2.3.4'));
v6_ntoa
-----------

-229-
SQL Reference Manual

::1.2.3.4
(1 row)

See Also
N6_ATON (page 228)

V6_SUBNETA
Calculates a subnet address in CIDR (Classless Inter-Domain Routing) format from a binary or
alphanumeric IPv6 address.

Syntax
V6_SUBNETA( expression1 , expression2 )

Parameters
expression1 (VARBINARY or VARCHAR) is the string to
calculate.
expression2 (INTEGER) is the size of the subnet.

Notes
The following syntax calculates a subnet address in CIDR format from a binary or varchar IPv6
address.
V6_SUBNETA masks a binary IPv6 address B so that the N leftmost bits form a subnet address,
while the remaining rightmost bits are cleared. It then converts to an alphanumeric IPv6 address,
appending a slash and N.
V6_SUBNETA(BINARY B, INT8 N) -> VARCHAR C
The following syntax calculates a subnet address in CIDR format from an alphanumeric IPv6
address.
V6_SUBNETA(VARCHAR A, INT8 N) -> V6_SUBNETA(V6_ATON(A), N) -> VARCHAR C

Examples
SELECT V6_SUBNETA(V6_ATON('2001:db8::8:800:200c:417a'), 28);
v6_subneta
---------------
2001:db0::/28
(1 row)

See Also
V6_SUBNETN (page 230)

V6_SUBNETN
Calculates a subnet address in CIDR (Classless Inter-Domain Routing) format from a varbinary
or alphanumeric IPv6 address.

-230-
SQL Functions

Syntax
V6_SUBNETN( expression1 , expression2 )

Parameters
expression1 (VARBINARY or VARCHAR or INTEGER) is the
string to calculate.
expression2 (INTEGER) is the size of the subnet.

Notes
The following syntax masks a BINARY IPv6 address B so that the N left-most bits of S form a
subnet address, while the remaining right-most bits are cleared.
V6_SUBNETN right-pads B to 16 bytes with zeros, if necessary and masks B, preserving its
N-bit subnet prefix.
V6_SUBNETN(VARBINARY B, INT8 N) -> VARBINARY(16) S
If B is NULL or longer than 16 bytes, or if N is not between 0 and 128 inclusive, the result is
NULL.
Note: S = [B]/N in Classless Inter-Domain Routing
http://en.wikipedia.org/wiki/Classless_Inter-Domain_Routing notation (CIDR notation).
The following syntax masks an alphanumeric IPv6 address A so that the N leftmost bits form a
subnet address, while the remaining rightmost bits are cleared.
V6_SUBNETN(VARCHAR A, INT8 N) -> V6_SUBNETN(V6_ATON(A), N) -> VARBINARY(16) S

Example
SELECT V6_SUBNETN(V6_ATON('2001:db8::8:800:200c:417a'), 28);
v6_subnetn
---------------------------------------------------------------
\001\015\260\000\000\000\000\000\000\000\000\000\000\000\000

See Also
V6_SUBNETA (page 230)

V6_TYPE
Characterizes a binary or alphanumeric IPv6 address B as an integer type.

Syntax
V6_TYPE( expression )

Parameters
expression (VARBINARY or VARCHAR) is the type to convert.

-231-
SQL Reference Manual

Notes
V6_TYPE(VARBINARY B) returns INT8 T.
V6_TYPE(VARCHAR A) -> V6_TYPE(V6_ATON(A)) -> INT8 T
The IPv6 types are defined in the Network Working Group's IP Version 6 Addressing
Architecture memo http://www.ietf.org/rfc/rfc4291.txt.
GLOBAL = 0 Global unicast addresses
LINKLOCAL = 1 Link-Local unicast (and Private-Use) addresses
LOOPBACK = 2 Loopback
UNSPECIFIED = 3 Unspecified
MULTICAST = 4 Multicast
IPv4-mapped and IPv4-compatible IPv6 addresses are also interpreted, as specified in IPv4
Global Unicast Address Assignments http://www.iana.org/assignments/ipv4-address-space.
• For IPv4, Private-Use is grouped with Link-Local.
• If B is VARBINARY, it is right-padded to 16 bytes with zeros, if necessary.
• If B is NULL or longer than 16 bytes, the result is NULL.

Details
IPv4 (either kind):
0.0.0.0/8 UNSPECIFIED
10.0.0.0/8 LINKLOCAL
127.0.0.0/8 LOOPBACK
169.254.0.0/16 LINKLOCAL
172.16.0.0/12 LINKLOCAL
192.168.0.0/16 LINKLOCAL
224.0.0.0/4 MULTICAST
others GLOBAL
IPv6:
::0/128 UNSPECIFIED
::1/128 LOOPBACK
fe80::/10 LINKLOCAL
ff00::/8 MULTICAST
others GLOBAL

See Also
INET_ATON (page 205)
IP Version 6 Addressing Architecture http://www.ietf.org/rfc/rfc4291.txt
IPv4 Global Unicast Address Assignments
http://www.iana.org/assignments/ipv4-address-space

-232-
SQL Functions

Examples
SELECT V6_TYPE(V6_ATON('192.168.2.10'));
v6_type
---------
1
(1 row)
SELECT V6_TYPE(V6_ATON('2001:db8::8:800:200c:417a'));
v6_type
---------
0
(1 row)

-233-
234

System Information Functions


These functions provide system information regarding user sessions. The superuser has
unrestricted access to all system information, but users can view only information about their
own, current sessions.

CURRENT_DATABASE
Returns a VARCHAR value containing the name of the database to which you are connected.

Syntax
CURRENT_DATABASE()

Notes
The CURRENT_DATABASE function does not require parentheses.

Examples
SELECT CURRENT_DATABASE();
current_database
------------------
vmartschema
(1 row)
The following command returns the same results without the parentheses:
SELECT CURRENT_DATABASE;
current_database
------------------
vmartschema
(1 row)

CURRENT_SCHEMA
Returns the name of the current schema.

Syntax
CURRENT_SCHEMA()

Notes
The CURRENT_SCHEMA function does not require parentheses.

Examples
SELECT CURRENT_SCHEMA();
current_schema
----------------
public
(1 row)
The following command returns the same results without the parentheses:

-234-
SQL Functions

SELECT CURRENT_SCHEMA;
current_schema
----------------
public
(1 row)

See Also
SET (http://www.postgresql.org/docs/8.0/static/sql-set.html)

CURRENT_USER
Returns a VARCHAR containing the name of the user who initiated the current database
connection.

Syntax
CURRENT_USER()

Notes
• The CURRENT_USER function does not require parentheses.
• This function is useful for permission checking and is equivalent to SESSION_USER (page
236) and USER (page 237).

Examples
SELECT CURRENT_USER();
current_user
--------------
dbadmin
(1 row)
The following command returns the same results without the parentheses:
SELECT CURRENT_USER;
current_user
--------------
dbadmin
(1 row)

HAS_TABLE_PRIVILEGE
Returns a true/false value indicating whether or not a user can access a table in a particular way.

Syntax
HAS_TABLE_PRIVILEGE ( [ user, ] table , privilege )

Parameters
user specifies the name of a database user. The default is the CURRENT_USER
(page 235).
table specifies the name of a table in the logical schema.
privilege SELECT Allows the user to SELECT from any column of the specified table.

-235-
SQL Reference Manual

INSERT Allows the user to INSERT records into the specified table and to use the
COPY (page 323) command to load the table.
UPDATE Allows the user to UPDATE records in the specified table.
DELETE Allows the user to delete a row from the specified table.
REFERENCES Allows the user to create a foreign key constraint (privileges
required on both the referencing and referenced tables).

Notes
All arguments must be quoted string constants (page 57).

Examples
SELECT HAS_TABLE_PRIVILEGE('store.store_dimension', 'SELECT');
has_table_privilege
---------------------
t
(1 row)
SELECT HAS_TABLE_PRIVILEGE('release', 'store.store_dimension', 'INSERT');
has_table_privilege
---------------------
t
(1 row)
SELECT HAS_TABLE_PRIVILEGE('store.store_dimension', 'UPDATE');
has_table_privilege
---------------------
t
(1 row)
SELECT HAS_TABLE_PRIVILEGE('store.store_dimension', 'REFERENCES');
has_table_privilege
---------------------
t
(1 row)

SESSION_USER
Returns a VARCHAR containing the name of the user who initiated the current database
session.

Syntax
SESSION_USER()

Notes
• The SESSION_USER function does not require parentheses.
• Is equivalent to CURRENT_USER (page 235) and USER (page 237).

Examples
SELECT SESSION_USER();
session_user
--------------
dbadmin

-236-
SQL Functions

(1 row)
The following command returns the same results without the parentheses:
SELECT SESSION_USER;
session_user
--------------
dbadmin
(1 row)

USER
Returns a VARCHAR containing the name of the user who initiated the current database
connection.

Syntax
USER()

Notes
• The USER function does not require parentheses.
• Is equivalent to CURRENT_USER (page 235).

Examples
SELECT USER();
current_user
--------------
dbadmin
(1 row)
The following command returns the same results without the parentheses:
SELECT USER;
current_user
--------------
dbadmin
(1 row)

VERSION
Returns a VARCHAR containing a Vertica node's version information.

Syntax
VERSION()

Examples
SELECT VERSION();
version
-------------------------------------------------
Vertica Analytic Database v3.0.0-20090407010008
(1 row)

-237-
238

Vertica Functions
The functions in this section are specific to the Vertica database.

ADD_DESIGN_TABLES
Identifies tables for which to create projections.

Syntax
ADD_DESIGN_TABLES ( design_context_name , tableSpecification )

Parameters
design_context_name Specifies the name of the design context schema in which to add
the tables.
table_specification Specifies the names of one or more tables to add to the design.
Database Designer creates projections for these tables. Use a
comma-delimited list to specify tables, as follows:
To specify a specific table, use the form:
[schema_name.]table_name
where schema_name is the schema that contains the table for
which to create a projection.
To specify all the tables in a particular schema, use the form:
schema_name.*

Notes
• None of these tables are marked for segmentation when they are added to the design
context. See SET_DESIGN_SEGMENTATION_TABLE (page 306) for information about
segmenting tables.
• Vertica will automatically assume that the number of rows for each table will match the data
statistics, when loaded. If the number of rows in a table will differ by an order of magnitude at
implementation, use the SET_DESIGN_TABLE_ROWS (page 306) function to specify the
approximate number of rows for the table.
• If a design context contains more than one design configuration, these tables are used by all
the design configurations within the design context.

Examples
The following example adds all the tables in the public schema to the vmart design context.
SELECT ADD_DESIGN_TABLES('vmart','public.*');
The following example adds the store_orders_fact and store_sales_fact tables in the store
schema to the vmart design context.
SELECT
ADD_DESIGN_TABLES('vmart','store.store_orders_fact,store.store_sales_fact');
The following example adds all the tables in the database to the vmart design context.
SELECT ADD_DESIGN_TABLES('vmart','');

-238-
SQL Functions

See Also
CLEAR_DESIGN_TABLES (page 249), SET_DESIGN_TABLE_ROWS (page 306), and
SET_DESIGN_SEGMENTATION_TABLE (page 306)

ADD_LOCATION
Adds a location to store data.

Syntax
ADD_LOCATION ( path , [ node ] , [ usage_string ] )

Parameters
path Specifies where the storage location is mounted.
Path must be an empty directory with write permissions
for user, group, or all.
node [Optional] Is the Vertica node where the location is
available.
If this parameter is omitted, node defaults to the initiator.
usage_string Is one of the following:
DATA: Only data is stored in the location.
TEMP: Only temporary files that are created during
loads or queries are stored in the location.
DATA,TEMP: Both types of files are stored in the
location.

Notes
• By default, the location is used to store both data and temporary files.
• Locations can be added from any node to any node.
• The DBA can specify the resource type (temp files, ROS containers) for catalog storage
locations, which could help improve I/O performance.

Example
This example adds a location that stores data and temporary files:
SELECT ADD_LOCATION('/secondVerticaStorageLocation/' , 'node2');
This example adds a location to store data only:
SELECT ADD_LOCATION('/secondVerticaStorageLocation/' , 'node2' , 'DATA');

See Also
ALTER_LOCATION_USE and RETIRE_LOCATION (page 294)

-239-
SQL Reference Manual

ADVANCE_EPOCH
Manually closes the current epoch and begins a new epoch. Use ADVANCE_EPOCH
immediately before using ALTER PROJECTION MOVEOUT.

Syntax
SELECT ADVANCE_EPOCH()

See Also
ALTER PROJECTION (page 314)

ALTER_LOCATION_USE
Alters the type of files stored in the specified storage location.

Syntax
ALTER_LOCATION_USE ( path , [ node ] , usage_string )

Parameters
path Specifies where the storage location is mounted.
node [Optional] Is the Vertica node where the location is
available.
If this parameter is omitted, node defaults to the initiator.
usage_string Is one of the following:
DATA: Only data is stored in the location.
TEMP: Only temporary files that are created during
loads or queries are stored in the location.
DATA,TEMP: Both types of files are stored in the
location.

Notes
• Altering the type of files stored in a particular location is useful if you create additional
storage locations and you want to isolate execution engine temporary files from data files.
• After modifying the location's use, at least one location must remain for storing data and temp
files. These files can be stored in the same storage location or separate storage locations.
• When a storage location is altered, it stores only the type of information indicated from that
point forward. For example:
§ If you modify a storage location that previously stored both temp and data files so that it
only stores temp files, the data is eventually merged out through the ATM. You can also
merge it out manually.
§ If you modify a storage location that previously stored both temp and data files so that it
only stores data files, all currently running statements that use these temp files, such as
queries and loads, will continue to run. Subsequent statements will no longer use this
location.

-240-
SQL Functions

Example
The following example alters the storage location on node3 to store data only:
SELECT ALTER_LOCATION_USE ('/thirdVerticaStorageLocation/' , 'node3' , 'DATA');

See Also
ADD_LOCATION (page 239), RETIRE_LOCATION (page 294), and Modifying Storage
Locations

ANALYZE_CONSTRAINTS
Analyzes and reports on constraint violations within the current schema search path.
You can check for constraint violations by passing an empty argument (which returns violations
on all tables within the current schema), by passing a single table argument, or by passing two
arguments comprising a table and a column or list of columns.

Syntax
SELECT ANALYZE_CONSTRAINTS [ ( '' )
| ( schema.table )
| [ ( schema.table , column ) ]

Parameters
('') Analyzes and reports on all tables within the current schema search path.
table Analyzes and reports on all constraints referring to the specified table.
column Analyzes and reports on all constraints referring to specified table that
contains the specified columns.

Notes
Use COPY with NO COMMIT keywords to incorporate detection of constraint violations into the
load process. Vertica checks for constraint violations when queries are executed, not when data
is loaded. To avoid constraint violations, load data without committing it and then perform a
post-load check of your data using the ANALYZE_CONSTRAINTS function. If the function finds
constraint violations, you can roll back the load because you have not committed it.
ANALYZE_CONSTRAINTS() fails if the database cannot perform constraint checks, such as
when the system is out of resources. Vertica returns an error that identifies the specific condition
that caused the failure.

Return Values
ANALYZE_CONSTRAINTS() returns results in a structured set (see table below) that lists the
schema name, table name, column name, constraint name, constraint type, and the column
values that caused the violation.
If the result set is empty, then no constraint violations exist; for example:
SELECT ANALYZE_CONSTRAINTS ('public.product_dimension', 'product_key');
Schema Name | Table Name | Column Names | Constraint Name | Constraint Type | Column Values

-241-
SQL Reference Manual

-------------+------------+--------------+-----------------+-----------------+---------------
(0 rows)

The following result set, on the other hand, shows a primary key violation, along with the value
that caused the violation:
SELECT ANALYZE_CONSTRAINTS ('');
Schema Name | Table Name | Column Names | Constraint Name | Constraint Type | Column Values
-------------+------------+--------------+-----------------+-----------------+---------------
store t1 c1 pk_t1 PRIMARY ('10')
(1 row)

The result set columns are described in further detail in the following table:

Column Name Data Type Description


Schema Name VARCHAR The name of the schema.
Table Name VARCHAR The name of the table, if specified.
Column Names VARCHAR Names of columns containing constraints.
Multiple columns are in a comma-separated
list:
store_key,
store_key, date_key,
Constraint Name VARCHAR The given name of the primary key, foreign
key, unique, or not null constraint, if specified.
Constraint Type VARCHAR Identified by one of the following strings:
'PRIMARY KEY', 'FOREIGN KEY', 'UNIQUE',
or 'NOT NULL'.
Column Values VARCHAR Value of the constraint column, in the same
order in which Column Names contains the
value of that column in the violating row.
When interpreted as SQL, the value of this
column forms a list of values of the same type
as the columns in Column Names; for
example:
('1'),
('1', 'z')

Locks
ANALYZE_CONSTRAINTS(), takes locks in the same way that SELECT * FROM t1 holds a lock
on table t1.
The following table describes the locks taken by ANALYZE_CONSTRAINTS:

Transaction Mode Locks Acquired

-242-
SQL Functions

SERIALIZABLE S (read) locks on all tables involved in all constraints


(default) analyzed.
For example, if a FOREIGN KEY constraint is analyzed, an S
lock is acquired on both the table with the FOREIGN KEY and
the table with the corresponding PRIMARY KEY.
Caution: ANALYZE_CONSTRAINTS in SERIALIZABLE
mode results in S (read) locks on all tables involved in all
constraints analyzed. Thus, the database's ability to perform
other concurrent loads while those locks are held could be
severely impaired. If concurrent parallel loads are important,
do not use ANALYZE_CONSTRAINTS with SERIALIZABLE
isolation level.
READ COMMITTED None.
Note: If ANALYZE_CONSTRAINT is run at the READ
COMMITTED isolation level, the function might not find
duplicates across concurrent transactions or duplicates that
were committed by another transaction in the current epoch.
In READ COMMITTED mode, a query sees changes in the
current transaction, so any uncommitted duplicates within the
current transaction are detected.

Disabling Duplicate Key Errors


When ANALYZE_CONSTRAINTS finds violations, such as when you insert a duplicate value
into a primary key, you can correct errors using the following functions. Effects last until the end
of the session only:
• SELECT DISABLE_DUPLICATE_KEY_ERROR (page 261)
• SELECT REENABLE_DUPLICATE_KEY_ERROR (page 292)

Examples
Given the following sample inputs, the result set contains one violation because the same
primary key value (10) was inserted into table t1 twice:
CREATE TABLE t1(c1 INT);
ALTER TABLE t1 ADD CONSTRAINT pk_t1 PRIMARY KEY (c1);
CREATE PROJECTION t1_p (c1) AS SELECT * FROM t1 UNSEGMENTED ALL NODES;
INSERT INTO t1 values (10);
INSERT INTO t1 values (10); --Duplicate primary key value
SELECT ANALYZE_CONSTRAINTS('');
Schema Name | Table Name | Column Names | Constraint Name | Constraint Type | Column Values
-------------+------------+--------------+-----------------+-----------------+---------------
store t1 c1 pk_t1 PRIMARY ('10')
(1 row)

This example returns three rows, or three violations; one row each for the primary key and
unique key violation, and one row for the foreign key violation (missing primary key):
CREATE TABLE fact0(c1 INTEGER PRIMARY KEY, c2 INTEGER UNIQUE);
CREATE TABLE dim0 (c1 INTEGER REFERENCES fact0(c1));
SELECT IMPLEMENT_TEMP_DESIGN(''); --Creates the required projections
INSERT INTO fact0 values (1, 2); --Primary key (PK) and unique key violation
INSERT INTO fact0 values (3); --Foreign key (FK) violation
SELECT ANALYZE_CONSTRAINTS('');

-243-
SQL Reference Manual

Schema Name | Table Name | Column Names | Constraint Name | Constraint Type | Column Values
-------------+------------+--------------+-----------------+-----------------+---------------
public fact0 c1 - PRIMARY KEY ('1')
public fact0 c1 - UNIQUE ('3')
public fact0 c2 - FOREIGN KEY ('2')
(3 rows)

If you specify the wrong table, the system returns an error message:
SELECT ANALYZE_CONSTRAINTS('abc');
ERROR: 'abc' is not a table in the current search path
If you issue the function using incorrect syntax, the system returns an error message with a hint:
ANALYZE ALL CONSTRAINT;
Or
ANALYZE CONSTRAINT abc;
ERROR: ANALYZE CONSTRAINT is not supported.
HINT: You may consider using analyze_constraints().
In this example, create a table that contains 3 integer columns, one a unique key and one a
primary key:
CREATE TABLE fact_1(f INTEGER, f_UK INTEGER UNIQUE, f_PK INTEGER PRIMARY KEY);

Issue the command to create superprojections.


Note: The empty string in the following code example creates a temporary physical schema
design (projections) for any table in the database that lacks K-safe projections:
SELECT IMPLEMENT_TEMP_DESIGN('');
implement_temp_design
-----------------------
4
(1 row)

Try issuing a command that refers to a nonexistent column:


SELECT ANALYZE_CONSTRAINTS('fee', 'f2');
ERROR: 'fee' is not a table name in the current search path

Insert some values into table fact_1 and commit the changes:
INSERT INTO fact_1 values (1, 1, 1);
COMMIT;

Now issue the ANALYZE_CONSTRAINTS command on table fact_1. No constraint violations


are expected and none are found:
SELECT ANALYZE_CONSTRAINTS('fact_1');
Schema Name | Table Name | Column Names | Constraint Name | Constraint Type | Column Values
-------------+------------+--------------+-----------------+-----------------+---------------
(0 rows)

Now insert duplicate unique and primary key values and run ANALYZE_CONSTRAINTS on
table fact_1 again. The system shows two violations: one against the primary key and one
against the unique key:
INSERT INTO fact_1 VALUES (1, 1, 1);
COMMIT;
SELECT ANALYZE_CONSTRAINTS('fact_1');
Schema Name | Table Name | Column Names | Constraint Name | Constraint Type | Column Values
-------------+------------+--------------+-----------------+-----------------+---------------
public | fact_1 | f_pk | - | PRIMARY | ('1')
public | fact_1 | f_uk | - | UNIQUE | ('1')

-244-
SQL Functions

(2 rows)

The following command specifies constraint validation on only the unique key in table fact_1:
SELECT ANALYZE_CONSTRAINTS('fact_1', 'f_UK');
Schema Name | Table Name | Column Names | Constraint Name | Constraint Type | Column Values
-------------+------------+--------------+-----------------+-----------------+---------------
public | fact_1 | f_uk | - | UNIQUE | ('1')
(1 row)

The following example shows that you can specify the same column more than once; the
function, however, returns the violation once only:
SELECT ANALYZE_CONSTRAINTS('fact_1', 'f_PK, F_PK');
Schema Name | Table Name | Column Names | Constraint Name | Constraint Type | Column Values
-------------+------------+--------------+-----------------+-----------------+---------------
public | fact_1 | f_pk | - | PRIMARY | ('1')
(1 row)

The following example creates a new dimension table, dim_1, and inserts a foreign key and
different (character) data types:
CREATE TABLE dim_1 (b VARCHAR(3), b_PK VARCHAR(4), b_FK INTEGER REFERENCES fact_1(f_PK));

Alter the table to create a multicolumn unique key and multicolumn foreign key and issue the
command that creates the superprojections:
ALTER TABLE dim_1 ADD CONSTRAINT dim_1_multiuk PRIMARY KEY (b, b_PK);
SELECT IMPLEMENT_TEMP_DESIGN('');

The following command inserts a missing foreign key (0) in table dim_1 and commits the
changes:
INSERT INTO dim_1 VALUES ('r1', 'Xpk1', 0);
COMMIT;

Checking for constraints on table dim_1 detects a foreign key violation:


SELECT ANALYZE_CONSTRAINTS('dim_1');
Schema Name | Table Name | Column Names | Constraint Name | Constraint Type | Column
Values
-------------+------------+--------------+-----------------+-----------------+
---------------
public | dim_1 | b_fk | - | FOREIGN |
('0')
(1 row)
Now add a duplicate value into the unique key and commit the changes:
INSERT INTO dim_1 values ('r2', 'Xpk1', 1);
INSERT INTO dim_1 values ('r1', 'Xpk1', 1);
COMMIT;

Checking for constraint violations on table dim_1 detects the duplicate unique key error:
SELECT ANALYZE_CONSTRAINTS('dim_1');
Schema Name | Table Name | Column Names | Constraint Name | Constraint Type | Column Values
-------------+------------+--------------+-----------------+-----------------+----------------
public | dim_1 | b, b_pk | dim_1_multiuk | PRIMARY | ('r1', 'Xpk1')
public | dim_1 | b_fk | - | FOREIGN | ('0')
(2 rows)

Now create a table with multicolumn foreign key and create the superprojections:
CREATE TABLE dim_2(z_fk1 VARCHAR(3), z_fk2 VARCHAR(4));
ALTER TABLE dim_2 ADD CONSTRAINT dim_2_multifk FOREIGN KEY (z_fk1, z_fk2) REFERENCES dim_1(b, b_PK);
select implement_temp_design('');

-245-
SQL Reference Manual

Now insert a foreign key that matches a foreign key in table dim_1 and commit the changes:
INSERT INTO dim_2 VALUES ('r1', 'Xpk1');
COMMIT;

Checking for constraints on table dim_2 detects no violations:


SELECT ANALYZE_CONSTRAINTS('dim_2');
Schema Name | Table Name | Column Names | Constraint Name | Constraint Type | Column Values
-------------+------------+--------------+-----------------+-----------------+---------------
(0 rows)

Add a value that does not match and commit the change:
INSERT INTO dim_2 values ('r1', 'NONE');
COMMIT;

Check for constraints on table dim_2 detects a foreign key violation:


SELECT ANALYZE_CONSTRAINTS('dim_2'); -- expect FK violation
Schema Name | Table Name | Column Names | Constraint Name | Constraint Type | Column Values
-------------+------------+--------------+-----------------+-----------------+----------------
public | dim_2 | z_fk1, z_fk2 | dim_2_multifk | FOREIGN | ('r1', 'NONE')
(1 row)

Now analyze all constraints on all tables:


SELECT ANALYZE_CONSTRAINTS('');
Schema Name | Table Name | Column Names | Constraint Name | Constraint Type | Column Values
-------------+------------+--------------+-----------------+-----------------+---------------
public | dim_1 | b, b_pk | dim_1_multiuk | PRIMARY | ('r1', 'Xpk1')
public | dim_1 | b_fk | - | FOREIGN | ('0')
public | dim_2 | z_fk1, z_fk2 | dim_2_multifk | FOREIGN | ('r1', 'NONE')
public | fact_1 | f_pk | - | PRIMARY | ('1')
public | fact_1 | f_uk | - | UNIQUE | ('1')
(5 rows)

To quickly clean up your database, issue the following command:


DROP TABLE fact_1 cascade;
DROP TABLE dim_1 cascade;
DROP TABLE dim_2 cascade;

To understand about removing violating rows, see the DISABLE_DUPLICATE_KEY_ERROR


(page 261) function.

See Also
Adding Primary Key and Foreign Key Constraints in the Administrator's Guide
COPY (page 323) NO COMMIT
CREATE TABLE (page 346) and ALTER TABLE (page 316) ADD CONSTRAINT

-246-
SQL Functions

ANALYZE_STATISTICS
Collects and aggregates data samples and storage information as a background process from all
nodes on which a projection is stored, then writes statistics into the catalog so that they can be
used by the query optimizer. Without these statistics, the query optimizer would assume uniform
distribution of data values and equal storage usage for all projections.

Syntax
SELECT ANALYZE_STATISTICS { ( '' ) | ( '[schema.]table' ) | ( 'projection' ) }

Return Value
• 0 - For success.
• 1 - For failure. Refer to vertica.log for details.

Parameters
'' Empty string. Collects statistics for all projections.
[schema.]tab Specifies the name of the table and optional schema. When using
le more than one schema, specify the schema that contains the
projection.
Collects statistics for all projections of the specified table.
projection Specifies the name of the projection. Collects statistics for the
specified projection.

Notes
Issuing the command against very large tables/projections could return results more slowly.

Example
The following example computes statistics on all projections in the database and returns 0
(success):
SELECT ANALYZE_STATISTICS ('');
analyze_statistics
--------------------
0
(1 row)
The following command computes statistics on the shipping_dimension table and returns 0
(success):
SELECT ANALYZE_STATISTICS ('shipping_dimension');
analyze_statistics
--------------------
0
(1 row)
The following command computes statistics on one of the shipping_dimension table's projections
and returns 0 (success):
SELECT ANALYZE_STATISTICS('shipping_dimension_site02');
analyze_statistics

-247-
SQL Reference Manual

--------------------
0
(1 row)

CANCEL_DEPLOYMENT
Cancels a design deployment.

Syntax
CANCEL_DEPLOYMENT()

Notes
• Once you've started a deployment through the standard API, you must have permission to
access the design context in order to cancel the associated deployment.
• When you cancel a deployment, Database Designer cancels the projection refresh operation.
It does not roll back any projection that it has already deployed and refreshed.
• Once you cancel a design, you have three options:
§ Complete the deployment process. Use the RUN_DEPLOYMENT function. The
deployment will continue where it left off.
§ Use the DEPLOY_DESIGN function to re-deploy the design. Database DESIGNER will
not refresh any projections which are already up-to-date, as indicated by the
deploy_status "duplicate".
§ Modify and deploy the design from scratch.

See Also
DEPLOY_DESIGN (page 259), RUN_DEPLOYMENT (page 296), and
REVERT_DEPLOYMENT (page 295)

Once you've started a deployment through the standard API, you must have permission to
access the design context to cancel an associated deployment.
When you cancel a deployment, Database Designer cancels the projection refresh operation.

CANCEL_REFRESH
Cancels refresh operations initiated by START_REFRESH().

Syntax
CANCEL_REFRESH()

Notes
• Refresh tasks run in a background thread in an internal session, so you cannot use
INTERRUPT_STATEMENT (page 279) to cancel those statements. Instead, use
CANCEL_REFRESH to cancel statements that are run by refresh-related internal sessions.

-248-
SQL Functions

• Execute CANCEL_REFRESH() on the same node on which START_REFRESH() was


initiated.
• CANCEL_REFRESH() cancels the refresh operation running on a node, waits for the
cancellation to complete, and returns SUCCESS.
• Only one set of refresh operations executes on a node at any time.

See Also
INTERRUPT_STATEMENT (page 279), START_REFRESH (page 308),
VT_PROJECTION_REFRESH (page 482), and VT_SESSION (page 489)

CLEAR_DESIGN_SEGMENTATION_TABLE
If specified, removes all information regarding which columns to use to segment tables.

Syntax
CLEAR_DESIGN_SEGMENTATION_TABLE ( design_context_name , design_name )

Parameters
design_context_name Specifies the name of the design context schema to modify.
design_name Specifies the name of the design configuration to modify.

Note
By default, Database Designer determines which columns to use to segment tables across all
nodes. If you used the SET_DESIGN_SEGMENTATION_COLUMN (page 305) function to
determine how tables are segmented, this function clears the information you specified.

CLEAR_DESIGN_TABLES
Removes the list of tables for which Database designer was going to create projections from the
design context.

Syntax
CLEAR_DESIGN_TABLES ( design_context_name )

Notes
This is useful for quickly removing the entire list, so you can modify it.

Example
The following drops the list of tables for which to develop projections from the vmart design
context schema.
SELECT CLEAR_DESIGN_TABLES('vmart');

See Also
ADD_DESIGN_TABLES (page 238), REMOVE_DESIGN (page 292), and
REMOVE_DESIGN_CONTEXT (page 292)

-249-
SQL Reference Manual

CLEAR_QUERY_REPOSITORY
Triggers Vertica to clear query data from the query repository immediately.

Syntax
CLEAR_QUERY_REPOSITORY()

Notes
• Vertica clears data based on established query repository configuration parameters. For
example, it will use the value of the QueryRepoRetentionTime parameter to determine the
number of days worth of query data to retain. (See Configuring Query Repository.)
• The CLEAR_QUERY_REPOSITORY function resets the clock for the
CleanQueryRepoInterval back to zero (0).

See Also
Collecting Query Information

CLOSE_ALL_SESSIONS
Closes all external sessions except the one issuing the CLOSE_ALL_SESSIONS functions.

Syntax
CLOSE_ALL_SESSIONS()

Notes
Closing of the sessions is processed asynchronously. It might take some time for the session to
be closed. Check the SESSIONS (page 458) table for the status

Message
close_all_sessions | Close all sessions command sent. Check SESSIONS for progress.

Examples
2 user sessions opened, each on a different node
=> select * from sessions;
-[ RECORD 1 ]
timestamp | 2008-03-26 21:57:59
node_name | site01
username | release
client | 127.0.0.1:55642
login_time | 2008-03-26 21:43:50
sessionid | rhel4-1-26555:0x14a26:534149977
txn_start | 2008-03-26 21:44:06
txnid | 45035996273734813
txn_descript | user release (select * from sessions;)
stmt_start | 2008-03-26 21:57:59
stmtid | 0
last_stmt_duration | 1
current_stmt | select * from sessions;
last_stmt | select * from sessions;

-250-
SQL Functions

-[ RECORD 2 ]
timestamp | 2008-03-26 21:57:59
node_name | site01
username | release
client | 10.242.62.8:55652
login_time | 2008-03-26 21:57:42
sessionid | rhel4-1-26555:0x150bb:1059190338
txn_start | 2008-03-26 21:57:42
txnid | 45035996273734822
txn_descript | user release (COPY ClickStream_Fact FROM
'/data/ClickStream_Fact.tbl' DELIMITER '|' NULL '\\n';)
stmt_start | 2008-03-26 21:57:42
stmtid | 17179869186
last_stmt_duration | 0
current_stmt | COPY ClickStream_Fact FROM '/data/ClickStream_Fact.tbl'
DELIMITER '|' NULL '\\n';
last_stmt |
-[ RECORD 3 ]
timestamp | 2008-03-26 21:57:59
node_name | site01
username | release
client | 10.242.62.8:55659
login_time | 2008-03-26 21:57:45
sessionid | rhel4-1-26555:0x150c3:304606079
txn_start | 2008-03-26 21:57:45
txnid | 45035996273734823
txn_descript | user release (COPY ClickStream_Fact FROM
'/data/ClickStream_Fact.tbl' DELIMITER '|' NULL '\\n' DIRECT;)
stmt_start | 2008-03-26 21:57:46
stmtid | 17179869187
last_stmt_duration | 0
current_stmt | COPY ClickStream_Fact FROM '/data/ClickStream_Fact.tbl'
DELIMITER '|' NULL '\\n' DIRECT;
last_stmt |

close_all_sessions
=> select close_all_sessions();
close_all_sessions | Close all sessions command sent. Check sessions for progress.
sessions contents after close_all_sessions()
=> select * from sessions;
-[ RECORD 1 ]
timestamp | 2008-03-26 22:00:25
node_name | site01
username | release
client | 127.0.0.1:55642
login_time | 2008-03-26 21:43:50
sessionid | rhel4-1-26555:0x14a26:534149977
txn_start | 2008-03-26 21:44:06
txnid | 45035996273734813
txn_descript | user release (select * from sessions;)
stmt_start | 2008-03-26 22:00:25
stmtid | 0
last_stmt_duration | 557
current_stmt | select * from sessions;

-251-
SQL Reference Manual

last_stmt | select close_all_sessions();

CLOSE_SESSION
Interrupts the specified external session and rolls back the current transaction, if any, and closes
the socket.

Syntax
CLOSE_SESSION( sessionid )

Parameters
sessionid A string that specifies the session to close. This identifier is unique
within the cluster at any point in time but can be reused when the
session closes.

Notes
Closing of the session is processed asynchronously. It could take some time for the session to
be closed. Check the SESSIONS (page 458) table for the status.

Messages
close_session | Session close command sent. Check SESSIONS for progress.
Error: invalid Session ID format
For a badly formatted sessionID
Error: Invalid session ID or statement key
For an incorrect sessionID parameter

Examples
User session opened. RECORD 2 shows the user session running COPY DIRECT statement.
=> SELECT * FROM SESSIONS;
-[ RECORD 1 ]
current_timestamp | 2008-04-01 14:53:31
node_name | site01
user_name | release
client_hostname | 127.0.0.1:57141
login_timestamp | 2008-04-01 14:41:26
session_id | rhel4-1-30361:0xd7e3e:994462853
transaction_start | 2008-04-01 14:48:54
transaction_id | 45035996273741092
transaction_description | user release (select * from SESSIONS;)
statement_start | 2008-04-01 14:53:31
statement_id | 0
last_stmt_duration | 1

-252-
SQL Functions

current_statement | select * from SESSIONS;


last_statement | select * from SESSIONS;

-[ RECORD 2 ]
current_timestamp | 2008-04-01 14:53:31
node_name | site01
user_name | release
client_hostname | 127.0.0.1:57142
login_timestamp | 2008-04-01 14:52:55
session_id | rhel4-1-30361:0xd83ac:1017578618
transaction_start | 2008-04-01 14:53:26
transaction_id | 45035996273741096
transaction_description | user release (COPY ClickStream_Fact FROM
'/data/clickstream/1g/ClickStream_Fact.tbl'
DELIMITER '|' NULL '\\n' DIRECT;)
statement_start | 2008-04-01 14:53:26
statement_id | 17179869528
last_statement_duration | 0
current_statement | COPY ClickStream_Fact FROM
'/data/clickstream/1g/ClickStream_Fact.tbl'
DELIMITER '|' NULL '\\n' DIRECT;
last_statement |
Close user session rhel4-1-30361:0xd83ac:1017578618
=> SELECT CLOSE_SESSION('rhel4-1-30361:0xd83ac:1017578618');
-[ RECORD 1 ]
close_session | Session close command sent. Check SESSIONS for progress.
Session rhel4-1-30361:0xd83ac:1017578618 closed, no longer displayed in SESSIONS
=> SELECT * FROM SESSIONS;
-[ RECORD 1 ]
current_timestamp | 2008-04-01 14:54:11
node_name | site01
user_name | release
client_hostname | 127.0.0.1:57141
login_timestamp | 2008-04-01 14:41:26
session_id | rhel4-1-30361:0xd7e3e:994462853
transaction_start | 2008-04-01 14:48:54
transaction_id | 45035996273741092
transaction_description | user release (select * from SESSIONS;)
statement_start | 2008-04-01 14:54:11
statement_id | 0
last_statement_duration | 98
current_statement | select * from SESSIONS;
last_statement | select
close_session('rhel4-1-30361:0xd83ac:1017578618');

See Also
SESSION (page 458)

CONFIGURE_DEPLOYMENT
Analyzes a design for deployment, but does not deploy it.

-253-
SQL Reference Manual

Syntax
CONFIGURE_DEPLOYMENT ( design_context_name , design_name )

Parameters
design_context_na Specifies the name of the design context schema that contains the
me design to analyze.
design_name Specifies the name of the design to analyze.

Notes
Instructing Database Designer to create a deployment implementation without actually deploying
the design is useful if you want to:
• Create a test design. See Deploying Test Designs.
• Prevent specific projections from being dropped during deployment. See Preserving Existing
Projections During Deployment.
CONFIGURE_DEPLOYMENT prepares a design for deployment by:
• Setting the deployment status of the design to pending. See the
design_<design_name>_deployment view.
• Either creating a <designName>_deployment_projections table, if one does not exist, to track
the status of every projection in the design (for new designs) or, if the table already exists,
clearing it of existing projection entries.
• Analyzing the design to be deployed and determining its specific projection implementation.
• Placing a record in the <designName>_deployment_projections table for each projection to
be added to or dropped from the deployment.
If a duplicate of projection to be added already exists in the catalog, it is given a status of
duplicate and Database Designer does not redeploy it. Database Designer determines
duplicate projections if they are anchored on the sample table as the projection to be added
and have the same:
§ Table columns with same encoding
§ Sort columns in the same order
§ Segmentation nodes.
§ Segmentation expression (none for unsegmented).
§ Segmentation offset (none for unsegmented).
Note: If there is a naming conflict between a new projection and one already in the catalog,
the name of the deployed projection is modified.
Note: If you attempt to configure a deployment for a design created in Vertica version 3.0,
Vertica displays an error indicating that you must regenerate the design. This occurs
because there is no deployment system table for the design. Use either CREATE_DESIGN
(page 255) or UPDATE_DESIGN (page 310).
When you're ready to deploy the design, use the RUN_DEPLOYMENT (page 296) function.

-254-
SQL Functions

Example
The following example analyzes the design (VMartDesign) from the vmart context, but does not
deploy it.
SELECT CONFIGURE_DEPLOYMENT('vmart','VMartDesign');

See Also
DEPLOY_DESIGN (page 259), RUN_DEPLOYMENT (page 296), CANCEL_DEPLOYMENT
(page 248) and REVERT_DEPLOYMENT (page 295)

CREATE_DESIGN
Generates a new physical schema design.

Syntax
CREATE_DESIGN ( design_context_name , design__name )

Parameters
design_context_name Specifies the name of the design context schema for which to
generate a design.
design_name Specifies the name of the design configuration for which to
generate a design.

Notes
CREATE_DESIGN creates one of the following designs:
• A basic design if no query input table exists or it is empty.
• An optimized design if the query input table is populated.

Example
The following example creates a design for the VMartDesign configuration within the vmart
design context:
SELECT CREATE_DESIGN('vmart','VMartDesign');

See Also
UPDATE_DESIGN (page 310), REMOVE_DESIGN (page 292)

CREATE_DESIGN_CONFIGURATION
Creates a design configuration with the specified name.

Syntax
CREATE_DESIGN_CONFIGURATION ( design_context_name , design_name )

-255-
SQL Reference Manual

Parameters
design_context_na Specifies the name of the design context schema in which to create the
me design.
design_name Specifies the name of the design configuration to create. Design names
should be no longer than 64 characters long.

Notes
This statement creates a design configuration within the context schema specified. Specifically, it
creates system tables to store the design configuration.

Example
The following example creates a design configuration called VMartDesign in the vmart design
context:
SELECT CREATE_DESIGN_CONFIGURATION('vmart','VMartDesign');

See Also
CREATE_DESIGN_CONTEXT (page 256)

CREATE_DESIGN_CONTEXT
Creates a design context schema with the specified name.

Syntax
CREATE_DESIGN_CONTEXT ( design_context_name , [ user_name ] )

Parameters
design_context_name Specifies the name of the design context schema to be created.
user_name Grants USAGE privileges on the design context schema to the
user specified. This enables the user to create, modify, and drop
designs within the context schema. It does not give the user the
ability to drop the context.

Notes
• Only the database administrator can use the CREATE_DESIGN_CONTEXT statement.
• By default, the database administrator has all privileges on the design context schema. To
grant USAGE privileges on the design context schema, specify an existing user_name. See
Required Privileges for Creating Designs.

Example
The following creates a design context schema for the VMart demo database.
SELECT CREATE_DESIGN_CONTEXT('vmart');

-256-
SQL Functions

See Also
REMOVE_DESIGN_CONTEXT (page 292)

CREATE_DESIGN_QUERIES_TABLE
Creates a query input table with the name you specify.

Syntax
CREATE_DESIGN_QUERIES_TABLE ( query_table_name )

Parameters
query_table_name Specifies the name of the query input table to create.

Notes
This function is used in conjunction with LOAD_DESIGN_QUERIES (page 282) and
SET_DESIGN_QUERIES_TABLE (page 303) to create a query input table, load it with data from
the query repository, and establish it as the query input table for the specified design.

Example
The following example creates a query input table and names it vmart_query_input.
SELECT CREATE_DESIGN_QUERIES_TABLE('vmart_query_input');

See Also
SET_DESIGN_QUERIES_TABLE (page 303) and RESET_DESIGN_QUERIES_TABLE (page
293)

CREATE_PROJECTION_DESIGN
Creates the specified physical schema design.

Syntax
CREATE_PROJECTION_DESIGN ( design_name ,
...[ query_table_name ,
.....design_[schema.]tables ,
.....large_tables ,
.....int_k_safety_value ] )

Parameters
design_name Specifies the name of the design context schema and design
configuration to be created. Design names should be no longer
than 64 characters long.
query_table_name Specifies the name of the query input table that contains the
queries you want to use to optimize the design.
The default value for this parameter is ". If you do not specify

-257-
SQL Reference Manual

this parameter, Database Designer creates a basic design.

design_[schema.]tables Specifies the tables for which to create projections. Use a


comma-delimited list to specify tables, as follows:
To specify a specific table, use the form:
[schema_name.]table_name
where schema_name is the schema that contains the table
for which to create a projection.
To specify all the tables in a particular schema, use the form:
schema_name.*
The default value for this parameter is ". If you do not specify
this parameter, Database Designer creates projections for all
user tables in the database.
large_tables Specifies tables for which to create segmented projections.
Segmented projections are used for large tables (typically fact
tables).
Use a comma-delimited list to specify tables. To specify a
specific table, use the form:
[schema_name.]table_name
Where schema_name is the schema that contains the table for
which to create a segmented projection.
Be sure to specify only tables that are specified as design
tables in the design_[schema.]tables parameter.
The default value for this parameter is ". If you do not specify
this parameter, Database Designer does not create any
segmented projections.
int_k-safety_value Sets the K-safety value to 0, 1, or 2. The value of K can be one
1 or two 2 only when the physical schema design meets certain
requirements. See K-Safety for an overview.
If you do not specify this parameter, the value of K defaults to 0
for database clusters that contain fewer than three nodes or 1
for database clusters that contain three or more nodes.

Notes
• This function creates a design, but does not implement it. To implement the design, use the
GET_DESIGN_SCRIPT (page 275) function to create a SQL script which you can redirect to
a file and run to create projections. To create a file that contains the script, use \o to redirect
the results to a file and \t to show only tuples (no headings). To deploy the projections, use
the \i meta-command in vsql to execute the file.
• This function returns the number of tables for which it created projections. It also logs details
of the design process, such as a summary of the projections it created, in the designer log
file (designer.log), which is located in the same directory as vertica.log.

Examples
• The following example creates a basic design named VMart_Schema within a design context
named VMart_Schema. By default:

-258-
SQL Functions

§ The design contains projections for all the tables in the database.
§ None of the projections are segmented.
§ The value of K-Safety is one (1).

SELECT CREATE_PROJECTION_DESIGN('VMart_Schema');
• The following example creates a basic design named VMart_Schema within a design context
named VMart_Schema. Were:
§ '"' instructs Database Designer to a basic design. No queries are provided.
§ 'public.* and store.*' are the tables for which to create the design. In this case it's all the
tables in the public and store schemas.
§ 'public.inventory_fact,store.store_sales_fact,store.store_orders_fact,online_sales.online_s
ales_fact' are the tables to segment. Typically fact and large dimension tables are
segmented across all nodes within the database cluster. This improves the efficiency of
the database.
§ 0 is the K-safety value of the database.

SELECT CREATE_PROJECTION_DESIGN('VMart_Schema',
'"',
'public.*,store.*',
'public.inventory_fact,store.store_sales_fact,store.store_orders_fac
t,
online_sales.online_sales_fact',
'0');

• The following example creates an optimized design named VMart_Opt within a design
context named VMart_Opt. The design will be optimized for the queries stored in the
public.myQueries query input file. By default:
§ The design contains projections for all the tables in the database.
§ None of the projections are segmented.
§ The value of K-Safety is one (1).

SELECT CREATE_PROJECTION_DESIGN('VMart_Opt', 'public.myQueries');

See Also
GET_DESIGN_SCRIPT (page 275)

DEPLOY_DESIGN
Analyzes a design and then deploys it.

Syntax
DEPLOY_DESIGN ( design_context_name , design_name )

-259-
SQL Reference Manual

Parameters
design_context_na Specifies the name of the design context schema that contains the
me design to analyze.
design_name Specifies the name of the design to analyze.

Notes
DEPLOY_DESIGN prepares a design for deployment by:
• Setting the deployment status of the design to pending. See the
design_<design_name>_deployment view.
• Either creating a <designName>_deployment_projections table, if one does not exist, to track
the status of every projection in the design (for new designs) or, if the table already exists,
clearing it of existing projection entries.
• Analyzing the design to be deployed and determining its specific projection implementation.
• Placing a record in the <designName>_deployment_projections table for each projection to
be added to or removed from the deployment.
If a projection to be added already exists in the deployment (catalog), it is given a status of
deployed and Database Designer does not redeploy it. A column already exists in the
deployment catalog if it is anchored on the sample table as the projection to be added and
has the same:
§ Table columns with same encoding
§ Sort columns in the same order
§ Segmentation nodes.
§ Segmentation expression (none for unsegmented).
§ Segmentation offset (none for unsegmented).
Note: If there is a naming conflict between a new projection and one already in the catalog,
the name of the new projection is changed in the design repository and in the
<designName>_deployment_projections table.
• Finally, DEPLOY_DESIGN calls RUN_DEPLOYMENT (page 296) to implement the
projections.
Note: If you attempt to deploy a design created in Vertica version 3.0, Vertica displays an
error indicating that you must regenerate the design. This occurs because there is no
deployment system table for the design. Use either CREATE_DESIGN (page 255) or
UPDATE_DESIGN (page 310).

Example
The following example analyzes the design (VMartDesign) from the vmart context and then calls
RUN_DEPLOYMENT to deploy it.
SELECT DEPLOY_DESIGN('vmart','VMartDesign');

-260-
SQL Functions

See Also
RUN_DEPLOYMENT (page 296), CANCEL_DEPLOYMENT (page 248) and
REVERT_DEPLOYMENT (page 295)

DISABLE_DUPLICATE_KEY_ERROR
Disables error messaging when Vertica finds duplicate PRIMARY KEY/UNIQUE KEY values at
runtime. Queries execute as though no constraints are defined on the schema. Effects are
session scoped.
CAUTION: When called, DISABLE_DUPLICATE_KEY_ERROR() suppresses data integrity
checking and can lead to incorrect query results. Use this function only after you insert
duplicate primary keys into a dimension table in the presence of a prejoin projection. Then
correct the violations and turn integrity checking back on with
REENABLE_DUPLICATE_KEY_ERROR (page 292)().

Syntax
SELECT DISABLE_DUPLICATE_KEY_ERROR();

Usage
The following sample statements create dimension table dim and the corresponding projection:
CREATE TABLE dim (pk INTEGER PRIMARY KEY, x INTEGER);
CREATE PROJECTION dim_p (pk, x) AS SELECT * FROM dim ORDER BY x UNSEGMENTED ALL
NODES;
The next two statements create table fact and the pre-join projection that joins fact to dim.
CREATE TABLE fact(fk INTEGER REFERENCES dim(pk));
CREATE PROJECTION prejoin_p (fk, pk, x) AS SELECT * FROM fact, dim WHERE pk=fk ORDER
BY x;
The following statements load values into table dim. Notice the last statement inserts a duplicate
primary key value of 1:
INSERT INTO dim values (1,1);
INSERT INTO dim values (2,2);
INSERT INTO dim values (1,2); --Constraint violation
COMMIT;
Table dim now contains duplicate primary key values, but you cannot delete the violating row
because of the presence of the pre-join projection. Any attempt to delete the record results in the
following error message:
ROLLBACK: Duplicate primary key detected in FK-PK join Hash-Join (x dim_p), value
1
In order to remove the constraint violation (pk=1), use the following sequence of commands,
which puts the database back into the state just before the duplicate primary key was added.
1 To remove the violation, first save the original dim rows that match the duplicated primary
key.
CREATE TEMP TABLE dim_temp(pk integer, x integer);

-261-
SQL Reference Manual

INSERT INTO dim_temp SELECT * FROM dim WHERE pk=1 AND x=1; -- original
dim row
2 Temporarily disable error messaging on duplicate constraint values:
SELECT DISABLE_DUPLICATE_KEY_ERROR();
Caution: Remember that issuing this command suppresses the enforcement of data integrity
checking.
3 Remove the the original row that contains duplicate values:
DELETE FROM dim WHERE pk=1;
4 Allow the database to resume data integrity checking:
SELECT REENABLE_DUPLICATE_KEY_ERROR();
5 Reinsert the original values back into the dimension table:
INSERT INTO dim SELECT * from dim_temp;
COMMIT;
6 Validate your dimension and fact tables.
If you receive the following error message, it means that the duplicate records you want to delete
are not identical. That is, the records contain values that differ in at least one column that is not a
primary key; for example, (1,1) and (1,2).
ROLLBACK: Delete: could not find a data row to delete (data integrity violation?)
The difference between this message and the rollback message in the previous example is that
a fact row contains a foreign key that matches the duplicated primary key, which has been
inserted. Thus, a row with values from the fact and dimension table is now in the prejoin
projection. In order for the DELETE statement (Step 3 in the following example) to complete
successfully, extra predicates are required to identify the original dimension table values (the
values that are in the prejoin).
This example is nearly identical to the previous example, except that an additional INSERT
statement joins the fact table to the dimension table by a primary key value of 1:
INSERT INTO dim values (1,1);
INSERT INTO dim values (2,2);
INSERT INTO fact values (1); -- New insert statement joins fact with dim on primary
key value=1
INSERT INTO dim values (1,2); -- Duplicate primary key value=1
COMMIT;
1 To remove the violation, first save the original dim and fact rows that match the duplicated
primary key:
CREATE TEMP TABLE dim_temp(pk integer, x integer);
CREATE TEMP TABLE fact_temp(fk integer);
INSERT INTO dim_temp SELECT * FROM dim WHERE pk=1 AND x=1; -- original
dim row
INSERT INTO fact_temp SELECT * FROM fact WHERE fk=1;
2 Temporarily suppresses the enforcement of data integrity checking:
SELECT DISABLE_DUPLICATE_KEY_ERROR();
3 Remove the duplicate primary keys. These steps implicitly remove all fact rows with the
matching foreign key, as well.

-262-
SQL Functions

a) Remove the the original row that contains duplicate values:


DELETE FROM dim WHERE pk=1 AND x=1;
Note: The extra predicate (x=1) specifies removal of the original (1,1) row, rather than
the newly inserted (1,2) values that caused the violation.
b) Remove all remaining rows:
DELETE FROM dim WHERE pk=1;
4 Turn on integrity checking:
SELECT REENABLE_DUPLICATE_KEY_ERROR();
5 Reinsert the original values back into the fact and dimension table:
INSERT INTO dim SELECT * from dim_temp;
INSERT INTO fact SELECT * from fact_temp;
COMMIT;
6 Validate your dimension and fact tables.

See Also
ANALYZE_CONSTRAINTS (page 241)
REENABLE_DUPLICATE_KEY_ERROR (page 292)

DISPLAY_LICENSE
Returns license information.

Syntax
SELECT DISPLAY_LICENSE()

Examples
SELECT DISPLAY_LICENSE();
display_license
-----------------------------------------------------
Vertica Systems, Inc.
2007-08-03
Perpetual
0
500GB

(1 row)

DO_TM_TASK
Runs a Tuple Mover operation (moveout) on one or more projections defined on the specified
table.

Syntax
SELECT DO_TM_TASK( 'moveout' , table_name [ projection ] );

-263-
SQL Reference Manual

Parameters
moveout Moves out all projections on the specified table (if a
particular projection is not specified).
table_name Is the name of the specified table.
projection [Optional] If projection is not passed as an argument, all
projections in the system are used.
If projection is specified, DO_TM_TASK looks for a
projection of that name and, if found, uses it; if a named
projection is not found, the function looks for a table with
that name and, if found, moves out all projections on that
table.

Notes
DO_TM_TASK() is useful because you can move out all projections from a table or database,
without having to name each projection individually.
You do not need to stop the Tuple Mover.

Data in the WOS


Note: If you are continuously loading data into the WOS, you must stop the load before you
can to drop a partition.
The WOS is not partitioned, so it must be empty for partitioning to succeed. To move data out of
the WOS:
1 Advance the epoch by issuing the following command:
SELECT ADVANCE_EPOCH (page 240)();
2 Move data out of the WOS with the command:
SELECT DO_TM_TASK (page 263)('moveout', 'table-name')
This function performs a moveout of all projections defined over the specified table.
3 To determine if data remains in the WOS and, if so, which projections have data in the WOS,
use the following system tables:
§ Use the SYSTEM (page 461) table to determine if any data remains in the WOS:
SELECT WOS_BYTES FROM SYSTEM;
§ If data remains in the WOS, use PROJECTION_STORAGE (page 448) to determine
which projections have data in the WOS:
SELECT * FROM PROJECTION_STORAGE;

Note: These tables show all data in the WOS, not just committed data.

See Also
COLUMN_STORAGE (page 426)
DROP_PARTITION (page 266)

-264-
SQL Functions

DUMP_PARTITION_KEYS (page 270)


DUMP_PROJECTION_PARTITION_KEYS (page 270)
DUMP_TABLE_PARTITION_KEYS (page 271)
PARTITION_PROJECTION (page 288)
SELECT ADVANCE_EPOCH (page 240)
Partitioning Tables in the Administrator's Guide

Examples

Expression Result
DO_TM_TASK('moveout', 'fact'); Performs a moveout of all projections for table fact
DO_TM_TASK('moveout', 'fact_p') Performs a moveout for projection fact_p

DROP_LOCATION
Removes the specified storage location.

Syntax
DROP_LOCATION ( 'path' , 'site' )

Parameters
path Specifies where the storage location to drop is mounted.

site Is the Vertica site where the location is available.

Notes
• Dropping a storage location is a permanent operation and cannot be undone. Therefore,
Vertica recommends that you retire a storage location before dropping it. This will allow you
to verify that you actually want to drop a storage location before doing so. Additionally, you
can easily restore a retired storage location.
• Dropping storage locations is limited to locations that contain only temp files.
• If a location used to store data and you modified it to store only temp files, the location may
still contain data files. If the storage location contains data files, Vertica will not allow you to
drop it. You can manually merge out all the data in this location, wait for the ATM to
mergeout the data files automatically, or you can drop partitions. Deleting data files will not
work.

Example
The following example drops a storage location on node3 that was used to store temp files:

-265-
SQL Reference Manual

SELECT DROP_LOCATION('/secondVerticaStorageLocation/' , 'node3');

See Also
• RETIRE_LOCATION (page 294) in this SQL Reference Guide
• Dropping Storage Locations and Retiring Storage Locations in the Administrator's Guide

DROP_PARTITION
Forces the partition of projections (if needed) and then drops the specified partition.

Syntax
DROP_PARTITION [ ( table_name ) , ( partition_value ) ]

Parameters
table_name Specifies the name of the table.

partition_value Must be specified as a string (within quotes) for all data


types; for example:
DROP_PARTITION('t1', '2');

Notes
Partitioning functions take immutable2 functions only, in order that the same information be
available across all nodes.

Restrictions
• When specifying arguments in CREATE TABLE … PARTITION BY (page 346) expressions,
single quotes are optional for INT and FLOAT data types. All other data types require quotes.
• The specified table cannot be used as a dimension in a pre-joined projection.
• The specified table cannot contain projections in non up-to-date state.
• Projections anchored on the specified table cannot have data in the WOS.

Data in the WOS


Note: If you are continuously loading data into the WOS, you must stop the load before you
can to drop a partition.
The WOS is not partitioned, so it must be empty for partitioning to succeed. To move data out of
the WOS:
1 Advance the epoch by issuing the following command:
SELECT ADVANCE_EPOCH (page 240)();
2 Move data out of the WOS with the command:
SELECT DO_TM_TASK (page 263)('moveout', 'table-name')
This function performs a moveout of all projections defined over the specified table.

2
Immutable functions return the same answers when provided the same inputs. For example, 2+2 always
equals 4.

-266-
SQL Functions

3 To determine if data remains in the WOS and, if so, which projections have data in the WOS,
use the following system tables:
§ Use the SYSTEM (page 461) table to determine if any data remains in the WOS:
SELECT WOS_BYTES FROM SYSTEM;
§ If data remains in the WOS, use PROJECTION_STORAGE (page 448) to determine
which projections have data in the WOS:
SELECT * FROM PROJECTION_STORAGE;

Note: These tables show all data in the WOS, not just committed data.

See Also
ALTER PROJECTION MOVEOUT (page 314)
ADVANCE EPOCH (page 240)
CREATE TABLE (page 346)
DO_TM_TASK (page 263)
DUMP_PARTITION_KEYS (page 270)
DUMP_PROJECTION_PARTITION_KEYS (page 270)
DUMP_TABLE_PARTITION_KEYS (page 271)
MERGE_PARTITIONS (page 287)
PARTITION_PROJECTION (page 288)
PARTITION_TABLE (page 289)
VT_COLUMN_STORAGE (page 469)
VT_PROJECTION (page 481)
Partitioning Tables in the Administrator's Guide

Examples
The following sequence of commands partitions and drops data by year:
CREATE TABLE fact (
...,
date_col date NOT NULL,
... )
PARTITION BY extract('year' FROM date_col);
SELECT DROP_PARTITION ('fact', 1999);
Or
SELECT DROP_PARTITION ('fact', extract (
'year' FROM '1999-01-01'::date);
The following example partitions and drops data by state:
CREATE TABLE fact (

-267-
SQL Reference Manual

...,
state VARCHAR2 NOT NULL,
... )
PARTITION BY state;
SELECT DROP_PARTITION ('fact', 'MA');

The following example partitions and drops data by year and month:
CREATE TABLE fact (
...,
year INTEGER NOT NULL,
month INTEGER NOT NULL,
... )
PARTITION BY year * 12 + month;
Using a constant for Oct 2007, 2007*12 + 10 = 24094:
SELECT DROP_PARTITION('fact', ‘24094’);

-268-
269

DUMP_CATALOG
Returns an internal representation of the Vertica catalog. This function is used for diagnostic
purposes.

Syntax
SELECT DUMP_CATALOG()

Notes
Send the output to Technical Support (on page 33).

Example
This example shows how to extract the catalog into a file (/tmp/catalog.txt)
\o /tmp/catalog.txt
SELECT DUMP_CATALOG();
\o

-269-
270

DUMP_LOCKTABLE
Determines whether or not a lock has been released.

Syntax
SELECT DUMP_LOCKTABLE()

Notes
Use DUMP_LOCKTABLE if Vertica becomes unresponsive; send the output to Technical
Support (on page 33).

DUMP_PARTITION_KEYS
Dumps the partition keys of all projections in the system.

Syntax
DUMP_PARTITION_KEYS( )

See Also
DO_TM_TASK (page 263)
DROP_PARTITION (page 266)
DUMP_PROJECTION_PARTITION_KEYS (page 270)
DUMP_TABLE_PARTITION_KEYS (page 271)
PARTITION_PROJECTION (page 288)
PARTITION_TABLE (page 289)
Partitioning Tables in the Administrator's Guide

DUMP_PROJECTION_PARTITION_KEYS
Dumps the partition keys of the specified projection.

Syntax
DUMP_PROJECTION_PARTITION_KEYS( projection_name )

Parameters
projection_name Specifies the name of the projection.

See Also
DO_TM_TASK (page 263)
DROP_PARTITION (page 266)
DUMP_PARTITION_KEYS (page 270)

-270-
SQL Functions

DUMP_TABLE_PARTITION_KEYS (page 271)


PARTITION_PROJECTION (page 288)
PARTITION_TABLE (page 289)
Partitioning Tables in the Administrator's Guide

DUMP_TABLE_PARTITION_KEYS
Dumps the partition keys of all projections anchored on the specified table.

Syntax
DUMP_TABLE_PARTITION_KEYS( table_name )

Parameters
table_name Specifies the name of the table.

See Also
DO_TM_TASK (page 263)
DROP_PARTITION (page 266)
DUMP_PARTITION_KEYS (page 270)
DUMP_PROJECTION_PARTITION_KEYS (page 271)
PARTITION_PROJECTION (page 288)
PARTITION_TABLE (page 289)
Partitioning Tables in the Administrator's Guide

EXPORT_CATALOG
Generates a SQL script that can be used to recreate a physical schema design in its current
state on a different cluster.

Syntax
EXPORT_CATALOG( filename , { design | design_all } )

Parameters
filename Specifies the path and name of the SQL output file. An
empty string dumps the script to console.
design Instructs Vertica to export the catalog.
design_all Instructs Vertica to export the catalog, systems schemas,
system views, system tables, and the projections on these
system tables.

-271-
SQL Reference Manual

Notes
• Exporting a design is useful for quickly moving a design to another cluster.
• Use the design_all parameter when adding a node to a cluster. See Integrating Data Into a
New Database Design.
• If a projection is created with no sort order, Vertica implicitly assigns a sort order based on
the SELECT columns in the projection definition. The sort order is explicitly defined in the
exported script.

Restrictions
The export script Vertica generates is portable as long as all the projections were generated
using UNSEGMENTED ALL NODES or SEGMENTED ALL NODES. Projections might not exist
on ALL NODES for the following reasons:
• A projection was dropped from a node.
• A projection was created only on a subset of nodes.
• An additional node was added since the projection set was created.

EXPORT_DESIGN_CONFIGURATION
Generates a VSQL script that can be used to recreate the design configuration on another
system.

Syntax
EXPORT_DESIGN_CONFIGURATION ( design_context_name , design_name ,
export_file_name )

Parameters
design_context_name Specifies the name of the design context schema that contains
the design configuration to export.
existing_design_name Specifies the name of the design configuration to export.
saved_design_name Specifies a unique name for the file generated by this function.

Notes
• This function exports all the Database Designer tables that were created in a design context
that relate to the specified design configuration. It does not export design tables from the
logical schema that are referenced by Database Designer. (See
EXPORT_DESIGN_TABLES (page 273).) Use EXPORT_DESIGN_TABLES and
EXPORT_DESIGN_CONFIGURATION to produce two scripts which, when run in sequence,
can be used to recreate the logical schema and design context in another database.
• If a path is not specified with the export_file_name, Vertica creates the file in the catalog
directory.

-272-
SQL Functions

Example
The following example creates a file named VMartDesignExport that contains the necessary data
to recreate the VMartDesign configuration.
SELECT EXPORT_DESIGN_CONFIGURATION('vmart','VMartDesign','VMartDesignExport');

See Also
EXPORT_DESIGN_TABLES (page 273)

EXPORT_DESIGN_TABLES
Generates a script that, when run, recreates all the database structures for the logical schema
that are referenced by the design context.

Syntax
EXPORT_DESIGN_TABLES ( design_context_name , design_name , export_file__name )

Parameters
design_context_name Specifies the name of the design context schema that contains
the design to export.
existing_design_name Specifies the name of the design to export.
saved_design_name Specifies a unique name for the file generated by this function.

Notes
• This function exports all the design tables from the logical schema that are referenced by the
design context specified. It also exports all the schemas, constraints, projections, and views
associated with these tables.
• This function is useful for exporting a design to another system.
• If a path is not specified with the export_file_name, Vertica creates the file in the catalog
directory.

Example
The following example creates a file named VMartExport that contains data to recreate all the
database structures for the logical schema associated with the VMartDesign within the vmart
design context.
SELECT EXPORT_DESIGN_TABLES('vmart','VMartDesign','VMartExport');

See Also
EXPORT_DESIGN_CONFIGURATION (page 272)

EXPORT_STATISTICS
Generates an XML file that contains statistics for the database.

-273-
SQL Reference Manual

Syntax
EXPORT_STATISTICS( filename )

Parameters
filename Specifies the path and name of the XML output file. An
empty string dumps the script to console.

Notes
• EXPORT_STATISTICS is used in conjunction with ANALYZE_STATISTICS (page 247) and
READ_DATA_STATISTICS (page 291) to provide information regarding the database to
Database Designer to use to generate designs.
• Before you export statistics for the database, be sure to run ANALYZE_STATISTICS to
collect and aggregate data samples and storage information. If you do not use
ANALYZE_STATISTICS, Database Designer will produce a sub-optimal projection similar to
the ones created for temporary designs.

GET_AHM_EPOCH
Returns the number of the epoch in which the Ancient History Mark is located. Data deleted up
to and including the AHM epoch can be purged from physical storage.

Syntax
GET_AHM_EPOCH()
Note: The AHM epoch is 0 (zero) by default (purge is disabled).

Examples
SELECT GET_AHM_EPOCH();
get_ahm_epoch
----------------------
Current AHM epoch: 0
(1 row)

GET_AHM_TIME
Returns a TIMESTAMP value representing the Ancient History Mark. Data deleted up to and
including the AHM epoch can be purged from physical storage.

Syntax
GET_AHM_TIME()

Examples
SELECT GET_AHM_TIME();
get_ahm_time
------------------------------------------------
Current AHM Time: 2009-02-17 16:13:19.97574-05
(1 row)

-274-
SQL Functions

See Also
SET DATESTYLE (page 396) for information about valid TIMESTAMP (page 99) values.

GET_CURRENT_EPOCH
The GET_CURRENT_EPOCH function returns the number of the current epoch. The current
epoch is the epoch into which data (COPY, INSERT, UPDATE, and DELETE operations) is
currently being written. The current epoch advances automatically every three minutes.

Syntax
GET_CURRENT_EPOCH()

Examples
=> SELECT GET_CURRENT_EPOCH();
get_current_epoch
----------------------
(1 row)

GET_DESIGN_SCRIPT
Generates a SQL query that you can use to create the physical schema (projections).

Syntax
GET_DESIGN_SCRIPT ( design_context_name , design__name )

Parameters
design_context_name Specifies the name of the design context schema that contains the
design for which to generate a sql script.
design_name Specifies the specific design configuration for which to generate a
sql script.

Notes
§ To create a file that contains the contents of the query, use \o to redirect the results to a
file.

Example
The following example generates a SQL query for the VMartDesign in the vmart context.
SELECT GET_DESIGN_SCRIPT('vmart','VMartDesign');

-275-
SQL Reference Manual

GET_LAST_GOOD_EPOCH
The GET_LAST_GOOD_EPOCH function returns the number of the last good epoch. The last
good epoch is a term used in manual recovery and refers to the most recent epoch that can be
recovered.

Syntax
GET_LAST_GOOD_EPOCH()

Examples
=> SELECT GET_LAST_GOOD_EPOCH();
get_last_good_epoch
----------------------
Last Good Epoch: 5598
(1 row)

GET_NUM_ACCEPTED_ROWS
For the current session, returns the number of rows loaded into the database for the last
completed load.

Syntax
GET_NUM_ACCEPTED_ROWS();

Notes
• Only loads from STDIN or a single file on the initiator are supported. This function cannot be
called for multi-node loads.
• Information is not available for a load that is currently running. Check VT_LOAD_STREAMS
(page 476) for its status.
• Data regarding loads does not persist, and is dropped when a new load is initiated.

GET_NUM_REJECTED_ROWS
For the current session, returns the number of rows that were rejected during the last completed
load.

Syntax
GET_NUM_REJECTED_ROWS();

Notes
• Only loads from STDIN or a single file on the initiator are supported. This function cannot be
called for multi-node loads.
• Information is not available for a load that is currently running. Check VT_LOAD_STREAMS
(page 476) for its status.
• Data regarding loads does not persist, and is dropped when a new load is initiated.

-276-
SQL Functions

GET_PROJECTION_STATUS
Returns information relevant to the status of a projection.

Syntax
GET_PROJECTION_STATUS( [schema-name.]projection );

Parameters
[schema-name.]proje Is the name of the projection for which to display status. When
ction using more than one schema, specify the schema that contains
the projection.

Description
GET_PROJECTION_STATUS returns information relevant to the status of a projection:
• The current K-Safety status of the database
• The number of nodes in the database
• Whether or not the projection is segmented
• The number and names of buddy projections
• Whether or not the projection is safe
• Whether or not the projection is up-to-date

Notes
• You can use GET_PROJECTION_STATUS to monitor the progress of a projection data
refresh. See ALTER PROJECTION (page 314).
• When using GET_PROJECTION_STATUS or GET_PROJECTIONS you must provide the
name and node (for example, ABC_NODE01) instead of just ABC.
• To view a list of the nodes in a database, use the View Database Command in the
Administration Tools.

Examples
SELECT GET_PROJECTION_STATUS('t1_sp2');
get_projection_status
-----------------------------------------------------------------------------------------------
Current system K is 0.
# of Nodes: 1.
t1_sp2 [Segmented: No] [# of Buddies: 0] [No buddy projections] [Safe: Yes] [UptoDate: No]

See Also
ALTER PROJECTION (page 314)
GET_PROJECTIONS (page 277)

GET_PROJECTIONS, GET_TABLE_PROJECTIONS
Note: This function was formerly named GET_TABLE_PROJECTIONS(). Vertica still
supports the former function name.

-277-
SQL Reference Manual

Returns information relevant to the status of a table:


• The current K-Safety status of the database
• The number of sites (nodes) in the database
• The number of projections for which the specified table is the anchor table
• For each projection:
§ The projection's buddy projections
§ Whether or not the projection is segmented
§ Whether or not the projection is safe
§ Whether or not the projection is up-to-date

Syntax
GET_PROJECTIONS( [schema-name.]table )

Parameters
[schema-name.]table Is the name of the table for which to list projections. When
using more than one schema, specify the schema that
contains the table.

Notes
• You can use GET_PROJECTIONS to monitor the progress of a projection data refresh. See
ALTER PROJECTION (page 314).
• When using GET_PROJECTIONS or GET_PROJECTION_STATUS, you must provide the
name and node (for example, ABC_NODE01) instead of just ABC.
• To view a list of the nodes in a database, use the View Database Command in the
Administration Tools.

Examples
The following example gets information about the store_dimension table in the VMart schema:
SELECT GET_PROJECTIONS('store.store_dimension');
--------------------------------------------------------------------------------------
Current system K is 1.
# of Nodes: 4.
Table store.store_dimension has 4 projections.

Projection Name: [Segmented] [Seg Cols] [# of Buddies] [Buddy Projections] [Safe] [UptoDate]
----------------------------------------------------------
store.store_dimension_node0004 [Segmented: No] [Seg Cols: ] [K: 3] [store.store_dimension_node0003,
store.store_dimension_node0002, store.store_dimension_node0001] [Safe: Yes] [UptoDate: Yes][Stats:
Yes]
store.store_dimension_node0003 [Segmented: No] [Seg Cols: ] [K: 3] [store.store_dimension_node0004,
store.store_dimension_node0002, store.store_dimension_node0001] [Safe: Yes] [UptoDate: Yes][Stats:
Yes]
store.store_dimension_node0002 [Segmented: No] [Seg Cols: ] [K: 3] [store.store_dimension_node0004,
store.store_dimension_node0003, store.store_dimension_node0001] [Safe: Yes] [UptoDate: Yes][Stats:
Yes]
store.store_dimension_node0001 [Segmented: No] [Seg Cols: ] [K: 3] [store.store_dimension_node0004,
store.store_dimension_node0003, store.store_dimension_node0002] [Safe: Yes] [UptoDate: Yes][Stats:
Yes]
(1 row)

-278-
SQL Functions

See Also
ALTER PROJECTION (page 314)
GET_PROJECTION_STATUS (page 277)

IMPLEMENT_TEMP_DESIGN
Creates and implements a temporary physical schema design (projections).

Syntax
IMPLEMENT_TEMP_DESIGN ( 'table_name' )
IMPLEMENT_TEMP_DESIGN ( '' )

Parameters
table_name Specifies the name of the table for which to create a projection. If
an empty string is provided instead of a table name, Vertica
creates temporary projections for all the tables in all named
schema.

Notes
• Vertica does not create projections for any table that already has a safe super projection.
• This function returns the number of projections created.

INTERRUPT_STATEMENT
Interrupts the specified statement (within an external session), rolls back the current transaction,
and writes a success or failure message to the log file.

Syntax
INTERRUPT_STATEMENT( sessionid , stmtid )

Parameters
sessionid specifies the session to interrupt. This identifier is unique within
the cluster at any point in time.
stmtid specifies the statement to interrupt

Notes
• Only statements run by external sessions can be interrupted.
• Sessions can be interrupted during statement execution.
• If the stmtid is valid, the statement is interruptible. The command is successfully sent and
returns a success message. Otherwise the system returns an error.

Messages
Statement interrupt sent. Check SESSIONS for progress.
Success.

-279-
SQL Reference Manual

Session <id> could not be successfully interrupted: session not found


The session ID argument to the interrupt command does not match a running session.
Session <id> could not be successfully interrupted: statement not found
The statement ID does not (or no longer) matches the ID of a running statement (if any).
No interruptible statement running
If the statement is DDL or otherwise non-interruptible.
Internal (system) sessions cannot be interrupted.
The session is internal.

Examples
Two user session are open. RECORD 1 shows user session running SELECT FROM SESSION,
and RECORD 2 shows user session running COPY DIRECT:
=> select * from sessions;

-[ RECORD 1 ]
current_timestamp | 2008-04-01 15:00:20
node_name | site01
user_name | release
client_hostname | 127.0.0.1:57141
login_timestamp | 2008-04-01 14:41:26
session_id | rhel4-1-30361:0xd7e3e:994462853
transaction_start | 2008-04-01 14:48:54
transaction_id | 45035996273741092
transaction_description | user release (select * from sessions;)
statement_start | 2008-04-01 15:00:20
statement_id | 0
last_statement_duration | 1
current_statement | select * from sessions;
last_statement | select * from sessions;
|
-[ RECORD 2 ]
current_timestamp | 2008-04-01 15:00:20
node_name | site01
user_name | release
client_hostname | 127.0.0.1:57150
login_timestamp | 2008-04-01 14:58:56
session_id | rhel4-1-30361:0xd868a:1015659125
transaction_start | 2008-04-01 15:00:17
transaction_id | 45035996273741101
transaction_description | user release (COPY ClickStream_Fact FROM
|
'/scratch_b/qa/vertica/QA/VT_Scenario/data/clickstream
| /1g/ClickStream_Fact.tbl' DELIMITER '|' NULL '\\n'
DIRECT;)
statement_start | 2008-04-01 15:00:17
statement_id | 17179869529
last_statement_duration | 0
current_statement | COPY ClickStream_Fact FROM
'/scratch_b/qa/vertica/QA/

-280-
SQL Functions

|
VT_Scenario/data/clickstream/1g/ClickStream_Fact.tbl' DELIMITER '|'
| NULL '\\n' DIRECT;
last_statement |
Interrupt the COPY DIRECT statement running in session rhel4-1-30361:0xd868a:1015659125:
=> SELECT INTERRUPT_STATEMENT('rhel4-1-30361:0xd868a:1015659125',17179869529);
Interrupt_statement; statement interrupt sent. Check SESSIONS for progress.
Verify that the interrupted statement is no longer active. It appears in the last statement field:
=> SELECT * FROM SESSIONS;
-[ RECORD 1 ]
current_timestamp | 2008-04-01 15:01:09
node_name | site01
user_name | release
client_hostname | 127.0.0.1:57141
login_timestamp | 2008-04-01 14:41:26
session_id | rhel4-1-30361:0xd7e3e:994462853
transaction_start | 2008-04-01 14:48:54
transaction_id | 45035996273741092
transaction_description | user release (select * from sessions;)
statement_start | 2008-04-01 15:01:09
statement_id | 0
last_statement_duration | 99
current_statement | select * from sessions;
last_statement | select
interrupt_statement('rhel4-1-30361:0xd868a:1015659125',17179869529);
|
-[ RECORD 2 ]
current_timestamp | 2008-04-01 15:01:09
node_name | site01
user_name | release
client_hostname | 127.0.0.1:57150
login_timestamp | 2008-04-01 14:58:56
session_id | rhel4-1-30361:0xd868a:1015659125
transaction_start | 2008-04-01 15:00:17
transaction_id | 45035996273741101
transaction_description | user release (COPY ClickStream_Fact FROM
|
'/scratch_b/qa/vertica/QA/VT_Scenario/data/clickstream/
| 1g/ClickStream_Fact.tbl' DELIMITER '|' NULL '\\n'
DIRECT;)
statement_start | 2008-04-01 15:00:17
statement_id | 0
last_statement_duration | 44456
current_statement |
last_statement | COPY ClickStream_Fact FROM
'/scratch_b/qa/vertica/QA/
|
VT_Scenario/data/clickstream/1g/ClickStream_Fact.tbl' DELIMITER '|'
| NULL '\\n' DIRECT;

-281-
SQL Reference Manual

LOAD_DATA_STATISTICS
Loads data into the design context specified.

Syntax
LOAD_DATA_STATISTICS ( design_context_name , bool_value )

Parameters
design_context_name Specifies the name of the design context schema in which to load
database statistics.
bool_value Determines if the statistics are regenerated before loading them
into the design context
Use one of the following:
true (to regenerate the data statistics)
false

Notes
• To load data statistics, all database nodes must be up. Otherwise, the function will fail.
• By default, LOAD _DATA_STATISTICS loads existing data statistics for the design tables
into the design context. Use the regenerate_stats parameter (true) to update the statistics
before loading them into the data statistics table.
• It is necessary to regenerate statistics the first time you create a design context or whenever
you modify design tables. Depending upon the size of your database, gathering statistics can
take a significant amount of time. Once you gather data statistics, they persist for the
database. This means that they are used by every design created for the database, and you
do not need to regenerate them again unless you modify the tables that you have included in
one or more designs.
• Loading new or modified data stats could change the behavior of existing queries because
the optimizer uses these statistics.
• If this parameter is set to true, the function runs analyze_statistics on each design table.

Example
The following example regenerates and then loads data statistics into the vmart design context:
SELECT LOAD_DATA_STATISTICS('vmart', true);

See Also
ANALYZE_STATISTICS (page 247), READ_DATA_STATISTICS (page 291)

LOAD_DESIGN_QUERIES
Updates the specified query input table with queries from the query input file specified.

Syntax
LOAD_DESIGN_QUERIES ( query_table_name , file_name )

-282-
SQL Functions

Parameters
query_table_name Specifies the name of the query input table to update.

file_name Specifies the absolute path of the query input file from which to
obtain queries.

Notes
• LOAD_DESIGN_QUERIES is used to load initial queries at design creation into a design. It's
also used to add additional queries to designs. For example, you might want to:
§ Create targeted queries for a CEO and add them to a design.
§ Create additional queries after adding additional columns to one or more tables.
• The query input file has a 10 MB limit.
• You can use this function in conjunction with CREATE_DESIGN_QUERIES_TABLE (page
257) and SET_DESIGN_QUERIES_TABLE (page 303) to create a new query input table,
load it with data from the query repository, and establish it as the query input table for the
specified design. This enables you to work on a design even if you do not have access to
files on the database server.

Example
The following example loads queries from the QueryFile into a query input table named
vmart_query_input:
SELECT LOAD_DESIGN_QUERIES('vmart_query_input',
'/scratch/examples/VMart_Schema/QueryFile');

MAKE_AHM_NOW
Sets the Ancient History Mark (AHM) to the greatest allowable value, and lets you drop any
projections that existed before the issue occurred.
Caution: This function is intended only for users who administer databases on systems that
belong to them.

Syntax
MAKE_AHM_NOW()

Notes
• The MAKE_AHM_NOW function performs the following operations:
§ Advances the epoch.
§ Performs a moveout operation on all projections.
§ Sets the AHM to LGE, which should be, at least, the current epoch at the time
MAKE_AHM_NOW() was issued.
• All history will lost. Therefore, you will not be able to perform historical queries prior to the
current epoch.

-283-
SQL Reference Manual

• This function succeeds even when nodes are down. However, if the AHM is advanced in this
way while nodes are down, the nodes must recover all data from scratch.

Example
SELECT MAKE_AHM_NOW();
make_ahm_now
-------------------------------
AHM set (New AHM Epoch: 5613)
(1 row)

See Also
DROP PROJECTION (page 360)
MARK_DESIGN_KSAFE (page 285)
SET_AHM_EPOCH (page 298)
SET_AHM_TIME (page 300)

-284-
285

MARK_DESIGN_KSAFE
Enables or disables high availability in your environment, in case of a failure. Before enabling
recovery, MARK_DESIGN_KSAFE queries the catalog to determine whether a cluster's physical
schema design meets the following requirements:
• Dimension tables are replicated on all nodes.
• Fact table superprojections are segmented with each segment on a different node.
• Each fact table projection has at least one "buddy" projection for K-Safety=1 or two buddy
projections for K-Safety=2.
• Each segment of each fact table projection exists on two or three nodes. Two nodes are
required for K-Safety=1 and three nodes are required for K-Safety=2.
Projections are considered to be buddies if they contain the same columns and have the
same segmentation. They can have different sort orders.
MARK_DESIGN_KSAFE does not change the physical schema in any way.

Syntax
SELECT MARK_DESIGN_KSAFE(k)

Parameters
k 2 enables high availability if the schema design meets requirements for
K-Safety=2
1 enables high availability if the schema design meets requirements for
K-Safety=1
0 disables high availability

If you specify a k value of one (1) or two (2), Vertica returns one of the following messages.
Success:
Marked design n-safe
Failure:
The schema does not meet requirements for K=n.
Fact table projection projection-name
has insufficient "buddy" projections.
n in the message is 1 or 2 and represents the k value.

Notes
• The database's internal recovery state persists across database restarts but it is not checked
at startup time.
• If a database has had MARK_DESIGN_KSAFE enabled, you must temporarily disable
MARK_DESIGN_KSAFE before loading data into a new table with its corresponding buddy.
• When one node fails on a system marked K-safe=1, the remaining nodes are available for
DML operations.

-285-
SQL Reference Manual

Examples
> SELECT MARK_DESIGN_KSAFE(1);
mark_design_ksafe
----------------------
Marked design 1-safe
(1 row)
If the physical schema design is not K-Safe, messages indicate which projections do not have a
buddy:
> SELECT MARK_DESIGN_KSAFE(1);
The given K value is not correct; the schema is 0-safe
Projection pp1 has 0 buddies, which is smaller that the given K of 1
Projection pp2 has 0 buddies, which is smaller that the given K of 1
.
.
.
(1 row)

See Also
• High Availability and Recovery in the Concepts Guide.
• For information about monitoring K-Safety, see SYSTEM (page 461) in the SQL System
Tables (Monitoring API) (page 409).
• For information about designing segmented projections for K-Safety, see Using Identically
Segmented Projections in the Administrator's Guide.
• For information about troubleshooting K-Safety, see Failure Recovery in the Troubleshooting
Guide

MEASURE_LOCATION_PERFORMANCE
Measures disk performance for the location specified.

Syntax
MEASURE_LOCATION_PERFORMANCE ( path , [ node ] )

Parameters
path Specifies where the storage location to measure is
mounted.
node [Optional] Is the Vertica node where the location to be
measured is available.
If this parameter is omitted, node defaults to the initiator.

-286-
SQL Functions

Notes
• If you intend to create a tiered disk architecture in which projections, columns, and partitions
are stored on different disks based on predicted or measured access patterns, you need to
measure storage location performance for each location in which data will be stored. You do
not need to measure storage location performance for temp data storage locations because
temporary files are stored based on available space.
• This method of measuring storage location performance applies only to configured clusters. If
you want to measure a disk before configuring a cluster see ???.
• Storage location performance equates to the amount of time it takes to read a fixed amount
of data from the disk. This read time equates to the disk throughput in MB per second plus
the time it takes to seek data based on the number of seeks per second, as follows:
Read Time = Throughput (MB/second) + Latency (seeks/second)
Therefore, a disk is faster than another disk if its Read Time is smaller.

Example
The following example measures the performance of a storage location on node2:
SELECT MEASURE_LOCATION_PERFORMANCE('/secondVerticaStorageLocation/' , 'node2');
WARNING: measure_location_performance can take a long time. Please check logs for
progress
measure_location_performance
--------------------------------------------------
Throughput : 122 MB/sec. Latency : 140 seeks/sec

See Also
ADD_LOCATION (page 239), ALTER_LOCATION_USE, RETIRE_LOCATION (page 294), and
Measuring Location Performance.

MERGE_PARTITIONS
Merges ROSs that belong to partitions in a specified partition key range.

Syntax
MERGE_PARTITIONS [
... ( table_name ) ,
... ( partitionKeyFrom ) ,
... ( partitionKeyTo ) ]

Parameters
table_name Specifies the name of the table

partitionKeyFrom Specifies the start point of the partition

partitionKeyTo Specifies the end point of the partition

-287-
SQL Reference Manual

Description
MERGE_PARTITIONS merges ROSs that have data belonging to partitions in a specified
partition key range: [ partitionKeyFrom, partitionKeyTo ].

Notes
• Partitioning functions take invariant functions only, in order that the same information be
available across all nodes.
• The edge values are included in the range, and partitionKeyFrom must be less than or
equal to partitionKeyTo.
• Inclusion of partitions in the range is based on the application of less than(<)/greater than(>)
operators of the corresponding data type.
• If partitionKeyFrom is the same as partitionKeyTo, all ROSs of the partition key are
merged into one ROS.
• No restrictions are placed on a partition key's data type.

Examples
SELECT MERGE_PARTITIONS('T1', '200', '400');
SELECT MERGE_PARTITIONS('T1', '800', '800');
SELECT MERGE_PARTITIONS('T1', 'CA', 'MA');
SELECT MERGE_PARTITIONS('T1', 'false', 'true');
SELECT MERGE_PARTITIONS('T1', '06/06/2008', '06/07/2008');
SELECT MERGE_PARTITIONS('T1', '02:01:10', '04:20:40');
SELECT MERGE_PARTITIONS('T1', '06/06/2008 02:01:10', '06/07/2008 02:01:10');
SELECT MERGE_PARTITIONS('T1', '8 hours', '1 day 4 hours 20 seconds');

PARTITION_PROJECTION
Forces a split of ROS containers of the specified projection.

Syntax
PARTITION_PROJECTION ( projection_name )

Parameters
projection_name Specifies the name of the projection.

Notes
PARTITION_PROJECTION is similar to PARTITION_TABLE (page 289)(), except that
PARTITION_PROJECTION works only on the specified projection, instead of the table.
After a refresh completes, the refreshed projections go into a single ROS container. If the table
was created with a PARTITION BY clause, then you should call PARTITION_TABLE() or
PARTITION_PROJECTION() to reorganize the data into multiple ROS containers, since queries
on projections with multiple ROS containers perform better than queries on projections with a
single ROS container.
Partitioning functions take invariant functions only, in order that the same information be
available across all nodes.

-288-
SQL Functions

See Also
DO_TM_TASK (page 263)
DROP_PARTITION (page 266)
DUMP_PARTITION_KEYS (page 270)
DUMP_PROJECTION_PARTITION_KEYS (page 270)
DUMP_TABLE_PARTITION_KEYS (page 271)
PARTITION_TABLE (page 289)
Partitioning Tables in the Administrator's Guide

PARTITION_TABLE
Forces the system to break up any ROSs that contain multiple distinct values of the partitioning
expression. Only ROS containers with more than one distinct value participate in the split.

Syntax
PARTITION_TABLE ( table_name )

Parameters
table_name Specifies the name of the table.

Notes
PARTITION_TABLE is similar to PARTITION_PROJECTION (page 288), except that
PARTITION_TABLE works on the specified table.
After a refresh completes, the refreshed projections go into a single ROS container. If the table
was created with a PARTITION BY clause, then you should call PARTITION_TABLE() or
PARTITION_PROJECTION() to reorganize the data into multiple ROS containers, since queries
on projections with multiple ROS containers perform better than queries on projections with a
single ROS container.
Partitioning functions take invariant functions only, in order that the same information be
available across all nodes.

See Also
DO_TM_TASK (page 263)
DROP_PARTITION (page 266)
DUMP_PARTITION_KEYS (page 270)
DUMP_PROJECTION_PARTITION_KEYS (page 270)
DUMP_TABLE_PARTITION_KEYS (page 271)
PARTITION_PROJECTION (page 288)

-289-
SQL Reference Manual

Partitioning Tables in the Administrator's Guide

PURGE
Purges all projections in the physical schema. A purge operation permanently removes deleted
data from physical storage so that the disk space can be reused. You can purge historical data
up to and including the epoch in which the Ancient History Mark is contained.

Syntax
PURGE()

Notes
This function was formerly named PURGE_ALL_PROJECTIONS. Vertica supports both function
calls.

See Also
PURGE_PROJECTION (page 290)
PURGE_TABLE (page 290)
Purging Deleted Data in the Administrator's Guide.

PURGE_PROJECTION
Purges the specified projection. A purge operation permanently removes deleted data from
physical storage so that the disk space can be reused. You can purge historical data up to and
including the epoch in which the Ancient History Mark is contained.

Syntax
PURGE_PROJECTION ( [schema-name.]projection )

Parameters
projection Is the name of a specific projection. When using more than
one schema, specify the schema that contains the projection.

See Also
Purging Deleted Data in the Administrator's Guide

PURGE_TABLE
Purges all projections of the specified table. A purge operation permanently removes deleted
data from physical storage so that the disk space can be reused. You can purge historical data
up to and including the epoch in which the Ancient History Mark is contained.

Syntax
PURGE_TABLE ( [schema-name.]table-name )

-290-
SQL Functions

Parameters
table-name Is the name of a specific table in the Logical Schema. When
using more than one schema, specify the schema that
contains the projection.

Notes
This function was formerly named PURGE_TABLE_PROJECTIONS(). Vertica still supports the
old name.

Example
The following example purges all projections for the store sales fact table located in the Vmart
schema:
PURGE_TABLE('store.store_sales_fact');

See Also
Purging Deleted Data in the Administrator's Guide

READ_DATA_STATISTICS
Loads data statistics for each design table from an external XML file into the design context
specified.

Syntax
READ_DATA_STATISTICS ( design_context_name , stats_file_name )

Parameters
design_context_name Specifies the name of the design context schema in which to load
database statistics.
stats_file_name Specifies the file path for the XML file to pass in.

Notes
• To create an XML file that contains data statistics, run ANALYZE_STATISTICS (page 247)
to collect and aggregate data samples and storage information. Then use
EXPORT_STATISTICS (page 273) to export this data to an XML file. If you do not use
ANALYZE_STATISTICS, Database Designer will produce a sub-optimal projection similar to
the ones created for temporary designs.
• If a set of data statistics already exists for the design context, it is deleted before the data is
read.

See Also
LOAD_DATA_STATISTICS (page 282), ANALYZE_STATISTICS (page 247), and
EXPORT_STATISTICS (page 273)

-291-
SQL Reference Manual

REENABLE_DUPLICATE_KEY_ERROR
Reenables the default behavior of error reporting by reversing the effects of
DISABLE_DUPLICATE_KEY_ERROR. Effects are session scoped.

Syntax
SELECT REENABLE_DUPLICATE_KEY_ERROR();

Examples
For examples and usage see DISABLE_DUPLICATE_KEY_ERROR (page 261).

See Also
ANALYZE_CONSTRAINTS (page 241)

REMOVE_DESIGN
Permanently removes all design repository records associated with the named physical schema
design.

Syntax
REMOVE_DESIGN ( design_context_name , design__name )

Parameters
design_context_name Specifies the name of the design context schema that contains the
design to remove.
design_name Specifies the specific design to delete.

EXAMPLE
The following example removes the VMartDesign from the vmart context schema.
SELECT REMOVE_DESIGN('vmart','VMartDesign');

REMOVE_DESIGN_CONTEXT
Drops the design context schema with the specified name.

Syntax
REMOVE_DESIGN_CONTEXT ( design_context_name )

Parameters
design_context_name Specifies the name of the design context schema to drop.

Notes
• The design context schema and all its corresponding design configurations are dropped.
• This is a permanent operation.

-292-
SQL Functions

Restrictions
Only the database administrator can drop a design context schema.

Example
The following drops the design context schema for the VMart demo database.
SELECT REMOVE_DESIGN_CONTEXT('vmart');

See Also
CLEAR_DESIGN_TABLES (page 249) and REMOVE_DESIGN (page 292)

REMOVE_DEPLOYMENT_ENTRY
Removes a projection entry from the <designName>_deployment_projections table, thus
removing it from the next deployment.

Syntax
REMOVE_DEPLOYMENT_ENTRY ( design_context_name , design_name , projection_name )

Parameters
design_context_name Specifies the name of the design context schema that contains the
design to modify.
design_name Specifies the specific design to modify.
projection_name Specifies the projection entry to be dropped.

Notes
• Removing a projection entry prevents the corresponding drop from being implemented for the
projection. Therefore, if the projection was supposed to be dropped, it will remain in the
deployment.

Example
The following removes the customer_tmp_dev02 projection from the
VMartDesign_deployment_projections table:
SELECT REMOVE_DEPLOYMENT_ENTRY('vmart','VMartDesign','customer_tmp_dev02');

RESET_DESIGN_QUERIES_TABLE
Removes the query input table from the design configuration.

Syntax
RESET_DESIGN_QUERIES_TABLE ( design_context_name , design_name )

Parameters
design_context_name Specifies the name of the design context schema to update.

-293-
SQL Reference Manual

design_name Specifies the name of the design configuration to update.

Notes
This is useful for changing an optimized design into a basic design.

Example
The following example removes the query input table from the VMartDesign configuration:
SELECT RESET_DESIGN_QUERIES_TABLE('vmart','VMartDesign');

See Also
CREATE_DESIGN_QUERIES_TABLE (page 257), SET_DESIGN_QUERIES_TABLE (page
303)

RESTORE_LOCATION
Restores the retired location specified.

Syntax
RESTORE_LOCATION_USE ( path , [ node ] )

Parameters
path Specifies where the retired storage location is mounted.
node [Optional] Is the Vertica node where the retired location
is available.
If this parameter is omitted, node defaults to the initiator.

Notes
Once restored, Vertica will re-rank the storage locations and use the restored location to process
queries as determined by its rank.

Example
The following example restores the retired storage location on node3:
SELECT RESTORE_LOCATION ('/thirdVerticaStorageLocation/' , 'node3');

See Also
ADD_LOCATION (page 239), RETIRE_LOCATION (page 294), and Modifying Storage
Locations

RETIRE_LOCATION
Makes the specified storage location inactive.

Syntax
RETIRE_LOCATION ( 'path' , 'site' )

-294-
SQL Functions

Parameters
path Specifies where the storage location to retire is
mounted.
site Is the Vertica site where the location is available.

Notes
• Before retiring a location, be sure that at least one location will remain for storing data and
temp files. Data and temp files can be stored in either one storage location or separate
storage locations.
• Once retired, no new data can be stored on the location unless the location is restored
through the RESTORE_LOCATION (page 294) function.
• If the storage location stored data, the data is not moved. Instead, it is removed through one
or more mergeouts. Therefore, the location cannot be dropped.
• If the storage site was used to store only temp files, it can be dropped. See Dropping Storage
Locations in the Administrators Guide and the DROP_LOCATION (page 265) function.

Example
SELECT RETIRE_LOCATION ('/secondVerticaStorageLocation/' , 'node2');

See Also
• ADD_LOCATION (page 239) and RESTORE_LOCATION (page 294) in this SQL Reference
Guide
• Retiring Storage Locations in the Administrator's Guide

REVERT_DEPLOYMENT
Prepares a previously deployed design to be dropped from the deployment database and
(optionally) drops it.

Syntax
REVERT_DEPLOYMENT ( design_context_name , design_name , [ bool_value ] )

Parameters
design_context_na Specifies the name of the design context schema that contains the
me design to drop.
design_name Specifies the name of the design to drop.
bool_value Determines if the function calls RUN_DEPLOYMENT (true) or not
(false) to drop the design.
By default bool_value is true.

-295-
SQL Reference Manual

Notes
• REVERT_DEPLOYMENT prepares a design to be dropped by:
§ Setting the deployment status of the design to pending. See the
design_<design_name>_deployment view.
§ Determining all the projections that were deployed for the design and creating a 'drop'
record for each of these projections in the
design_<design_name>_deployment_projections table.
§ By default, calling RUN_DEPLOYMENT to drop the design.

Example
The following examples both prepare the VMartDesign (from the vmart context) to be dropped
and then call RUN_DEPLOYMENT to drop it.
SELECT REVERT_DEPLOYMENT('vmart','VMartDesign');
SELECT REVERT_DEPLOYMENT('vmart','VMartDesign',true);

See Also
DEPLOY_DESIGN (page 259) and RUN_DEPLOYMENT (page 296)

RUN_DEPLOYMENT
Implements the design deployment created through either DEPLOY_DESIGN or
DROP_DESIGN_DEPLOYMENT.

Syntax
RUN_DEPLOYMENT ( design_context_name , design_name , [ bool_value ] )

Parameters
design_context_na Specifies the name of the design context schema that contains the
me design to deploy.
design_name Specifies the name of the design to deploy.
bool_value Specifies whether existing projections are dropped (true) or not (false)
during deployment.
By default, bool_value is true.
Tip: Setting bool_value to false gives you the opportunity to test the
new design without dropping the original design. See Deploying Test
Designs.

Notes
• RUN_DEPLOYMENT implements the design by:
§ Implementing all new projections with an operation status of add. See the
design_<design_name>_deployment_projections for more information about the
operation status.

-296-
SQL Functions

§ By default, dropping all projections with an operation status of drop. Typically these are all
pre-existing projections from the deployment database.
Tip: You can use the REMOVE_DEPLOYMENT_ENTRY function to prevent specific
projections from being added or dropped. This enables you to fine tune the deployment.
§ Refreshes new projections if necessary.
Note: If you attempt to deploy a design created in Vertica version 3.0, Vertica displays an
error indicating that you must regenerate the design. This occurs because there is no
deployment system table for the design. Use either CREATE_DESIGN (page 255) or
UPDATE_DESIGN (page 310).

Example
The following examples both deploy the VMartDesign from the vmart context. They also drop all
deployed projections from pre-existing deployments.
SELECT RUN_DEPLOYMENT('vmart','VMartDesign');
SELECT RUN_DEPLOYMENT('vmart','VMartDesign',true);

See Also
DEPLOY_DESIGN (page 259), CANCEL_DEPLOYMENT (page 248) and
REVERT_DEPLOYMENT (page 295)

SAVE_DESIGN_VERSION
Generates a backup physical schema design.

Syntax
SAVE_DESIGN_VERSION ( design_context_name , existing_design__name ,
saved_design__name )

Parameters
design_context_name Specifies the name of the design context schema that contains
the design to back up.
existing_design_name Specifies the name of the design to back up.
saved_design_name Specifies a unique name for the backup design.

Notes
• The entire design configuration is saved including all design context system tables related to
the design as well as query information.
• Once saved, you can export a design candidate for testing or deployment. You can also use
it for design migration.

SAVE_QUERY_REPOSITORY
Triggers Vertica to save query data to the query repository immediately.

-297-
SQL Reference Manual

Syntax
SAVE_QUERY_REPOSITORY()

Notes
Vertica saves data based on the established query repository configuration parameters. For
example, it will use the value of the QueryRepoRetentionTime parameter to determine the
maximum number of days worth of queries to save. (See Configuring Query Repository.)

See Also
Collecting Query Information

SELECT CURRENT_SCHEMA
Shows the resolved name of $User.

Syntax
SELECT CURRENT_SCHEMA

Notes
If the search path for USER1 is: $USER, COMMON, PUBLIC:
SELECT CURRENT_SCHEMA() returns the following output if schema USER1 exists:
USER1
If schema USER1 does not exist, it returns the following output:
COMMON

Example
SELECT CURRENT_SCHEMA();
current_schema
----------------
public
(1 row)

SET_AHM_EPOCH
Sets the Ancient History Mark (AHM) to the specified epoch. This function allows deleted data up
to and including the AHM epoch to be purged from physical storage.
SET_AHM_EPOCH is normally used for testing purposes. Consider SET_AHM_TIME (page
300) instead, which is easier to use.

Syntax
SET_AHM_EPOCH ( epoch, [ true ] )

Parameters
epoch Specifies one of the following:
The number of the epoch in which to set the AHM

-298-
SQL Functions

Zero (0) (the default) disables purge (page 290)


true [Optional] Allows the AHM to advance when nodes are down.
Note: If the AHM is advanced after the last good epoch of the
failed nodes, those nodes must recover all data from scratch.

Notes
If you use SET_AHM_EPOCH , the number of the specified epoch must be:
• Greater than the current AHM epoch
• Less than the current epoch
• Less than or equal to the cluster last good epoch (the minimum of the last good epochs of
the individual nodes in the cluster)
• Less than or equal to the cluster refresh epoch (the minimum of the refresh epochs of the
individual nodes in the cluster)
Use the SYSTEM (page 461) table to see current values of various epochs related to the AHM;
for example:
SELECT * from SYSTEM;
-[ RECORD 1 ]------------+---------------------------
current_timestamp | 2009-08-11 17:09:54.651413
current_epoch | 1512
ahm_epoch | 961
last_good_epoch | 1510
refresh_epoch | -1
designed_fault_tolerance | 1
node_count | 4
node_down_count | 0
current_fault_tolerance | 1
catalog_revision_number | 1590
wos_used_bytes | 0
wos_row_count | 0
ros_used_bytes | 41490783
ros_row_count | 1298104
total_used_bytes | 41490783
total_row_count | 1298104
All nodes must be up. You cannot use SET_AHM_EPOCH when any node in the cluster is
down, except by using the optional true parameter.
When a node is down and you issue SELECT MAKE_AHM_NOW(), the following error is printed to
the vertica.log:
Some nodes were excluded from setAHM. If their LGE is before the AHM they will perform
full recovery.

Examples
The following command sets the AHM to a specified epoch of 12:
SELECT SET_AHM_EPOCH(12);

-299-
SQL Reference Manual

The following command sets the AHM to a specified epoch of 2 and allows the AHM to advance
despite a failed node:
SELECT SET_AHM_EPOCH(2, true);

See Also
MAKE_AHM_NOW (page 283)
SET_AHM_TIME (page 300)
SYSTEM (page 461)

SET_AHM_TIME
Sets the Ancient History Mark (AHM) to the epoch corresponding to the specified time on the
initiator node. This function allows historical data up to and including the AHM epoch to be
purged from physical storage.

Syntax
SET_AHM_TIME ( time , [ true ] )

Parameters
time Is a TIMESTAMP (page 99) value that is automatically
converted to the appropriate epoch number.
true [Optional] Allows the AHM to advance when nodes are down.
Note: If the AHM is advanced after the last good epoch of the
failed nodes, those nodes must recover all data from scratch.

Notes
SET_AHM_TIME returns a TIMESTAMP WITH TIME ZONE value representing the end point of
the AHM epoch.
You cannot change the AHM when any node in the cluster is down, except by using the optional
true parameter.
When a node is down and you issue SELECT MAKE_AHM_NOW(), the following error is printed to
the vertica.log:
Some nodes were excluded from setAHM. If their LGE is before the AHM they will perform full
recovery.

Examples
Epochs depend on a configured epoch advancement interval. If an epoch includes a
three-minute range of time, the purge operation is accurate only to within minus three minutes of
the specified timestamp:
SELECT SET_AHM_TIME('2008-02-27 18:13');
set_ahm_time
------------------------------------

-300-
SQL Functions

AHM set to '2008-02-27 18:11:50-05'


(1 row)
Note: The -05 part of the output string is a time zone value, an offset in hours from UTC
(Universal Coordinated Time, traditionally known as Greenwich Mean Time, or GMT).
In the above example, the actual AHM epoch ends at 18:11:50, roughly one minute before the
specified timestamp. This is because SET_AHM_TIME selects the epoch that ends at or before
the specified timestamp. It does not select the epoch that ends after the specified timestamp
because that would purge data deleted as much as three minutes after the AHM.
For example, using only hours and minutes, suppose that epoch 9000 runs from 08:50 to 11:50
and epoch 9001 runs from 11:50 to 15:50. SET_AHM_TIME('11:51') chooses epoch 9000
because it ends roughly one minute before the specified timestamp.
In the next example, if given an environment variable set as date =`date`; the following
command fails if a node is down:
SELECT SET_AHM_TIME('$date');
In order to force the AHM to advance, issue the following command instead:
SELECT SET_AHM_TIME('$date', true);

See Also
MAKE_AHM_NOW (page 283)
SET_AHM_EPOCH (page 298) for a description of the range of valid epoch numbers.
SET DATESTYLE (page 396) for information about specifying a TIMESTAMP (page 99) value.

SET_DESIGN_KSAFETY
Sets the K-safety value for the physical schema design.

Syntax
SET_DESIGN_KSAFETY ( design_context_name , design_name , int_ksafety_value )

Parameters
design_context_name Specifies the name of the design context schema to update.
design_name Specifies the name of the design configuration to update.
int_ksafety_value Sets the K-safety value to 0, 1, or 2.
Vertica uses the following defaults:
For a cluster that consists of one or two nodes, the default is zero
(0).
For a cluster of three or more nodes, the default is one (1).

Notes
In Vertica, the value of K can be 0, 1, or 2. The value of K can be one 1 or two 2 only when the
physical schema design meets certain requirements. See K-Safety for an overview.

-301-
SQL Reference Manual

Example
The following example sets the K-safety for the design to two (2):
SELECT SET_DESIGN_KSAFETY('vmart','VMartDesign', 2);

SET_DESIGN_LOG_FILE
Specifies the location where the debug log (designer.log) for the physical schema design is
created.

Syntax
SET_DESIGN_LOG_FILE ( design_context_name , design_name , log_file_name )

Parameters
design_context_name Specifies the name of the design context schema to modify.
design_name Specifies the name of the design configuration to modify.
log_file_name Specifies the location where the debug log for the design is
created. By default, designer.log is located in the same directory
as vertica.log.
If this string is empty, this function resets the location of the log file
back to the default location.

See Also
SET_DESIGN_LOG_LEVEL (page 302)

SET_DESIGN_LOG_LEVEL
Specifies the level of detail Database Designer writes to the log file for the physical schema
design.

Syntax
SET_DESIGN_LOG_LEVEL ( design_context_name , design_name , string_log_level )

Parameters
design_context_name Specifies the name of the design context schema to update.
design_name Specifies the name of the design configuration to update.
string_log_level Sets the log level value to BASIC or DEBUG.
The default is BASIC.

Example
The following example sets the log level for the design to DEBUG:
SELECT SET_DESIGN_LOG_LEVEL('vmart','VMartDesign','DEBUG');

-302-
SQL Functions

See Also
SET_DESIGN_LOG_FILE (page 302)

SET_DESIGN_PARAMETER
Sets the specified physical schema design parameter to true or false.

Syntax
SET_DESIGN_PARAMETER ( design_context_name , design_name , parameter_name ,
bool_value )

Parameters
design_context_name Specifies the name of the design context schema to update.
design_name Specifies the name of the design configuration to update.
parameter_name Sets one of the following parameters:
create_prejoins--Determines whether pre-join projections are
created (true) or not (false). By default, Database Designer makes
pre-joins as an optimization (true). However, you can turn them
off.
sort_buddies_same--Determines whether the same sort order is
used for buddy projections (true) or not (false). The default is false.
bool_value One of the following:
true
false

Notes
By default, Database Designer uses different sort orders for buddies to offer slightly different
optimizations, each of which may favor one type of query. If you are using queries with the same
sort order, you would not want the buddy projections to use different sort orders. In this case,
using buddy projections with the same sort order provides the least impact on performance if a
node goes down because queries will not have to be resorted.

Example
The following example configures a design that contains no pre-join projections:
SELECT SET_DESIGN_PARAMETER('vmart','VMartDesign','create_prejoins',false);

SET_DESIGN_QUERIES_TABLE
Establishes the specified table as the query input table for the physical schema design.

Syntax
SET_DESIGN_QUERIES_TABLE ( design_context_name , design_name , query_table_name
)

-303-
SQL Reference Manual

Parameters
design_context_name Specifies the name of the design context schema to modify.
design_name Specifies the name of the design configuration to modify.
query_table_name Specifies the name of the query input table to use for the design
configuration.

Notes
This function is used in conjunction with CREATE_DESIGN_QUERIES_TABLE (page 257) and
LOAD_DESIGN_QUERIES (page 282) to create a new query input table, load it with data from
the query repository, and establish it as the query input table for the specified design. This
enables you to work on a design even if you do not have access to files on the database server.

Example
The following example sets the query input table for the VMartDesign with the vmart_query_input
table.
SELECT SET_DESIGN_QUERIES_TABLE('vmart','VMartDesign','vmart_query_input');

See Also
CREATE_DESIGN_QUERIES_TABLE (page 257), RESET_DESIGN_QUERIES_TABLE (page
293)

SET_DESIGN_QUERY_CLUSTER_LEVEL
Changes the number of query clusters used to the number specified.

Syntax
SET_DESIGN_QUERY_CLUSTER_LEVEL ( 'design_context_name' , 'design_name' ,
int_cluster_level )

Parameters
design_context_name Specifies the name of the design context schema to update.
design_name Specifies the name of the design configuration to update.
int_cluster_level Specifies the number of query clusters for the design.
The default is one (1).

Notes
A query cluster level determines the number of sets used to group similar queries. The query
cluster level can be any integer from one (1) to the number of queries to be included in the
physical schema design.

-304-
SQL Functions

Queries are generally grouped based on the columns they access and the way in which they are
used. The following work loads typically use different types of queries and are placed in different
query clusters: drill downs, large aggregations, and large joins. For example, if a reporting tool
and dashboard both access the same database, the reporting tool is likely to use a drill down to
access a subset of data and the dashboard is likely to use a large aggregation to look across a
large range of data. In this case, there would be at least two (2) query clusters.

Examples
The following example creates a design with three (3) query clusters:
SELECT SET_DESIGN_QUERY_CLUSTER_LEVEL('vmart','VMartDesign',3);

SET_DESIGN_SEGMENTATION_COLUMN
Specifies a column to use as the segmentation column for the table, rather than relying on
Database Designer to choose it automatically.

Syntax
SET_DESIGN_SEGMENTATION_COLUMN ( 'design_context_name' , 'design_name' ,
'table_name' , 'column_name' )

Parameters
design_context_name Specifies the name of the design context schema to modify.
design_name Specifies the name of the design configuration to modify.
table_name Specifies the name of the design table that contains the column to
use for segmentation.
column_name Specifies the name of column to use for segmentation.

Notes
• The segmentation column is used to segment projection across nodes. Use a segmentation
column that has a high number of distinct values.
• To specify more than one column, make additional calls to the
SET_DESIGN_SEGMENTATION_COLUMN function. You can specify columns on the same
table (which will automatically be combined) or other tables.

Example
The following example specifies that product_price column in the Public.product_dimension table
be used to segment the table.
SELECT
SET_DESIGN_SEGMENTATION_COLUMN('vmart','VMartDesign','Public.product_dimension
','product_price');

See Also
CLEAR_DESIGN_SEGMENTATION_TABLE (page 249)

-305-
SQL Reference Manual

SET_DESIGN_SEGMENTATION_TABLE
Determines whether or not the specified table will be segmented.

Syntax
SET_DESIGN_SEGMENTATION_TABLE ( design_context_name , table_name , bool_value )

Parameters
design_context_name Specifies the name of the design context schema to modify.

table_name Specifies the entry within the design context to modify.


bool_value One of the following:
true (to segment the table)
false
The default is true if left unspecified.

Notes
• The table specified by the entry is marked as requesting segmentation.

Examples
The following examples both mark the store.store_orders_fact table within the vmart design
context for segmentation:
SELECT SET_DESIGN_SEGMENTATION_TABLE(
'vmart','store.store_orders_fact'
);
SELECT SET_DESIGN_SEGMENTATION_TABLE(
'vmart','store.store_orders_fact',true
);

SET_DESIGN_TABLE_ROWS
Sets the number of anticipated rows for the specified table.

Syntax
SET_DESIGN_TABLE_ROWS ( design_context_name , table_name , num_rows )

Parameters
design_context_name Specifies the name of the design context schema to modify.
table_name Specifies the table within the design context to modify.
num_rows Specifies the number of anticipated rows for the table.

-306-
SQL Functions

Notes
• If the number of table rows will be an order of magnitude larger or smaller than the sample
provided for data statistics, use this function to supply the anticipated number of rows. An
approximation is okay.
• Database Designer uses this information to optimize the design for that number of rows.

Example
The following example sets the anticipated number of rows for the store.store_orders_fact entry
within the vmart design context to five million (5,000,000).
SELECT SET_DESIGN_TABLE_ROWS('vmart','store.store_orders_fact',5000000);

SET_LOCATION_PERFORMANCE
Sets disk performance for the location specified.

Syntax
SET_LOCATION_PERFORMANCE ( path , node , throughput , avg latency )

Parameters
node Is the Vertica node where the location to be set is
available.
If this parameter is omitted, node defaults to the initiator.
path Specifies where the storage location to set is mounted.

throughput Specifies the throughput for the location. The throughput


must be 1 or more.

avg latency Specifies the average latency for the location. The
avg_latency must be 1 or more.

Notes
To obtain the throughput and avg latency for the location, run the
MEASURE_LOCATION_PERFORMANCE (page 286) function before attempting to set the
location's performance.

Example
The following example sets the performance of a storage location on node2 to a throughput of
122 MB/second and a latency of 140 seeks/second.
SELECT
MEASURE_LOCATION_PERFORMANCE('node2','/secondVerticaStorageLocation/','122','1
40');

-307-
SQL Reference Manual

See Also
• ADD_LOCATION (page 239) and MEASURE_LOCATION_PERFORMANCE (page 286) in
this guide.
• Measuring Location Performance and Setting Location Performance in the Administrator's
Guide.

START_REFRESH
Transfers data to projections that are not able to participate in query execution due to missing or
out-of-date data.

Syntax
START_REFRESH()

Notes
• All nodes must be up in order to start a refresh.
• A refresh can start when
§ A new projection is created on tables that already have data
§ The newly-added projection has an existing, up-to-date buddy projection or a
newly-added buddy projection
§ The MARK_DESIGN_KSAFE (page 285) function is called to validate the K-safety of the
design
• START_REFRESH has no effect if a refresh is already running.
• Shutting down the database ends the refresh.
• To view the progress of the refresh, see the PROJECTION_REFRESHES (page 446) and
PROJECTIONS (page 416) SQL system tables.
• If a projection is updated from scratch, the data stored in the projection represents the table
columns as of the epoch in which the refresh commits. As a result, the query optimizer might
not choose the new projection for AT EPOCH queries that request historical data at epochs
older than the refresh epoch of the projection. Projections refreshed from buddies retain
history and can be used to answer historical queries.
• After a refresh completes, the refreshed projections go into a single ROS container. If the
table was created with a PARTITION BY clause, then you should call PARTITION_TABLE()
or PARTITION_PROJECTION() to reorganize the data into multiple ROS containers, since
queries on projections with multiple ROS containers perform better than queries on
projections with a single ROS container.

See Also
MARK_DESIGN_KSAFE (page 285), PROJECTION_REFRESHES (page 446) and
PROJECTIONS (page 416)

-308-
SQL Functions

SYNC_CURRENT_DESIGN
Creates a physical schema design with the projections from a deployed design. This allows the
user to load the deployed design into designer system tables where it can be modified through
the standard Database Designer API.

Syntax
SYNC_CURRENT_DESIGN( design_context_name , design_name )

Parameters
design_context_name Specifies the name of the design context in which to create the
design.
design_name Specifies the name of the design to create.

Notes
• This is useful for capturing a deployed design. For example, a database administrator might
want a designer to modify a deployed design. In this case, the database administrator would
call SYNC_CURRENT_DESIGN and then, perhaps, export the design for the designer to
work on elsewhere.
• Vertica returns an error if the design context does not exist.

See Also
UPDATE_DESIGN (page 310)

TEMP_DESIGN_SCRIPT
Generates a SQL query that you can use to create a temporary projection.

Syntax
TEMP_DESIGN_SCRIPT ( 'table_name' )

Parameters
table_name Specifies the name of the table to include in the design. If an
empty string is provided instead of a table name, Vertica includes
all the tables in the schema in the design.

Notes
• If you specify a table, Vertica generates a script for the table whether or not it has a safe
super projection. If you specify an empty string instead of a table name, Vertica generates
scripts only for tables that do not already have a safe super projection.
• This function returns a script that contains all the SQL queries it generates. This query is in
the form:
CREATE PROJECTION fact_super
(

-309-
SQL Reference Manual

column-list,
...
)
AS SELECT * FROM fact
UNSEGMENTED ALL NODES;
Tip: To create a file that contains the query, use \o to redirect the results to a file.

UPDATE_DESIGN
Generates an updated physical schema design that contains the differences between two
designs.

Syntax
UPDATE_DESIGN ( 'design_context_name' , 'base_design__name' ,
'updated_design_name' )

Parameters
design_context_name Specifies the name of the design context schema in which to
update a design.
base_design_name Specifies the name of the original base design.
updated_design_name Specifies the name of the new design.

Notes
• To update a design, the general process is to create a new design and then use the
UPDATE_DESIGN function to:
§ Compare the original design to the new design.
§ Generate an incremental design that contains the difference between the two designs.
• Once the incremental design is generated, you deploy it on top of the original design to
implement your modifications.
• To create an incremental design, Database Designer
§ Excludes base design projections for design tables that have been removed.
§ Adds projections for newly added design tables. (This includes applying queries from the
base design that are anchored on these tables (if they are eligible anchor tables).)
§ Excludes or adds projections as appropriate if there is a difference in k-safety between
the base and new designs.
§ If the query input table for the new design is not empty, creating projections optimized for
these queries.

Example
The following example creates an incremental design based on a comparison of VMartDesign
and NewVMartDesign.

-310-
SQL Functions

SELECT UPDATE_DESIGN('vmart','VMartDesign','NewVMartDesign');

See Also
SYNC_CURRENT_DESIGN (page 309)

WAIT_DEPLOYMENT
Returns zero (0) when the deployment is complete.

Syntax
SELECT WAIT_DEPLOYMENT()

Notes
This function is useful for using a script to deploy a design. It will not return until the deployment
is complete.

See Also
RUN_DEPLOYMENT (page 296) and DEPLOY_DESIGN (page 259)

-311-
SQL Statements
The primary structure of a SQL query is its statement. Multiple statements are separated by
semicolons; for example:
CREATE TABLE fact ( ..., date_col date NOT NULL, ...);
CREATE TABLE fact(..., state VARCHAR2 NOT NULL, ...);

-313-
SQL Reference Manual

ALTER PROJECTION
Initiates a rename operation on the specified projection:

Syntax
ALTER PROJECTION projection {
| RENAME TO new-projection-name }

Parameters
projection Specifies the name of a projection.
RENAME TO Changes the name of the projection to the specified name.

See Also
• Use DO_TM_TASK (page 263) to run a Tuple Mover operation (moveout) on one or more
projections defined on a specified table.
• See Understanding the Automatic Tuple Mover in the Administrator's Guide for an
explanation of how to use manual Tuple Mover control.
• See Purging Deleted Data in the Administrator's Guide for more information about purge
operations.

ALTER SCHEMA
Renames one or more existing schemas.

Syntax
ALTER SCHEMA schemaname1 [, schemaname2, schemaname3 ]{
RENAME TO new-schema-name1 [, new-schema-name2, new-schema-name3 ]}

Parameters
schemaname Specifies the name of one or more schemas to rename.
RENAME TO Specifies one or more new schema names.
The lists of schemas to rename and the new schema names
are parsed from left to right and matched accordingly using
one-to-one correspondence.
When renaming schemas, be sure to follow these
standards:
The number of schemas to rename must match the number
of new schema names supplied.
The new schema names must not already exist.
The RENAME TO parameter is applied atomically. Either all
the schemas are renamed or none of the schemas are
renamed. If, for example, the number of schemas to rename

-314-
SQL Statements

does not match the number of new names supplied, none of


the schemas are renamed.
Note: Renaming a schema that is referenced by a view will
cause the view to fail unless another schema is created to
replace it.

Notes
• Only the superuser or schema owner can use the ALTER SCHEMA command.
• Renaming schemas does not affect existing prejoin projections because prejoin projections
refer to schemas by the schemas' unique numeric IDs (OIDs), and the OIDs for schemas are
not changed by ALTER SCHEMA.

Tip
Renaming schemas is useful for swapping schemas without actually moving data. To facilitate
the swap, enter a non-existent, temporary placeholder schema. The following example uses the
temporary schema temps to facilitate swapping schema S1 with schema S2. In this example, S1
is renamed to temps. Then S2 is renamed to S1. Finally, temps is renamed to S2.
ALTER SCHEMA S1, S2, temps
RENAME TO temps, S1, S2;

Examples
The following example renames schema S1 to S3 and schema S2 to S4:
ALTER SCHEMA S1, S2
RENAME TO S3, S4;

See Also
CREATE SCHEMA (page 344) and DROP SCHEMA (page 360)

-315-
316

ALTER TABLE
Modifies an existing table.

Syntax 1
ALTER TABLE [schema-name.]table-name
{
ADD COLUMN column-definition (on page 348)
| ADD table-constraint (on page 319)
| ALTER COLUMN column_name [ SET DEFAULT default_expr ] | [ DROP DEFAULT ]
| DROP CONSTRAINT constraint-name [ RESTRICT | CASCADE ]
| RENAME [ COLUMN ] column TO new_column
| SET SCHEMA new-schema-name [ CASCADE | RESTRICT ]
}

Syntax 2
ALTER TABLE [schema-name.]table-name1 [, [schema-name.]table-name2,
[schema-name.]table-name3]
RENAME [TO] new-table-name1 [, new-table-name2 , new-table-name3 ]

Parameters
[schema-name.]table-name Specifies the name of the table to be altered. When using more than
one schema, specify the schema that contains the table.
ALTER TABLE can be used in conjunction with SET SCHEMA to move
only one table between schemas at a time.
When using ALTER TABLE to rename one or more tables, you can
specify a comma-delimited list of table names to rename.
ADD COLUMN Adds a new column to a table and to all superprojections of the table. A
column-definition unique projection column name is generated in each superprojection.
When a new column is added, the default value is inserted for existing
rows. For example, if current_timestamp is the default expression, all
rows will have the current timestamp.
Volatile functions cannot be specified through ADD COLUMN. Use
ALTER COLUMN to specify volatile functions.
The column is populated according to the column-constraint (on page
349).
Note: Columns added to a table that is referenced by a view will not
appear in the result set of the view even if the view uses the wild card
(*) to represent all columns in the table. Recreate the view to
incorporate the column.
ADD Adds a table-constraint (on page 319) to a table that does not have
table-constraint any associated projections.
Note: Adding a table constraint has no effect on views that reference
the table.

-316-
SQL Statements

ALTER COLUMN column_name Alters an existing column within the specified table to change or drop a
[ SET DEFAULT default_expr ] default expression.
| [ DROP DEFAULT ]
Tip: When adding a column, you cannot specify a volatile function in its
default column expression. To work around this, add the column and fill
it with NULLS. Then, alter the column to specify the volatile function.
For example:
ALTER TABLE tbl ADD COLUMN newcol float;
ALTER TABLE tbl ALTER COLUMN newcol SET DEFAULT
random();
DROP CONSTRAINT Drops the specified table-constraint (on page 319) from the table.
constraint-name
[ RESTRICT | CASCADE ] Use the CASCADE keyword to drop a constraint upon which something
else depends. For example, a FOREIGN KEY constraint depends on a
UNIQUE or PRIMARY KEY constraint on the referenced columns.
Note: Dropping a table constraint has no effect on views that reference
the table.
RENAME [TO] RENAME TO is used to rename one table, while RENAME is used to
new-table-name1 ... rename multiple tables. In either case, the key word changes the name
of the table or tables to the specified name or names.
To rename two or more tables simultaneously, use a comma-delimited
list. The lists of tables to rename and the new table names are parsed
from left to right and matched accordingly using one-to-one
correspondence.
When renaming tables, be sure to follow these standards:
Do not specify the schema-name as part of the table specification after
the RENAME TO clause. The schema-name is specified only after the
ALTER TABLE clause because this statement applies to only one
schema.
The following example renames tables T1 and T2 in the S1 schema
to U1 and U2 respectively.
ALTER TABLE S1.T1, S1.T2
RENAME TO U1, U2;
The following example generates a syntax error:
ALTER TABLE S1.T1, S1.T2
RENAME TO S1.U1, S1.U2;
The number of tables to rename must match the number of new table
names supplied.
The new table names must not already exist.
The RENAME TO parameter is applied atomically. Either all the tables
are renamed or none of the tables are renamed. If, for example, the
number of tables to rename does not match the number of new names
supplied, none of the tables are renamed.
Note: Renaming a table that is referenced by a view will cause the
view to fail unless another table is created to replace it.

-317-
SQL Reference Manual

RENAME [ COLUMN ] column TO Renames the specified column within the table.
new_column
Note: If a column that is referenced by a view is renamed, the column
will not appear in the result set of the view even if the view uses the
wild card (*) to represent all columns in the table. Recreate the view to
incorporate the column's new name.
SET SCHEMA Moves the table to the specified schema. By default, SET SCHEMA is
new-schema_name set to CASCADE. This means that all the projections that are anchored
[ CASCADE | RESTRICT ] on this table are automatically moved to the new schema regardless of
the schema in which they reside. To move only projections that are
anchored on this table and that reside in the same schema, use the
RESTRICT key word.
If the name of the table or any of the projections that you want to move
already exists in the new schema, the statement rolls back and the
tables and projections are not moved. In the new schema, rename the
table or projections that conflict with the ones that you want to move
and then rerun the statement.
Notes: Although this is likely to occur infrequently, Vertica supports
moving system tables to system schemas if necessary. This might
occur to support designs created through Database Designer.
Temporary tables cannot be moved between schemas.
SET SCHEMA supports moving only one table between schemas at a
time.

Notes
• To use the ALTER TABLE statement, the user must either be a superuser or be the table
owner and have CREATE privilege on the affected schema. If you use SET SCHEMA, you
must also have CREATE privilege on the schema to which you want to move the table.
• With the exception of performing a table rename, one operation can be performed at a time
in an ALTER TABLE command; for example, to add multiple columns, issue consecutive
ALTER TABLE ADD COLUMN commands.
• The following clauses cannot be used with any other clauses. They are exclusive:
§ RENAME [TO]
§ RENAME COLUMN
§ SET SCHEMA
§ ADD COLUMN
• The ADD constraints and DROP constraints clauses can be used together.
• Adding a column to a table does not affect the K-Safety of the physical schema design.
• You cannot use ALTER TABLE ... ADD COLUMN on a temporary table.
• Vertica allows adding 1600 columns to a table.
• Renaming tables does not affect existing prejoin projections because prejoin projections refer
to tables by the tables' unique numeric IDs (OIDs), and the OIDs for tables are not changed
by ALTER TABLE.

-318-
SQL Statements

Tip
Renaming tables is useful for swapping tables within the same schema without actually moving
data. It cannot be used to swap tables across schemas. To enable the swap, use a non-existent,
temporary placeholder table. The following example uses the temporary table temps to facilitate
swapping table T1 with table T2. In this example, T1 is renamed to temps. Then T2 is renamed
to T1. Finally, temps is renamed to T2.
ALTER TABLE T1, T2, temps
RENAME TO temps, T1, T2;

Examples
The following example adds a table constraint to the Product_Dimension table within the Retail
schema.
ALTER TABLE Retail.Product_Dimension
ADD CONSTRAINT PK_Product_Dimension PRIMARY KEY (Product_Key);
The following example drops a table constraint to the Product_Dimension table within the Retail
schema.
ALTER TABLE Retail.Product_Dimension
DROP CONSTRAINT PK_PRODUCT_Dimension;
The following example drops the default expression specified for the Discontinued_flag column.
ALTER TABLE Retail.Product_Dimension
ALTER COLUMN Discontinued_flag DROP DEFAULT;
The following example renames a column in the Retail.Product_Dimension table from
Product_description to Item_description:
ALTER TABLE Retail.Product_Dimension
RENAME COLUMN Product_description TO Item_description;
The following example moves table T1 from schema S1 to schema S2. SET SCHEMA defaults
to CASCADE so all the projections that are anchored on table T1 are automatically moved to
schema S2 regardless of the schema in which they reside
ALTER TABLE S1.T1 SET SCHEMA S2;

table-constraint
Adds a join constraint or a constraint describing a functional dependency to the metadata of a
table. See Adding Constraints in the Administrator's Guide.

Syntax
[ CONSTRAINT constraint_name ]
{ PRIMARY KEY ( column [ , ... ] )
| FOREIGN KEY ( column [ , ... ] )
REFERENCES table
| UNIQUE ( column [ , ... ] )

-319-
SQL Reference Manual

| CORRELATION ( column1 ) DETERMINES ( column2 )


}

Parameters
CONSTRAINT constraint-name Optionally assigns a name to the constraint. Vertica
recommends that you name all constraints.
PRIMARY KEY Adds a referential integrity constraint defining one or more NOT
( column [ , ... ] ) NULL numeric columns as the primary key.
FOREIGN KEY Adds a referential integrity constraint defining one or more NOT
( column [ , ... ] ) NULL numeric columns as a foreign key.
REFERENCES table Specifies the table to which the FOREIGN KEY constraint
applies. If column is omitted, the default is the primary key of
table.
UNIQUE Ensures that the data contained in a column or a group of
( column [ , ... ] ) columns is unique with respect to all the rows in the table.
CORRELATION Describes a functional dependency. Given a tuple and the set
of values in column1, you can determine the corresponding
value of column2.

Notes
• A foreign key constraint can be specified solely by a reference to the table that contains the
primary key. The columns in the referenced table do not need to be explicitly specified; for
example:
CREATE TABLE fact(c1 INTEGER PRIMARY KEY);
CREATE TABLE dim (c1 INTEGER REFERENCES fact);
• Use the ALTER TABLE (page 316) command to add a table constraint. The CREATE
TABLE statement does not allow table constraints.
• Define PRIMARY KEY and FOREIGN KEY constraints in all tables that participate in inner
joins. See Adding Join Constraints.
• Adding constraint to a table that is referenced in a view does not affect the view.

Examples
CORRELATION (Product_Description) DETERMINES (Category_Description)
The Retail Sales Example Database described in the Getting Started Guide contains a table
Product_Dimension in which products have descriptions and categories. For example, the
description "Seafood Product 1" exists only in the "Seafood" category. You can define several
similar correlations between columns in the Product Dimension table.

-320-
321

ALTER USER
Changes a database user account.

Syntax
ALTER USER name [ WITH [ ENCRYPTED | UNENCRYPTED ] PASSWORD password ]

Parameters
name Specifies the name of the user to alter; names that contain
special characters must be double-quoted.
ENCRYPTED Is the default. An md5 encryption scheme is used
password Is the password to assign to the user.

-321-
322

COMMIT
Ends the current transaction and makes all changes that occurred during the transaction
permanent and visible to other users.

Syntax
COMMIT [ WORK | TRANSACTION ]

Parameters
WORK | Have no effect; they are optional keywords for readability.
TRANSACTION

-322-
323

COPY
Is designed for bulk loading data from a file on a cluster host into a Vertica database. (See
LCOPY (page 374) to load from a data file on a client system.) COPY reads data optionally
compressed (GZIP, BZIP) from multiple named pipes or files and inserts records into the WOS
(memory) or directly into the ROS (disk).
Note: You must connect as the database superuser in order to COPY from a file.

Syntax
COPY [schema-name.]table
[ ( [ Column as Expression ] / column
...[FILLER datatype]
...[ FORMAT 'format' ]
...[ ENCLOSED BY 'char' ]
...[ ESCAPE AS 'char' ]
...[ NULL [ AS ] 'string' ]
...[ DELIMITER [ AS ] 'char' ]
...[ ,...] ) ]
FROM { STDIN
...[ BZIP | GZIP | UNCOMPRESSED ] | 'pathToData' [ ON nodename ]
[ BZIP | GZIP | UNCOMPRESSED ] [, ...] }
...[ WITH ]
...[ DELIMITER [ AS ] 'char' ]
...[ NULL [ AS ] 'string' ]
...[ ESCAPE AS 'char' ]
...[ ENCLOSED BY 'char' ]
...[ RECORD TERMINATOR 'string' ]
...[ SKIP n ]
...[ REJECTMAX n ]
...[ EXCEPTIONS 'pathname' [ ON nodename ] [, ...] ]
...[ REJECTED DATA 'pathname' [ ON nodename ] [, ...] ]
...[ ABORT ON ERROR ]
...[ DIRECT ]
...[ STREAM NAME ]
...[ NO COMMIT ]

Parameters
[schema-name.]table Specifies the name of a schema table (not a projection).
Vertica loads the data into all projections that include columns
from the schema table.
When using more than one schema, specify the schema that
contains the table.
Column as Expression Specifies the target column, for which you want to compute
values, as an expression. This is used to transform data when
it is loaded into the target database. Transforming data is
useful for computing values to be inserted into a column in the
target database from other columns in the source. (See
Transforming Data in the Administrator's Guide.)
Transformation Requirements

-323-
SQL Reference Manual

The copy statement must contain at least one parsed column,


which can be a filler column. (See Ignoring Columns and
Fields in the Load File in this guide and the COPY (page 323)
statement in the SQL Reference Guide for more information
about using fillers.)
For parsed columns, specify only raw data in the source.
The return data type of the expression must be coercible to
that of the target column. Parameter (parsed columns) will
also be coerced to match the expression.
When there are computed columns, all parsed columns in the
expression must be listed in the COPY statement.
Transformation Restrictions
Computed columns cannot be used in copy expressions.
Raw data cannot be specified in the source for computed
columns.
Copy expressions may contain only constants.
FORMAT cannot be specified for a computed column.
Transformation Usage
If nulls are specified in the raw data for parsed columns in the
source, evaluation will follow the same rules as for
expressions within sql statements.
Parsed and computed columns can be interspersed in the
COPY statement.
Multiple columns can be specified in a copy expression.
Multiple copy expressions can refer to the same parsed
column.
A copy expression can be as simple as a single column and
may be as complex as a case expression with multiple
columns.
Copy expressions can be specified for columns of all
supported data types.
COPY expressions can use most Vertica-supported SQL
functions, operators, constants, NULLs, and comments, as
follows: date time functions, formatting functions, numeric
functions, string functions, null value functions, and system
functions
COPY expressions cannot use the following SQL functions:
meta (Vertica functions), analytic, and aggregate

-324-
SQL Statements

column Restricts the load to one or more specified columns in the


table. If no columns are specified, all columns are loaded by
default.
Table columns that are not in the column list are given their
default values. If no default value is defined for a column,
COPY inserts NULL.
There is no implicit casting during parsing, so mismatched
data types will cause the COPY to roll back and the row to be
rejected. For parsed columns, specify only raw data in the
source.
Note: The data file must contain the same number of
columns as the COPY command's column list. For example, in
a table T1 with nine columns (C1 through C9), COPY T1 (C1,
C6, C9) would load the three columns of data in each record
to columns C1, C6, and C9, respectively.
FILLER datatype Instructs Vertica not to load a column and the fields it contains
into the destination table. This is useful for:
Omitting columns that you do not want to transfer into a table.
Transforming data from a source column and then loading the
transformed data to a destination table without loading the
original, untransformed source column (parsed column). (See
Transforming Data in the Administrator's Guide.)
Filler requirements:
The datatype of the filler column must be specified.
The name of the filler column must be unique across the
source file and target table.
The filler column must be a parsed column, not a computed
column.
Filler restrictions:
The source columns in a COPY statement cannot consist of
only filler columns.
Target table columns cannot be specified as filler whether or
not they appear in the column list.
Filler usage:
Expressions can contain filler columns.
There is no restriction on the number of filler columns that can
be used in a copy statement (other than at least one column
must not be a filler column).
A data file can consist of only filler columns. This means that
all data in a data file can be loaded into filler columns and then
transformed and loaded into table columns.
All parser parameters can be specified for filler columns.
All statement level parser parameters apply to filler columns.

-325-
SQL Reference Manual

FORMAT 'format' Is specified for date, time, and binary data types.
Supported date/time formats are the same as those accepted
by TO_DATE(text, text); for example:
TO_DATE('05 Dec 2000', 'DD Mon YYYY')
If you specify invalid format strings, the COPY operation
returns an error. Refer to the following links for supported
formats:
Template Patterns for Date/Time Formatting (page 171)
Template Pattern Modifiers for Date/Time Formatting
(page 174)
pathToData Specifies the absolute path of the file containing the data,
which can be from multiple input sources.
Path can optionally contain wildcards to match more than one
file. The file or files must be accessible to the host on which
the COPY statement runs.
You can use variables to construct the pathname as described
in Using Load Scripts.
The supported patterns for wildcards are specified in the
Linux Manual Page GLOB(7), Globbing pathnames
http://man-wiki.net/index.php/7:glob.
nodename Is optional. If omitted, operations default to the query's initiator
node.
Note: Nodename cannot be specified with STDIN because
STDIN is read on the initiator node only.
STDIN Reads from the client a standard input instead of a file. STDIN
takes one input source only and is read on the initiator node.
To load multiple input sources, use pathToData.
BZIP|GZIP|UNCOMPRESSED Input files can be of any format. If wildcards are used, then all
qualifying input files should be of the same format.
UNCOMPRESSED is the default.
Note: When using concatenated BZIP or GZIP files, be sure
that each source file is terminated with a record terminator
before you concatenate them.
WITH Are for readability and have no effect.
AS

DELIMITER 'char' Is the single ASCII character that separates columns within
each record of a file. The default in Vertica is a vertical bar (|).
Note: A comma (,) is the delimiter commonly used in CSV
data files.
If the delimiter character appears in string of data values, use
the ESCAPE AS (\) character to indicate that it is a literal.
See Loading Character Data. Also use (\) to specify special
(non-printing, control) characters as a delimiter, such as the
tab character ('\t').
Quote (") is allowed if you set ENCLOSED BY to single

-326-
SQL Statements

quotes and specify the delimiter as double quotes; for


example, '"'

ESCAPE AS 'char' Sets the escape character. The default is backslash (\).
You can use the escape character to escape the record
terminator, delimiter, enclosed by character, and the escape
character itself.
ENCLOSED BY 'char' Sets the quotes character and allows delimiter characters to
be embedded in string values. The default is empty with
ENCLOSED BY off; for example, given the following input:
"vertica, value"
Column 1 contains "vertica
Column 1 contains value"
Notice the double quotes before vertica and after value.
Turn ENCLOSED BY on by specifying any single ASCII
character. Double quotes is the most common:
ENCLOSED BY '"'
Turn ENCLOSED BY off using the following command:
ENCLOSED BY ''
You can use ENCLOSED BY to embed delimiter character
string values. For example, to include a quote character in a
value, use the ESCAPE AS character. The following input, for
example, returns "vertica"
"\"vertica\""
Using the following sample input, columns are distributed as
follows:
"1", "vertica,value", ",", "'"
Column 1 contains 1
Column 2 contains vertica,value
Column 3 contains ,
Column 4 contains '
You could also write the above example using single quotes,
or any ASCII character of your choosing:
'1', 'vertica,value', ',', '''
~1~, ~vertica,value~, ~,~, ~'~
In the following list, double quotes are used for illustration
purposes only:
An enclosed string is detected if the string starts with double
quotes ("); leading spaces are optional.
It is assumed that the string is not enclose if the string does
not start with " .
" in the middle of a string has no effect. Any " in the middle of
an enclosed string, on the other hand, must include an escape
(backslash \) character.
An enclosed string ends with a second occurrence of " with
an optional adjacent delimiter. The end of the string is, thus, "

-327-
SQL Reference Manual

or "| (where | is assumed to be the delimiter), whichever


occurs first.
An enclosed string does not match a NULL string, regardless
of the contents.
NULL 'string' The string that represents a null value. The default is an empty
string ('').
To input an empty or literal string, use quotes (ENCLOSED
BY); for example:
NULL ''
NULL 'literal'
The null string is case-insensitive and must be the only value
between the delimiters. For example, if the null string is NULL
and the delimiter is the vertical bar (|):
|NULL| indicates a null value.
| NULL | does not indicate a null value.
When you use the COPY command in a script, you must
substitute a double-backslash for each null string that includes
a backslash. For example, the scripts used to load the
example databases contain:
COPY ... NULL '\\n' ...
Note: When using COPY FROM, any data item that
matches this string is stored as a null value, so make sure that
you use the same string that you used with COPY TO.
RECORD TERMINATOR 'string' Specifies the literal character string that indicates the end of a
data file record. You can include non-printing characters and
backslash characters in the string according to the following
convention:

Sequenc Description Abbreviati ASCII


e on Decimal
\0 Null NULL 0
character
\a Bell BEL 7
\b Backspace BS 8
\t Horizontal HT 9
tab
\n Linefeed LF 10
\v Vertical VT 11
tab
\f Formfeed FF 12
\r Carriage CR 13
return
\\ Backslash 92
Note: You cannot use the record in the delimiter
string.

-328-
SQL Statements

SKIP Skips the first 'n' records in each file in a load, which is useful
if you want to omit table header information.
REJECTMAX Sets an upper limit on the number of logical records to be
rejected before a load fails. The limit is one less than the value
specified for REJECTMAX. When the number of rejected
records becomes equal to the value specified for
REJECTMAX, the load fails. If not specified or if value is 0,
REJECTMAX allows an unlimited number of rejections.
Note: Vertica does not accumulate rejected records across
files or nodes while the data is loading. If one file exceeds the
maximum reject number, the entire load fails.
EXCEPTIONS 'pathname' Specifies the filename or absolute pathname in which to write
messages indicating the input line number and the reason for
each rejected data record. The default pathname is:
<Catalog
dir>/CopyErrorLogs/<tablename>-<filename of
source>-copy-from-exceptions
<Catalog dir>represents the directory in which the database
catalog files are stored, and <tablename>-<filename of
source> are the names of the table and data file. If copying
from STDIN, the <filename of source> is STDIN.
Note: Filename is required because of multiple input
files. Also, long table names combined with long data file
names can exceed the operating system's maximum
length (typically 255 characters). To work around this
limitation, specify a path for the exceptions file that is
different from the default path; for example,
\tmp\<shorter-file-name>.
If exceptions files are not specified:
If there is one data source file (pathToData or STDIN), all
information is stored as one file in the default directory.
If there are multiple data files, all information is stored as
separate files, one for each data file in default directory.
If exception files are specified:
If there is one data file, path is treated as a file with all
information stored in this file. If path is not a file, then the
system returns an error.
If there are multiple data files, path is treated as a directory
with all information stored in separate files, one for each data
file in this directory. If path is not a directory, then the system
returns an error.
Exceptions files are not shipped to the initiator node.
Only one pathname per node is accepted. If more than one is
provided, Vertica returns an error.
The format for the EXCEPTIONS file is:
COPY: Input record <num> in <pathofinputfile>
has been rejected (<reason>). Please see
<pathtorejectfile>, record <recordnum> for

-329-
SQL Reference Manual

the rejected record.

REJECTED DATA 'pathname' Specifies the filename or absolute pathname in which to write
rejected rows. This file can then be edited to resolve problems
and reloaded. The default pathname is:
<Catalog
dir>/CopyErrorLogs/<tablename>-<filename of
source>-copy-from-data
<Catalog dir>represents the directory in which the database
catalog files are stored, and <tablename>-<filename of
source> are the names of the table and data file. If copying
from STDIN, the <filename of source> is STDIN.
Notes: Filename is required because of multiple input files.
Also, long table names combined with long data file names
can exceed the operating system's maximum length (typically
255 characters). To work around this limitation, specify a path
for the rejected data file that is different from the default path;
for example, \tmp\<shorter-file-name>.
If rejected data files are not specified:
If there is one data source file (pathToData or STDIN), all
information is stored as one file in the default directory.
If there are multiple data files, all information is stored as
separate files, one for each data file in default directory.
If rejected data files are specified:
If there is one data file, path is treated as a file with all
information stored in this file. If path is not a file, then the
system returns an error.
If there are multiple data files, path is treated as a directory,
with all information stored in separate files, one for each data
file in this directory. If path is not a directory, then the system
returns an error.
Rejected data files are not shipped to the initiator node.
Only one pathname per node is accepted. If more than one is
provided, Vertica returns an error.
ABORT ON ERROR Stops the COPY command if a row is rejected and rolls back
the command. No data is loaded.
DIRECT Specifies that the data should go directly to the ROS (Read
Optimized Store. By default, data goes to the WOS (Write
Optimized Store).
STREAM NAME Is the optional identifier that names a stream, which could be
useful for quickly identifying a particular load.
STREAM NAME appears in the stream column of the
LOAD_STREAMS (page 440) table.
By default, Vertica names streams by table and file name. For
example, if you have two files (f1, f2) in Table A, stream
names would appear as A-f1, A-f2, etc.

-330-
SQL Statements

Use the following statement to name a stream:


COPY <mytable> FROM <myfile> DELIMITER '|' DIRECT
STREAM NAME 'My stream name';
NO COMMIT Use COPY with the NO COMMIT key words to prevent the
current transaction from committing automatically (default
behavior for all but temporary tables).
This option is useful for executing multiple COPY commands
in a single transaction. For example, all the rows in the
following sequence commit in the same transaction.
COPY... NO COMMIT;
COPY... NO COMMIT;
COPY... NO COMMIT;
COMMIT;
NO COMMIT can be combined with any other existing COPY
option, and all the usual transaction semantics apply.
If there is a transaction in progress initiated by a statement
other than COPY (for example, INSERT), COPY... NO
COMMIT adds rows to the same transaction as the earlier
statements. The previous statements are NOT committed.
Tip: Use the NO COMMIT keywords to incorporate detection
of constraint violations into the load process. Vertica checks
for violations when queries are executed, not when data is
loaded. To avoid constraint violations, load data without
committing it and then perform a post-load check of your data
using the ANALYZE_CONSTRAINTS (page 241) function. If
the function finds constraint violations, you can easily roll back
the load because you have not committed it.

Notes
• COPY FROM STDIN is allowed to any user granted the INSERT privilege, while COPY
FROM <file> is an admin-only operation.
• The COPY FORMAT keyword significantly improves performance for loading DATE data
types.
• The COPY command automatically commits itself and any current transaction unless NO
COMMIT is specified and unless the tables are temp tables. Vertica recommends that you
COMMIT (page 322) or ROLLBACK (page 379) the current transaction before using COPY.
• You cannot use the same character in both the DELIMITER and NULL strings.
• NULL values are not allowed for columns with primary key or foreign key referential integrity
constraints.
• String data in load files is considered to be all characters between the specified delimiters.
Do not enclose character strings in quotes. In other words, quote characters are treated as
ordinary data.
• Invalid input is defined as:
§ Missing columns (too few columns in an input line).
§ Extra columns (too many columns in an input line).

-331-
SQL Reference Manual

§ Empty columns for INTEGER or DATE/TIME data types. COPY does not use the default
data values defined by the CREATE TABLE command.
§ Incorrect representation of data type. For example, non-numeric data in an INTEGER
column is invalid.
• Empty values (two consecutive delimiters) are accepted as valid input data for CHAR and
VARCHAR data types. Empty columns are stored as an empty string (''), which is not
equivalent to a null string.
• When an empty line is encountered during load, it is neither inserted nor rejected. However,
the record number is incremented. Bear this in mind when evaluating lists of rejected
records. If you return a list of rejected records and one empty row was encountered during
load, the position of rejected records will be bumped up one position.
• Cancelling a COPY statement rolls back all rows loaded by that statement.
• The default record terminator for COPY is now '\n'. If you are loading data from a
Windows client, you need to specify RECORD TERMINATOR '\r\n'.
If you are using JDBC, Vertica recommends that you use use the following value for the
RECORD TERMINATOR
System.getProperty("line.separator")
• Named pipes are supported. Naming conventions have the same rules as filenames on the
given file system. Permissions are open, write, and close.
• The following can be specified on either a statement or per column basis: DELIMITER,
ENCLOSED BY, ESCAPE AS, and NULL. The same rules apply whether the parameter is
specified at the statement or column level. Column level parameters override statement level
parameters. If no column level parameter is specified, the statement level parameter is used.
If neither a column level nor statement level parameter is specified, the default is used.

Examples
The following examples specify format, delimiter, null and enclosed by strings.
COPY public.customer_dimension (customer_since FORMAT 'YYYY')
DELIMITER ','
NULL AS 'null'
ENCLOSED BY '"'
COPY store.store_dimension
FROM :input_file
DELIMITER '|'
NULL ''
RECORD TERMINATOR '\f'
COPY a
FROM stdin
DELIMITER ','
NULL '\\\N'
DIRECT;

Loading Binary Data


In the following example create a table that loads a different binary format for each column and
insert the same value, the byte sequence {0x61,0x62,0x63,0x64,0x65}.
CREATE TABLE t(oct VARBINARY(5), hex VARBINARY(5), bitstring VARBINARY(5));

-332-
SQL Statements

Create the projection:


CREATE PROJECTION t_p(oct, hex, bitstring) AS SELECT * FROM t;
Issue the COPY command. Note that the copy is from STDIN, not a file.
COPY t (oct FORMAT 'octal',
hex FORMAT 'hex',
bitstring FORMAT 'bitstring')
FROM STDIN DELIMITER ',';
Enter the data to be copied, which you end with a backslash and a period on a line by itself:
141142143144145,0x6162636465,0110000101100010011000110110010001100101
\.
And now issue the SELECT statement to see the results:
SELECT * FROM t;
oct | hex | bitstring
-------+-------+-----------
abcde | abcde | abcde
(1 row)

Using Compressed Data and Named Pipes


The following command creates the named pipe, pipe1:
\! mkfifo pipe1
\set dir `pwd`/
\set file '\'':dir'pipe1\''
The following sequence copies an uncompressed file from the named pipe:
\! cat pf1.dat > pipe1 &
COPY fact FROM :file delimiter '|';
SELECT * FROM fact;
COMMIT;
The following statement copies a GZIP file from named pipe and uncompresses it:
\! gzip pf1.dat
\! cat pf1.dat.gz > pipe1 &
COPY fact FROM :file ON site01 GZIP delimiter '|';
SELECT * FROM fact;
COMMIT;
\!gunzip pf1.dat.gz
The following COPY command copies a BZIP file from named pipe and then uncompresses it:
\!bzip2 pf1.dat
\! cat pf1.dat.bz2 > pipe1 &
COPY fact FROM :file ON site01 BZIP delimiter '|';
SELECT * FROM fact;
COMMIT;
bunzip2 pf1.dat.bz2

User-specified Exceptions and Rejected Data


\set dir `pwd`/data/

-333-
SQL Reference Manual

\set remote_dir /scratch_b/qa/tmp_ms/


Reject/Exception files NOT specified. The inputs are multiple files, and exceptions and rejection
files go to the default directory on each node:
\set file1 '\'':dir'C1_rej.dat\''
\set file2 '\'':dir'C2_rej.dat\''
\set file3 '\'':remote_dir'C3_rej.dat\''
\set file4 '\'':remote_dir'C4_rej.dat\''

COPY fact FROM :file1 on site01,


:file2 on site01,
:file3 ON site02,
:file4 ON site02
delimiter '|';
Reject/Exception files SPECIFIED. Input is a single file on the initiator, and the exceptions and
rejected data are filenames instead of directories:
\set except_s1 '\'':dir'exceptions\''
\set reject_s1 '\'':dir'rejections\''
COPY fact FROM :file1 on site01
delimiter '|'
REJECTED DATA :reject_s1 on site01
EXCEPTIONS :except_s1 on site01;
Reject/Exception files SPECIFIED. A single file is on remote node:
\set except_s2 '\'':remote_dir'exceptions\''
\set reject_s2 '\'':remote_dir'rejections\''
COPY fact FROM :file1 on site02
delimiter '|'
REJECTED DATA :reject_s2 on site02
EXCEPTIONS :except_s2 on site02;
Reject/Exception files SPECIFIED. Multiple data files on multiple nodes, with rejected data and
exceptions referring to the directory on which the files reside:
\set except_s1 '\'':dir'\''
\set reject_s1 '\'':dir'\''
\set except_s2 '\'':remote_dir'\''
\set reject_s2 '\'':remote_dir'\''
COPY fact FROM :file1 on site01,
:file2 on site01,
:file3 ON site02,
:file4 ON site02
delimiter '|'
REJECTED DATA :reject_s1 on site01, :reject_s2 on site02
EXCEPTIONS :except_s1 on site01, :except_s2 on site02;

Loading NULL values


You can specify NULL values by entering fields in a data file without content.
For example, given the default delimiter (|) and default NULL (empty string), the following inputs

-334-
SQL Statements

| | 1
| 2 | 3
4 | | 5
6 | |
are inserted into the table as follows:
(null, null, 1)
(null, 2, 3)
(4, null, 5)
(6, null, null)
If NULL is set as a literal ('null'), the following inputs
null | null | 1
null | 2 | 3
4 | null | 5
6 | null | null
are inserted into the table as follows:
(null, null, 1)
(null, 2, 3)
(4, null, 5)
(6, null, null)

Transforming Data
The following example derives and loads values for the year, month, and day columns in the
target database based on the timestamp column in the source database. It also loads the parsed
column, timestamp, from the source database to the target database.
CREATE TABLE t (
year VARCHAR(10),
month VARCHAR(10),
day VARCHAR(10),
k timestamp
);
CREATE PROJECTION tp (
year,
month,
day,
k)
AS SELECT * FROM t;
COPY t (year AS TO_CHAR(k, 'YYYY'),
month AS TO_CHAR(k, 'Month'),
day AS TO_CHAR(k, 'DD'),
k FORMAT 'YYYY-MM-DD') FROM STDIN NO COMMIT;
2009-06-17
1979-06-30
2007-11-26
\.

SELECT * FROM t;
year | month | day | k
------+-----------+-----+---------------------

-335-
SQL Reference Manual

2009 | June | 17 | 2009-06-17 00:00:00


1979 | June | 30 | 1979-06-30 00:00:00
2007 | November | 26 | 2007-11-26 00:00:00
(3 rows)

Ignoring Columns and Fields in the Load File


The following example derives and loads the value for the timestamp column in the target
database from the year, month, and day columns in the source input. The year, month, and day
columns are not loaded because the FILLER keyword skips them.
create table t (k timestamp);
create projection tp (k) as select * from t;
copy t(year FILLER varchar(10),
month FILLER varchar(10),
day FILLER varchar(10),
k as to_data(year || month || day, 'YYYYMMDD')) from STDIN no commit;

2009|06|17
1979|06|30
2007|11|26

select * from t;
k
---------------
2009-06-17 00:00:00
1979-06-30 00:00:00
2007-11-26 00:00:00

(3 rows)

See Also
SQL Data Types (page 89)
ANALYZE_CONSTRAINTS (page 241)
Loading Binary Data and Loading Character Data in the Administrator's Guide

-336-
SQL Statements

CREATE PROJECTION
Creates metadata for a projection in the Vertica catalog.

Syntax
CREATE PROJECTION [schema-name.]projection-name
( [ projection-column ] [ ENCODING encoding-type (on page 340) ] [ , ... ] [
ACCESSRANK integer ] )
AS SELECT table-column [ , ... ]
FROM table-reference [ , ... ]
[ WHERE join-predicate (on page 83) [ AND join-predicate (on page 83) ] ...
[ ORDER BY table-column [ , ... ] ]
[ hash-segmentation-clause (on page 341) | range-segmentation-clause (on page
343)
| UNSEGMENTED { NODE node | ALL NODES } ]

Parameters
[schema-name.]projection-nam
e
Specifies the name of the projection to be created. When using
more than one schema, specify the schema that contains the
projection.
Note: If projection schema is not specified, the projection is created
in the same schema as the anchor table.
projection-column
Specifies the name of a column in the projection. The data type is
inferred from the corresponding column in the schema table (based
on ordinal position).
If projection columns are not explicitly named, they are inferred from
the column names for the table specified in the SELECT statement.
The following example automatically uses store and transaction as
the projection column names for sales_p:
CREATE_TABLE sales(store integer, transaction integer);
CREATE PROJECTION sales_p as select * from sales;
Note that you cannot specify specific encodings on projection
columns using this method.
Different projection-column names can be used to distinguish
multiple columns of the same name from different tables so that no
aliases are needed.
encoding-type
Specifies the type of encoding (see "encoding-type" on page 340)
to use on the column. The Database Designer automatically
chooses an appropriate encoding for each projection column.
Caution: Using the NONE keyword for strings could negatively affect the
behavior of string columns.
ACCESSRANK integer
Overrides the default access rank for a column. This is useful if you

-337-
SQL Reference Manual

want to increase or decrease the speed at which a column is


accessed. See Creating and Configuring Storage Locations and
Prioritizing Column Access Speed.
SELECT table-column
Specifies a list of schema table columns corresponding (in ordinal
position) to the projection columns.
table-reference Specifies a list of schema tables containing the columns to include in the
projection in the form:
table-name [ AS ] alias [ ( column-alias [ , ...] ) ] [ ,
...] ]
WHERE join-predicate
Specifies foreign-key = primary-key equijoins between the fact table
and dimension tables. Foreign key columns must be NOT NULL.
No other predicates are allowed.
ORDER BY table-column Specifies which columns to sort. Because all projection columns are sorted
in ascending order in physical storage, CREATE PROJECTION does not
allow you to specify ascending or descending.
Note: If you do not specify the sort order, Vertica uses the order in which
columns are specified in the column list as the sort order for the projection.
hash-segmentation-clause Allows you to segment a projection based on a built-in hash function that
provides even distribution of data across nodes, resulting in optimal query
execution. See hash-segmentation-clause (on page 341).
range-segmentation-clause Allows you to segment a projection based on a known range of values
stored in a specific column chosen to provide even distribution of data
across a set of nodes, resulting in optimal query execution. See
range-segmentation-clause (on page 343).
NODE node Creates an unsegmented projection on the specified node only. Dimension
table projections must be UNSEGMENTED.
ALL NODES Creates a separate unsegmented projection on each node at the time the
CREATE PROJECTION statement is executed (automatic replication). In
order to do distributed query execution, Vertica requires an exact,
unsegmented copy of each dimension table superprojection on each node.
See projection naming note below.

Unsegmented Projection Naming


CREATE PROJECTION ... UNSEGMENTED takes a snapshot of the nodes defined at execution time
to generate a node list in a predictable order. Thus, replicated projections have the name:
projection-name_node-name
For example, if the nodes are named NODE01, NODE02, and NODE03 then the following
command creates projections named ABC_NODE01, ABC_NODE02, and ABC_NODE03:
CREATE PROJECTION ABC ... UNSEGMENTED ALL NODES

-338-
SQL Statements

This naming convention could impact functions that provide information about projections, for
example, GET_PROJECTIONS (page 277) or GET_PROJECTION_STATUS (page 277), where
you must provide the name ABC_NODE01 instead of just ABC. To view a list of the nodes in a
database, use the View Database command in the Administration Tools.

Notes
Vertica recommends that you use multiple projection syntax for K-safe clusters.
CREATE PROJECTION does not load data into physical storage. If the tables over which the
projection is defined already contain data you must issue START_REFRESH (page 308) to bring
the projection up-to-date. This process could take a long time, depending on how much data is in
the tables. Once a projection is up-to-date, it is updated as part of INSERT, UPDATE, DELETE
or COPY statements.
If no segmentation is specified, the default is UNSEGMENTED on the node where the CREATE
PROJECTION was executed.
A projection is not refreshed until after a buddy projection is created. After the CREATE
PROJECTION is executed, if you execute 'select start_refresh()' the following
message displays:
Starting refresh background process
However, the refresh does not begin until after a buddy projection is created. You can monitor
the refresh operation by examining the vertica.log file or view the final status of the projection
refresh by using SELECT get_projections('table-name;). For example:
SELECT get_projections('customer_dimension');
get_projections
----------------------------------------------------------------
Current system K is 1.
# of Nodes: 4.
Table public.customer_dimension has 4 projections.

Projection Name: [Segmented] [Seg Cols] [# of Buddies] [Buddy Projections] [Safe] [UptoDate]
----------------------------------------------------------
public.customer_dimension_node0004 [Segmented: No] [Seg Cols: ] [K: 3]
[public.customer_dimension_node0003, public.customer_dimension_node0002,
public.customer_dimension_node0001] [Safe: Yes] [UptoDate: Yes][Stats: Yes]
public.customer_dimension_node0003 [Segmented: No] [Seg Cols: ] [K: 3]
[public.customer_dimension_node0004, public.customer_dimension_node0002,
public.customer_dimension_node0001] [Safe: Yes] [UptoDate: Yes][Stats: Yes]
public.customer_dimension_node0002 [Segmented: No] [Seg Cols: ] [K: 3]
[public.customer_dimension_node0004, public.customer_dimension_node0003,
public.customer_dimension_node0001] [Safe: Yes] [UptoDate: Yes][Stats: Yes]
public.customer_dimension_node0001 [Segmented: No] [Seg Cols: ] [K: 3]
[public.customer_dimension_node0004, public.customer_dimension_node0003,
public.customer_dimension_node0002] [Safe: Yes] [UptoDate: Yes][Stats: Yes]
(1 row)

Note: After a refresh completes, the refreshed projections go into a single ROS container. If
the table was created with a PARTITION BY clause, then you should call
PARTITION_TABLE() or PARTITION_PROJECTION() to reorganize the data into multiple
ROS containers, since queries on projections with multiple ROS containers perform better
than queries on projections with a single ROS container.

-339-
SQL Reference Manual

encoding-type
Vertica supports the following encoding and compression types:

ENCODING AUTO (default)


For CHAR/VARCHAR, BOOLEAN, BINARY/VARBINARY, and FLOAT columns,
Lempel-Ziv-Oberhumer-based (LZO) compression is used. For INTEGER,
DATE/TIME/TIMESTAMP, and INTERVAL types, the compression scheme is based on the delta
between consecutive column values.
Encoding Auto is ideal for sorted, many-valued columns such as primary keys. It is also suitable
for general purpose applications for which no other encoding or compression scheme is
applicable. Therefore, it serves as the default if no encoding/compression is specified.
The CPU requirements for this type are relatively small. In the worst case, data may expand by
eight percent (8%) for LZO and twenty percent (20%) for integer data.

ENCODING DELTAVAL
For INTEGER and DATE/TIME/TIMESTAMP/INTERVAL columns, data is recorded as a
difference from the smallest value in the data block. This encoding has no effect on other data
types.
Encoding Deltaval is best used for many-valued, unsorted integer or integer-based columns. The
CPU requirements for this type are small, and the data will never expand.

ENCODING RLE
Run Length Encoding (RLE) replaces sequences (runs) of identical values with a single pair that
contains the value and number of occurrences. Therefore, it's best used for low cardinality
columns that are present in the ORDER BY clause of a projection.
The Vertica execution engine processes RLE encoding run-by-run and the Vertica optimizer
gives it preference. Thus you should use it only when the run length is large, such as when
low-cardinality columns are sorted.
The storage for RLE and AUTO encoding of CHAR/VARCHAR and BINARY/VARBINARY is
always the same.

ENCODING BLOCK_DICT
For reach block of storage, Vertica compiles distinct column values into a dictionary and then
stores the dictionary and a list of indexes to represent the data block.
BLOCK_DICT is ideal for few-valued, unsorted columns in which saving space is more important
than encoding speed. Certain kinds of data, such as stock prices, are typically few-valued within
a localized area once the data is sorted, such as by stock symbol and timestamp, and are good
candidates for BLOCK_DICT. Long CHAR/VARCHAR columns are not good candidates for
BLOCK_DICT encoding.
CHAR and VARCHAR columns that contain 0x00 or 0xFF characters should not be encoded
with BLOCK_DICT. Also, BINARY/VARBINARY columns do not support BLOCK_DICT
encoding.

-340-
SQL Statements

The encoding CPU for BLOCK_DICT is significantly higher than for default encoding schemes.
The maximum data expansion is eight percent (8%).

ENCODING BLOCKDICT_COMP
This encoding type is similar to BLOCK_DICT except that dictionary indexes are entropy coded.
This encoding type requires significantly more CPU time to encode and decode and has a poorer
worst-case performance. However, use of this type can lead to space savings if the distribution
of values is extremely skewed.

ENCODING DELTARANGE_COMP
This compression scheme is primarily used for floating point data, and it stores each value as a
delta from the previous one.
This scheme is ideal for many-valued FLOAT columns that are either sorted or confined to a
range. Do not use this scheme for unsorted columns that contain NULL values, as the storage
cost for representing a NULL value is high. This scheme has a high cost for both compression
and decompression.
To determine if DELTARANGE_COMP is suitable for a particular set of data, compare it to other
schemes. Be sure to use the same sort order as the projection and select sample data that will
be stored consecutively in the database.

ENCODING COMMONDELTA_COMP
This compression scheme builds a dictionary of all the deltas in the block and then stores
indexes into the delta dictionary using entropy coding.
This scheme is ideal for sorted FLOAT and INTEGER-based
(DATE/TIME/TIMESTAMP/INTERVAL) data columns with predictable sequences and only the
occasional sequence breaks, such as timestamps recorded at periodic intervals or primary keys.
For example, the following sequence compresses well: 300, 600, 900, 1200, 1500, 600, 1200,
1800, 2400. The following sequence does not compress well: 1, 3, 6, 10, 15, 21, 28, 36, 45, 55.
If the delta distribution is excellent, columns can be sorted in less than one bit per row. However,
this scheme is very CPU intensive. If you use this scheme on data with arbitrary deltas, it can
lead to significant data expansion.

ENCODING NONE
Do not specify this value. It is obsolete and exists only for backwards compatibility. The result of
ENCODING NONE is the same as ENCODING AUTO except when applied to CHAR and
VARCHAR columns. Using ENCODING NONE on these columns increases space usage,
increases processing time, and leads to problems if 0x00 or 0xFF characters are present in the
data.

hash-segmentation-clause
Hash segmentation allows you to segment a projection based on a built-in hash function that
provides even distribution of data across some or all of the nodes in a cluster, resulting in optimal
query execution.

-341-
SQL Reference Manual

Note: Hash segmentation is the preferred method of segmentation in Vertica 2.0 and later.
The Database Designer uses hash segmentation by default.

Syntax
SEGMENTED BY expression
[ ALL NODES [ OFFSET offset ] | NODES node [ ,... ] ]

Parameters
SEGMENTED BY expression Can be a general SQL expression, but there is no reason to use
anything other than the built-in HASH (page 181) or MODULARHASH
(page 183) functions with table columns as arguments.
Choose columns that have a large number of unique data values and
acceptable skew in their data distribution. Primary key columns that
meet the criteria could be an excellent choice for hash segmentation.
ALL NODES Automatically distributes the data evenly across all nodes at the time
the CREATE PROJECTION statement is executed. The ordering of the
nodes is fixed.
OFFSET offset Is an integer that specifies the node within the ordered sequence on
which to start the segmentation distribution, relative to 0. See example
below.
NODES node [ ,... ] Specifies a subset of the nodes in the cluster over which to distribute
the data. You can use a specific node only once in any projection. For a
list of the nodes in a database, use the View Database command in the
Administration Tools.

Notes
• Omitting an OFFSET clause is equivalent to OFFSET 0.
• CREATE PROJECTION accepts the deprecated syntax SITES node for compatibility with
previous releases.
• Table column names must be used in the expression, not the new projection column names
• If you want to use a different SEGMENTED BY expression, the following restrictions apply:
§ All leaf expressions must be either constants (on page 55) or column-references (see
"Column References" on page 74) to a column in the SELECT list of the CREATE
PROJECTION command
§ Aggregate functions are not allowed
§ The expression must return the same value over the life of the database.
§ The expression must return non-negative INTEGER values in the range 0 <= x < 263 (two
to the sixty-third power or 2^63), and the values should be uniformly distributed over that
range.
§ If expression produces a value outside the expected range (a negative value for
example), no error occurs, and the row is added to the first segment of the projection.

Examples
CREATE PROJECTION ... SEGMENTED BY HASH(C1,C2) ALL NODES;
CREATE PROJECTION ... SEGMENTED BY HASH(C1,C2) ALL NODES OFFSET 1;

-342-
SQL Statements

The example produces two hash-segmented buddy projections that form part of a K-Safe design.
The projections can use different sort orders.
CREATE PROJECTION fact_ts_2 (f_price, f_cid, f_tid, f_cost, f_date) AS
(SELECT price, cid, tid, cost, dwdate
FROM fact)
SEGMENTED BY ModularHash(dwdate)
ALL NODES OFFSET 2;

See Also
HASH (page 181) and MODULARHASH (page 183)

range-segmentation-clause
Range segmentation allows you to segment a projection based on a known range of values
stored in a specific column chosen to provide even distribution of data across a set of nodes,
resulting in optimal query execution.
Note: Vertica Systems, Inc. recommends that you use hash segmentation, instead of range
segmentation.

Syntax
SEGMENTED BY expression
NODE node VALUES LESS THAN value

NODE node VALUES LESS THAN MAXVALUE

Parameters (Range Segmentation)


SEGMENTED BY expression Is a single column reference (see "Column References" on page 74)
to a column in the SELECT list of the CREATE PROJECTION
statement. Choose a column that has:
INTEGER or FLOAT data type
A known range of data values
An even distribution of data values
A large number of unique data values
Avoid columns that:
Are foreign keys
Are used in query predicates
Have a date/time data type
Have correlations with other columns due to functional dependencies.
Note: Segmenting on DATE/TIME data types is valid but guaranteed
to produce temporal skew in the data distribution and is not
recommended. If you choose this option, do not use TIME or TIMETZ
because their range is only 24 hours.
NODE node Is a symbolic name for a node. You can use a specific node only once
in any projection. For a list of the nodes in a database, use SELECT *
FROM NODE_RESOURCES.
VALUES LESS THAN value Specifies that this segment can contain a range of data values less

-343-
SQL Reference Manual

than the specified value, except that segments cannot overlap. In other
words, the minimum value of the range is determined by the value of
the previous segment (if any).
MAXVALUE Specifies a sub-range with no upper limit. In other words, it represents
a value greater than the maximum value that can exist in the data. The
maximum value depends on the data type of the segmentation column.

Notes
• The SEGMENTED BY expression syntax allows a general SQL expression but there is no
reason to use anything other than a single column reference (see "Column References" on
page 74) for range segmentation. If you want to use a different expression, the following
restrictions apply:
§ All leaf expressions must be either constants (on page 55) or column-references (see
"Column References" on page 74) to a column in the SELECT list of the CREATE
PROJECTION command
§ Aggregate functions are not allowed
§ The expression must return the same value over the life of the database.
• During INSERT or COPY to a segmented projection, if expression produces a value outside
the expected range (a negative value for example), no error occurs, and the row is added to
a segment of the projection.
• CREATE PROJECTION with range segmentation accepts the deprecated syntax SITE node
for compatibility with previous releases.
• CREATE PROJECTION with range segmentation allows the SEGMENTED BY expression to
be a single column-reference to a column in the projection-column list for compatibility with
previous releases. This syntax is considered to be a deprecated feature and causes a
warning message. See DEPRECATED syntax in the Troubleshooting Guide.

See Also
NODE_RESOURCES (page 444)

CREATE SCHEMA
Defines a new schema.

Syntax
CREATE SCHEMA schemaname [ AUTHORIZATION user_name ]

Parameters
schemaname Specifies the name of the schema to create. The schema
name must be distinct from all other schemas within the
database. If the schema name is not provided, the user
name is used as the schema name.
AUTHORIZATION user_name Assigns ownership of the schema to a user. If a user name
is not provided, the user who creates the schema is
assigned ownership. Only a Superuser is allowed to create

-344-
SQL Statements

a schema that is owned by a different user.

Notes
Optionally, CREATE SCHEMA could include the following sub-statements to create tables within
the schema:
• CREATE TABLE (page 346)
• GRANT
With the following exceptions, these sub-statements are treated as if they have been entered as
individual commands after the CREATE SCHEMA statement has completed:
• If the AUTHORIZATION statement is used, all tables are owned by the specified user.
• The CREATE SCHEMA statement and all its associated sub-statements are completed as
one transaction. If any of the statements fail, the entire CREATE SCHEMA statement is
rolled back.

Restrictions
To create a schema, the user must either be a superuser or have CREATE privilege for the
database.

Examples
The following example creates a schema named S1 with no objects.
CREATE SCHEMA S1;
The following example creates a schema named S1 with a table named T. It grants Fred, Aniket,
and Pequan access to all existing tables and ALL privileges on table T:
CREATE SCHEMA S1
CREATE TABLE T (C INT)
GRANT USAGE ON SCHEMA S1 TO Fred, Aniket, Pequan
GRANT ALL ON TABLE T TO Fred, Aniket, Pequan;

See Also
ALTER SCHEMA (page 314), SET SEARCH_PATH (page 397), and DROP SCHEMA (page
360)

-345-
346

CREATE TABLE
Creates a table in the logical schema.
Note: CREATE TABLE does not create a projection corresponding to the table. Every
column in the table must exist in at least one projection before you can store data in the
table.

Syntax
CREATE TABLE [schema-name.]table_name ( column-definition (on page 348) [ , ...
] )
[ PARTITION BY partition-clause ]

Parameters
[schema-name.]table-name Specifies the name of the table to be created. When using
more than one schema, specify the schema that contains the
table.
column-definition Defines one or more columns. See column-definition (on
page 348).
partition-clause All leaf expressions must be either constants or columns of the
table.
All other expressions must be functions and operators;
aggregate functions and subqueries are not permitted in the
expression.
partition-clause must calculate an idempotent value from its
arguments.
partition-clause must be not null.

Usage
Creating a table with the partition clause causes all projections anchored on that table to be
partitioned according to the partitioning clause.
For each partitioned projection, logically, there are as many partitions as the number of unique
values returned by the partitioned expression applied over the tuples of the projection.
Note: Due to the impact on the number of ROSs, explicit and implicit upper limits are
imposed on the number of partitions a projection can have; these limits, however, are
detected during the course of operation, such as during COPY.
Creating a partitioned table does not necessarily force all data feeding into a table’s projection
to be segregated immediately. Logically, the partition clause is applied after the segmented by
clause.
Partitioning specifies how data is organized at individual nodes in a cluster and after projection
data is segmented; only then is the data partitioned at each node based on the criteria in the
partitioning clause.

-346-
SQL Statements

Restrictions
• CREATE TABLE does not allow table constraints, only column and correlation constraints.
• If a database has had automatic recovery enabled, you must temporarily disable automatic
recovery in order to create a new table. In other words, you must:
SELECT MARK_DESIGN_KSAFE(0)
CREATE TABLE ...
CREATE PROJECTION ...
SELECT MARK_DESIGN_KSAFE(1)
• Cancelling a CREATE TABLE statement can cause unpredictable results. Vertica Systems,
Inc. recommends that you allow the statement to finish, then use DROP TABLE (page 362).

Examples
The following example creates a table named Product_Dimension in the Retail schema:
CREATE TABLE Retail.Product_Dimension (
Product_Key integer NOT NULL,
Product_Description varchar(128),
SKU_Number char(32) NOT NULL,
Category_Description char(32),
Department_Description char(32) NOT NULL,
Package_Type_Description char(32),
Package_Size char(32),
Fat_Content integer,
Diet_Type char(32),
Weight integer,
Weight_Units_of_Measure char(32),
Shelf_Width integer,
Shelf_Height integer,
Shelf_Depth integer
);
The following example partitions data by year:
CREATE TABLE fact ( ..., date_col date NOT NULL, ...)
PARTITION BY extract('year' FROM date_col);
The following example partitions data by state:
CREATE TABLE fact(..., state VARCHAR2 NOT NULL, ...)
PARTITION BY state;

See Also
CREATE TEMPORARY TABLE (page 351)
DROP_PARTITION (page 266)
DROP PROJECTION (page 360)
DUMP_PARTITION_KEYS (page 270)
DUMP_PROJECTION_PARTITION_KEYS (page 270)

-347-
SQL Reference Manual

DUMP_TABLE_PARTITION_KEYS (page 271)


PARTITION_PROJECTION (page 288)
PARTITION_TABLE (page 289)
Partitioning Tables in the Administrator's Guide

column-definition
A column definition specifies the name, data type, and constraints to be applied to a column.

Syntax
column-name data-type [ column-constraint (on page 349) [ ... ] ]

Parameters
column-name Specifies the name of a column to be created or added.
data-type Specifies one of the following data types:
BINARY (page 89)
BOOLEAN (page 93)
CHARACTER (page 94)
DATE/TIME (page 96)
NUMERIC (page 103)
column-constraint Specifies a column constraint (see "column-constraint" on page 349) to
apply to the column.

-348-
349

column-constraint
Adds a referential integrity constraint to the metadata of a column. See Adding Constraints in the
Administrator's Guide.

Syntax
[ CONSTRAINT constraint-name ]
{ [ NOT ] NULL
| PRIMARY KEY
| REFERENCES table-name
| UNIQUE
[ DEFAULT default ]
}

Parameters
CONSTRAINT Optionally assigns a name to the constraint. Vertica recommends that you
constraint-name name all constraints.
NULL [Default] Specifies that the column is allowed to contain null values.
NOT NULL Specifies that the column must receive a value during INSERT and UPDATE
operations. If no DEFAULT value is specified and no value is provided, the
INSERT or UPDATE statement returns an error because no default value
exists.
PRIMARY KEY Adds a referential integrity constraint defining the column as the primary key.
REFERENCES Adds a referential integrity constraint defining the column as a foreign key.
If column is omitted, the default is the primary key of table.
table-name Specifies the table to which the REFERENCES constraint applies.
column-name Specifies the column to which the REFERENCES constraint applies. If column
is omitted, the default is the primary key of table-name.
UNIQUE Ensures that the data contained in a column or a group of columns is unique
with respect to all the rows in the table.
DEFAULT default [Optional] Specifies a default data value for a column if the column is used in an
INSERT operation and no value is specified for the column. If there is no value
specified for the column and no default, the default is null.
Default value usage:
• A default value can be set for a column of any data type.
• The default value can be any variable-free expression as long as it
matches the data type of the column.
• Variable-free expressions can contain:
§ Constants
§ SQL functions
§ Null handling functions
§ System information functions
§ String functions

-349-
SQL Reference Manual

§ Numeric functions
§ Formatting functions
§ Nested functions
§ All Vertica supported operators
Default value restrictions:
• Expressions can contain only constant arguments.
• Subqueries and cross-references to other columns in the table are
not permitted in the expression.
• The return value of a Default expression cannot be NULL.
• The return data type of the default expression after evaluation
should either match that of the column for which it is defined or an
implicit cast between the two data-types should be possible. For
example, a character value cannot be cast to a numeric data type
implicitly, but a number data-type can be cast to character data type
implicitly.
• Default expressions when evaluated should conform to the bounds
for the column.
Volatile functions are not supported when adding columns to existing
tables. For example, RANDOM(), CURRVAL(), TIMEOFDAY(),
SYSDATE(), and SETVAL() are not supported. See ALTER TABLE
(page 316).

Notes
• A FOREIGN KEY constraint can be specified solely by a REFERENCE to the table that
contains the PRIMARY KEY. The columns in the referenced table do not need to be explicitly
specified; for example:
CREATE TABLE fact(c1 INTEGER PRIMARY KEY);
CREATE TABLE dim (c1 INTEGER REFERENCES fact);
• You must specify NOT NULL constraints on columns that are given PRIMARY and
REFERENCES constraints.
• Vertica does not support expressions in the DEFAULT clause.

Example
The following creates the store dimension table and sets the default column value for
Store_state to MA:
CREATE TABLE store_dimension (Store_state CHAR (2) DEFAULT MA);

-350-
351

CREATE TEMPORARY TABLE


Vertica supports transaction- and session-scoped GLOBAL temporary tables. The CREATE
TEMPORARY TABLE command defines a GLOBAL temporary table as one that is visible to all
users and sessions.
The contents (data) of table are private to the transaction or session where the data was
inserted. Data is automatically removed when the transaction commits or rolls back or the
session ends. This allows two users, A and B, to concurrently use the same temporary table but
see only data specific to his or her own transactions for the duration of those transactions or
sessions.
The definition of the temporary table persists in the database catalogs until explicitly removed by
using the DROP TABLE (page 362) statement.
A common use case for a temporary table is to divide complex query processing into multiple
steps. Typically, a reporting tool holds intermediate results while reports are generated (for
example, first get a result set, then query the result set, and so on). You can also write
subqueries.

Syntax
CREATE [ [ GLOBAL ] { TEMPORARY | TEMP } ]
TABLE table-name (
{ column-name data-type
[ DEFAULT default ]
[ NULL | NOT NULL ]
} ...
)
[ ON COMMIT { DELETE | PRESERVE } ROWS ]
[ NO PROJECTION ]

Parameters
GLOBAL [Optional] Specifies that the table definition is visible to all
sessions. Temporary table data is visible only to the session that
inserts the data into the table.
Temporary tables in Vertica are always global.
table-name Specifies the name of the temporary table to be created.
column-name Specifies the name of a column to be created in the new
temporary table.
data-type Specifies one of the following data types:
BINARY (page 89)
BOOLEAN (page 93)
CHARACTER (page 94)
DATE/TIME (page 96)
NUMERIC (page 103)

-351-
SQL Reference Manual

DEFAULT default [Optional] Specifies a default data value for a column if the column
is used in an INSERT operation and no value is specified for the
column. If there is no value specified for the column and no
default, the default is null.
Default value usage:
• A default value can be set for a column of any data type.
• The default value can be any variable-free expression as
long as it matches the data type of the column.
• Variable-free expressions can contain:
§ Constants
§ SQL functions
§ Null handling functions
§ System information functions
§ String functions
§ Numeric functions
§ Formatting functions
§ Nested functions
§ All Vertica supported operators
Default value restrictions:
• Expressions can contain only constant arguments.
• Subqueries and cross-references to other columns in the
table are not permitted in the expression.
• The return value of a Default expression cannot be
NULL.
• The return data type of the default expression after
evaluation should either match that of the column for
which it is defined or an implicit cast between the two
data-types should be possible. For example, a character
value cannot be cast to a numeric data type implicitly,
but a number data-type can be cast to character data
type implicitly.
• Default expressions when evaluated should conform to
the bounds for the column.
• Volatile functions are not supported when adding
columns to existing tables. For example, RANDOM(),
CURRVAL(), TIMEOFDAY(), SYSDATE(), and
SETVAL() are not supported. See ALTER TABLE (page
316).
NULL [Default] Specifies that the column is allowed to contain null
values.
NOT NULL Specifies that the column must receive a value during INSERT
and UPDATE operations. If no DEFAULT value is specified and no
value is provided, the INSERT or UPDATE statement returns an

-352-
SQL Statements

error because no default value exists.


ON COMMIT { PRESERVE | [Optional] Specifies whether data is transaction- or
DELETE } ROWS session-scoped:
DELETE marks the temporary table for transaction-scoped data.
Vertica truncates the table (delete all its rows) after each commit.
DELETE ROWS is the default.
PRESERVE marks the temporary table for session-scoped data,
which is preserved beyond the lifetime of a single transaction.
Vertica truncates the table (delete all its rows) when you terminate
a session.
NO PROJECTION [Optional] Prevents the automatic creation of a default
superprojection for the temporary table.

Notes
• Queries involving temporary tables have the same restrictions on SQL support as normal
queries that do not use temporary tables.
• Vertica automatically creates a default superprojection for a temporary table. This projection
is unsegmented and has the property that any data inserted into the table is local only to the
node that initiated the transaction. Vertica automatically chooses a sort order and
compression techniques for the projection.
In order to override this default behavior, use the NO PROJECTION keyword when creating
the temporary table and then use the CREATE PROJECTION statement to create your own
custom projections. For example, when using a temporary table as a dimension table in a
star query, bypass the default projection and instead create replicated projections on the
temporary tables just as you would do for a normal query.
• Prejoin projections that refer to both temporary and non-temporary tables are not supported.
• Vertica supports session-scoped isolation and statement-level rollback of temporary table
data.
• Single-node (pinned to the initiator node only) projections are supported.
• AT EPOCH LATEST queries that refer to session-scoped temporary tables work the same as
those for transaction-scoped temporary tables. Both return all committed and uncommitted
data regardless of epoch. For example, you can commit data from a temporary table in one
epoch, advance the epoch, and then commit data in a new epoch.
• Moveout and mergeout operations cannot be used on session-scoped temporary data.
• You cannot add projections to non-empty, session-scoped temporary tables. Make sure you
create projections before you load data.
• If you issue the TRUNCATE TABLE (page 403) statement against a temporary table, only
session-specific data is truncated with no affect on data in other sessions.
• The DELETE ... FROM TEMP TABLE syntax does not truncate data when the table was
created with PRESERVE; it marks rows for deletion. See DELETE (page 358) for additional
details.
• In general, session-scoped temporary table data is not visible using system (virtual) tables.

-353-
SQL Reference Manual

Example
Session-scoped rows in a GLOBAL temporary table can be preserved for the whole session or
for the current transaction only. For example, in the first statement below, ON COMMIT DELETE
ROWS indicates that the data should be deleted at the end of the transaction.
CREATE GLOBAL TEMP TABLE temp_table1 (
x NUMERIC,
y NUMERIC )
ON COMMIT DELETE ROWS;
By contrast, ON COMMIT PRESERVE ROWS indicates that the data should be preserved until
the end of the session.
CREATE GLOBAL TEMP TABLE temp_table2 (
x NUMERIC,
y NUMERIC )
ON COMMIT PRESERVE ROWS;

See Also
ALTER TABLE (page 316), CREATE TABLE (page 346), DELETE (page 358), DROP TABLE
(page 362), IMPLEMENT_TEMP_DESIGN (page 279)
Subqueries in the Programmer's Guide
Transactions in the Concepts Guide

-354-
355

CREATE USER
Adds a name to the list of authorized database users.

Syntax
CREATE USER name [ WITH [ ENCRYPTED | UNENCRYPTED ] PASSWORD 'password' ]

Parameters
name Specifies the name of the user to create; names that contain special
characters must be double-quoted.
Tip: Vertica database user names are logically separate from user
names of the operating system in which the server runs. If all the users of
a particular server also have accounts on the server's machine, it makes
sense to assign database user names that match their operating system
user names. However, a server that accepts remote connections could
have many database users who have no local operating system account,
and in such cases there need be no connection between database user
names and OS user names.
ENCRYPTED Is the default.A md5 encryption scheme is used
password Is the password to assign to the user.

Notes
• Only a superuser can create a user.
• User names are not case-sensitive.
• The following options are allowed but ignored:
§ SYSID uid
§ CREATEDB
§ NOCREATEDB
§ CREATEUSER
§ NOCREATEUSER
§ VALID UNTIL
• Other options, including IN GROUP, are not allowed.
• Newly-created users do not have access to schema PUBLIC by default. Make sure to
GRANT USAGE ON SCHEMA PUBLIC to all users you create.
• You can change a user password by using the ALTER USER statement. If you want to
configure a user to not have any password authentication, you can set the empty password ‘’
in CREATE or ALTER USER statements.
• Unencrypted passwords are visible in the catalog in clear text. Unless the database is used
solely for evaluation purposes, Vertica Systems recommends that all database users have
encrypted passwords.

Examples
CREATE USER Fred;

-355-
SQL Reference Manual

GRANT USAGE ON SCHEMA PUBLIC to Fred;

CREATE VIEW
Defines a new view.

Syntax
CREATE VIEW viewname [ ( column_name [, ...] ) ] AS query ]

Parameters
viewname Specifies the name of the view to create. The view name must be unique. Do
not use the same name as any table, view, or projection within the database.
If the view name is not provided, the user name is used as the view name.
column_name [Optional] Specifies the list of names to be used as column names for the
view. Columns are presented from left to right in the order given. If not
specified, Vertica automatically deduces the column names from the query.
query Specifies the query that the view executes. Vertica also uses the query to
deduce the list of names to be used as columns names for the view if they
are not specified.
Use a SELECT (page 382) statement to specify the query.The SELECT
statement can refer to tables, temp tables, and other views.

Notes
Views are read only. You cannot perform insert, update, delete, or copy operations on a view.
When Vertica processes a query that contains a view, the view is treated as a subquery because
the view name is replaced by the view's defining query. The following example defines a view
(ship) and illustrates how a query that refers to the view is transformed.
View: CREATE VIEW ship AS SELECT * FROM public.shipping_dimension;
Original Query: SELECT * FROM ship;
Transformed query: SELECT * FROM (SELECT * FROM public.shipping_dimension)
AS ship;
Use the DROP VIEW (page 364) statement to drop a view. Only the specified view is dropped.
Vertica does not support CASCADE functionality for views, and it does not check for
dependencies. Dropping a view causes any view that references it to fail.

Restrictions
To create a view, the user must be a superuser or have the following privileges:
• CREATE on the schema in which the view is created.
• SELECT on all the tables and views referenced within the view's defining query.

-356-
SQL Statements

• USAGE on all the schemas that contain the tables and views referenced within the view's
defining query.

Example
CREATE VIEW myview AS
SELECT SUM(annual_income), customer_state
FROM public.customer_dimension
WHERE customer_key IN
(SELECT customer_key
FROM store.store_sales_fact)
GROUP BY customer_state
ORDER BY customer_state ASC
The following example uses the myview view with a WHERE clause that limits the results to
combined salaries of greater than 2,000,000,000.
SELECT * FROM myview WHERE SUM > 2000000000;
2278679481 | CA
558455361 | CO
226947952 | FL
252410919 | GA
288327492 | IL
275529410 | MA
249433841 | MI
360368119 | MS
331044783 | TN
870156932 | TX
200938064 | UT
265890220 | WA
(12 rows)

See Also
SELECT (page 382), DROP VIEW (page 364), GRANT (View) (page 372), and REVOKE (View)
(page 377)

-357-
358

DELETE
Marks tuples as no longer valid in the current epoch. It does not delete data from disk storage for
base tables.

Syntax
DELETE FROM [schema_name.]table
WHERE clause (on page 386)

Parameters
[schema_name.] Specifies the name of an optional schema.
table Specifies the name of a base table or temporary table.

Using DELETE for temporary tables


To remove all rows from a temporary table, use a DELETE statement with no WHERE clause. In
this special case, the rows are not stored in the system, which greatly improves performance.
The effect is similar to when a COMMIT is issued, in that all rows are removed, but the columns,
projections, and constraints are preserved, thus making it easy to re-populate the table.
If you include a WHERE clause when performing delete operations on temporary tables,
DELETE behaves the same as for base tables, marking all delete vectors for storage, and you
lose any performance benefits.
DELETE FROM temp_table is the only way to truncate a temporary table without ending the
transaction.

Notes
• If the DELETE operation succeeds on temporary tables, you cannot roll back to a prior
savepoint.
• DELETE marks records for deletion in the WOS, so be cautions of WOS Overload.
• You cannot delete records from a projection.
• When using more than one schema, specify the schema that contains the table in your
DELETE statement.
• To use DELETE or UPDATE (page 408) commands with a WHERE clause, the user must
have both SELECT (page 382) and DELETE privileges on the table.

Examples
The following command truncates a temporary table called temp1:
DELETE FROM temp1;
The following command deletes all records from base table T where C1 = C2 - C1.
DELETE FROM T
WHERE C1=C2-C1;

-358-
SQL Statements

The following command deletes all records from the customer table in the retail schema where
the state attribute is in MA or NH:
DELETE FROM retail.customer
WHERE state IN ('MA', 'NH');

See Also
DROP TABLE (page 362) and TRUNCATE TABLE (page 403)
Deleting Data in the Administrator's Guide

-359-
SQL Reference Manual

DROP PROJECTION
Marks a projection to be dropped from the catalog so it is unavailable to user queries.

Syntax
DROP PROJECTION projname [ , projname ...] [ RESTRICT | CASCADE ]

Parameters
projname Matches the projname from a CREATE PROJECTION statement and reverses its
effect.
When using more than one schema, specify the schema that contains the
projection.
projname can be 'projname' or 'schema.projname'.
RESTRICT Drops the projection only if it does not contain any objects. RESTRICT is the
default.
CASCADE Drops the projection even if it contains one or more objects.

Notes
In previous versions of Vertica, you could drop all projections from a table. In order to prevent
data loss and inconsistencies, tables must now contain one superprojection, so DROP
PROJECTION fails if a projection is the table's only superprojection. In such cases, use the
DROP TABLE command.
To a drop projections:
DROP PROJECTION prejoin_p_site02;
Alternatively, you can issue a command like the following, which drops projections on a particular
schema:
DROP PROJECTION schema1.fact_proj_a, schema1.fact_proj_b;
If you want to drop a set of buddy projections, you could be prevented from dropping them
individually using a sequence of DROP PROJECTION statements due to K-Safety violations.
See MARK_DESIGN_KSAFE (page 285) for details.

See Also
CREATE PROJECTION (page 337), DROP TABLE (page 362), GET_PROJECTIONS (page
277), GET_PROJECTION_STATUS (page 277), and MARK_DESIGN_KSAFE (page 285)
Adding Nodes in the Administrator's Guide

DROP SCHEMA
Permanently removes a schema from the database. This is an irreversible process. Be sure that
you want to remove the schema and all its objects before you drop it.

-360-
SQL Statements

Syntax
DROP SCHEMA schema [ CASCADE | RESTRICT ]

Parameters
schema Specifies the name of the schema
CASCADE Drops the schema even if it contains one or more objects
RESTRICT Drops the schema only if it does not contain any objects (the
default)

By default, a schema cannot be dropped if it contains one or more objects. To force a drop, use
the CASCADE statement.

Restrictions
• The PUBLIC schema cannot be dropped.
• A schema can only be dropped by its owner or a superuser.

Notes
• A schema owner can drop a schema even if the owner does not own all the objects within the
schema. All the objects within the schema is also dropped.
• If a user is accessing any object within a schema that is in the process of being dropped, the
schema is not deleted until the transaction completes.
• Cancelling a DROP SCHEMA statement can cause unpredictable results.

Examples
The following example drops schema S1 only if it doesn't contain any objects:
DROP SCHEMA S1
The following example drops schema S1 whether or not it contains objects:
DROP SCHEMA S1 CASCADE

-361-
362

DROP TABLE
Drops a table and, optionally, its associated views and projections.

Syntax
DROP TABLE [ schema-name.]table [ CASCADE ]

Parameters
[schema-name.] [Optional] Specifies the name of an optional schema.
table Specifies the name of a schema table. When using more
than one schema, specify the schema that contains the
table in the DROP TABLE statement.
CASCADE [Optional] Drops all projections that include the table and all
views that reference the table.
Caution: Dropping a table and its associated projections
can destroy the K-Safety of your physical schema design.
Use this command only when absolutely necessary.

If you try to drop an table that has associated projections, a message listing the projections
displays. For example:
=> DROP TABLE d1;
NOTICE: Constraint - depends on Table d1
NOTICE: Projection d1p1 depends on Table d1
NOTICE: Projection d1p2 depends on Table d1
NOTICE: Projection d1p3 depends on Table d1
NOTICE: Projection f1d1p1 depends on Table d1
NOTICE: Projection f1d1p2 depends on Table d1
NOTICE: Projection f1d1p3 depends on Table d1
ERROR: DROP failed due to dependencies: Cannot drop Table d1 because other objects
depend on it
HINT: Use DROP ... CASCADE to drop the dependent objects too.
=> DROP TABLE d1 CASCADE;
DROP TABLE

Notes
• The table owner or schema owner or super user can drop a table.
Note: The schema owner can drop a table but cannot truncate a table.
• Canceling a DROP TABLE statement can cause unpredictable results.
• Vertica recommends that you make sure that all other users have disconnected before using
DROP TABLE.
• Dropping a table causes any view that references it to fail. However views that reference a
table that is dropped and then replaced by another table with the same name continue to
function using the contents of the new table, as long as the new table contains the same
columns and column names.
• Use the multiple projection syntax in K-safe clusters.

-362-
SQL Statements

See Also
DELETE (page 358), DROP PROJECTION (page 360), and TRUNCATE TABLE (page 403)
Adding Nodes and Deleting Data in the Administrator's Guide

-363-
364

DROP USER
Removes a name from the list of authorized database users.

Syntax
DROP USER name [, ...]

Parameters
name Specifies the name or names of the user to drop.

Examples

DROP USER Fred;

DROP VIEW
Removes the specified view.

Syntax
DROP VIEW viewname [ , ... ]

Parameters
viewname Specifies the name of the view to drop.

Notes
• Only the specified view is dropped. Vertica does not support cascade functionality for views
and it does not check for dependencies. Dropping a view causes any view that references it
to fail.
• Views that reference a view or table that is dropped and then replaced by another view or
table with the same name continue to function using the contents of the new view or table if it
contains the same column names. If the column data type changes, the server coerces the
old data type to the new one, if possible. Otherwise, it returns an error.

Restrictions
To drop a view, the user must be either a superuser or the person who created the view.

Examples
DROP VIEW myview;

-364-
365

EXPLAIN
Outputs the query plan.

Syntax
EXPLAIN { SELECT... | INSERT... | UPDATE... }

Output
Note: The EXPLAIN command is provided as a support feature and is not fully described
here. For information on how to interpret the output, contact Technical Support (page 33).
• A compact human-readable representation of the query plan, laid out hierarchically. For
example:
Vertica QUERY PLAN DESCRIPTION:
------------------------------
ID:1 Cost:2.7 Card:-1
Projection: P0
ID:2 Cost:0.1 Card:-1
DS: Value Idx
ProjCol:c_state, Table Oid.Attr#:25424.4
Pred: Y Out: P
ID:3 Cost:0.3 Card:-1
DS: Position Filtered by ID:2
ProjCol:c_gender, Table Oid.Attr#:25424.2
Pred: Y Out: P
ID:4 Cost:0.3 Card:-1
DS: Position Filtered by ID:3
ProjCol:c_name, Table Oid.Attr#:25424.3
Pred: Y Out: P
ID:5 Cost:1 Card:-1
DS: Position Filtered by ID:4
ProjCol:c_cid, Table Oid.Attr#:25424.1
Pred: N Out: V
ID:6 Cost:1 Card:-1
DS: Position Filtered by ID:4
ProjCol:c_state, Table Oid.Attr#:25424.4
Pred: N Out: V
• A GraphViz format of the graph for display in a graphical format. Graphviz is a graph plotting
utility with layout algorithms, etc. You can obtain a Fedora Core 4 RPM for GraphViz from:
yum -y install graphviz
A example of a GraphViz graph for a Vertica plan:
digraph G {
graph [rankdir=BT]
0[label="Root"];
1[label="ValExpNode"];
2[label="VDS:DVIDX(P0.c_state)"];
3[label="PDS(P0.c_gender)"];

-365-
SQL Reference Manual

4[label="PDS(P0.c_name)"];
5[label="Copy"];
6[label="PDS(P0.c_cid)"];
7[label="PDS(P0.c_state)"];
1->0 [label="V"];
1->0 [label="V"];
2->3 [label="P"];
3->4 [label="P"];
4->5 [label="P"];
5->6 [label="P"];
5->7 [label="P"];
6->1 [label="P+V"];
7->1 [label="P+V"]; }
• To create a picture of the plan, copy the output above to a file, in this example /tmp/x.txt:
1. dot -Tps /tmp/x.txt > /tmp/x.ps
2. ggv x.ps [evince x.ps works if you don't have ggv]
3. Alternative: dot -Tps | ghostview - and paste in the digraph.
4. Alternative: generate jpg using -Tjpg.
5. To scale an image for printing (8.5"x11" in this example):
6. Portrait: dot -Tps -Gsize="7.5,10" -Gmargin="0.5" ...
7. Landscape: dot -Tps -Gsize="10,7.5" -Gmargin="0.5" -Grotate="90" ...

-366-
SQL Statements

Example:

-367-
SQL Reference Manual

-368-
SQL Statements

GraphViz Information
http://www.graphviz.org/Documentation.php (http://www.graphviz.org/Documentation.php)

-369-
370

GRANT (Schema)
Grants privileges on a schema to a database user.
Note: In a database with trust authentication, the GRANT and REVOKE statements appear
to work as expected but have no actual effect on the security of the database.

Syntax
GRANT { { CREATE | USAGE } [, ...]
| ALL [ PRIVILEGES ]
}
ON SCHEMA schemaname [, ...]
TO { username | PUBLIC } [, ...]
[ WITH GRANT OPTION ]

Parameters
CREATE Allows the user read access to the schema and the right to create
tables and views within the schema.
USAGE Allows the user access to the objects contained within the
schema. This allows the user to look up objects within the schema.
Note that the user must also be granted access to the individual
objects. See the GRANT TABLE (page 371) and GRANT VIEW
(page 372) statements.
ALL Is synonymous with CREATE.
PRIVILEGES Is for SQL standard compatibility and is ignored.
schemaname Is the name of the schema for which privileges are being granted.
username Grants the privilege to a specific user.
PUBLIC Grants the privilege to all users.
WITH GRANT Allows the recipient of the privilege to grant it to other users.
OPTION

Notes
Newly-created users do not have access to schema PUBLIC by default. Make sure to grant
USAGE on schema PUBLIC to all users you create.

-370-
371

GRANT (Table)
Grants privileges on a table to a user.
Note: In a database with trust authentication, the GRANT and REVOKE statements appear
to work as expected but have no actual effect on the security of the database.

Syntax
GRANT { { SELECT | INSERT | UPDATE | DELETE | REFERENCES } [,...]
| ALL [ PRIVILEGES ]
}
ON [ TABLE ] [schema-name.]tablename [, ...]
TO { username | PUBLIC } [, ...]
[ WITH GRANT OPTION ]

Parameters
SELECT Allows the user to SELECT from any column of the specified table.
INSERT Allows the user to INSERT tuples into the specified table and to use
the COPY (page 323) command to load the table.
Note: COPY FROM STDIN is allowed to any user granted the
INSERT privilege, while COPY FROM <file> is an admin-only
operation.
UPDATE Allows the user to UPDATE tuples in the specified table.
DELETE Allows DELETE of a row from the specified table.
REFERENCES Is necessary to have this privilege on both the referencing and
referenced tables in order to create a foreign key constraint.
ALL Is synonomous with SELECT, INSERT, UPDATE, DELETE,
REFERENCES.
PRIVILEGES Is for SQL standard compatibility and is ignored.
[schema-name.]tab Specifies the table on which to grant the privileges. When using more
lename than one schema, specify the schema that contains the table on
which to grant privileges.
username Specifies the user to whom to grant the privileges.
PUBLIC Grants the privilege to all users.
WITH GRANT Allows the user to grant the same privileges to other users.
OPTION

Notes
To use the DELETE (page 358) or UPDATE (page 408) commands with a WHERE clause
(page 386), a user must have both SELECT and UPDATE and DELETE privileges on the table.

-371-
SQL Reference Manual

GRANT (View)
Grants privileges on a view to a database user.
Note: In a database with trust authentication, the GRANT and REVOKE statements appear
to work as expected but have no actual effect on the security of the database.

Syntax
GRANT { { SELECT }
| ALL [ PRIVILEGES ]
}
ON [schema-name.]viewname [, ...]
TO { username | PUBLIC } [, ...]
[ WITH GRANT OPTION ]

Parameters
SELECT Allows the user to perform SELECT operations on a view and the
resources referenced within it.
PRIVILEGES Is for SQL standard compatibility and is ignored.
ALL Is synonomous with SELECT.
[schema-name.]vie Specifies the view on which to grant the privileges. When using more
wname than one schema, specify the schema that contains the view.
username Specifies the user to whom to grant the privileges.
PUBLIC Grants the privilege to all users.
WITH GRANT Allows the user to grant the same privileges to other users.
OPTION

-372-
373

INSERT
Inserts values into the Write Optimized Store (WOS) for all projections of a table.

Syntax
INSERT [ /*+ direct */ ] INTO [schema-name.]table [ ( column [, ...] ) ]
{ DEFAULT VALUES |
VALUES ( { expression | DEFAULT } [, ...] ) | SELECT... (page 382)
}

Parameters
/*+ direct */ Writes the data directly to disk (ROS) instead of memory (WOS). This
syntax is only valid when used with INSERT...SELECT.
[schema-name.]tabl Specifies the name of a table in the schema. You cannot INSERT tuples
e into a projection. When using more than one schema, specify the
schema that contains the table.
column Specifies a column of the table.

DEFAULT VALUES Fills all columns with their default values as specified in CREATE
TABLE (page 346).

VALUES Specifies a list of values to store in the correspond columns. If no value


is supplied for a column, Vertica implicitly adds a DEFAULT value, if
present. Otherwise Vertica inserts a NULL value or, if the column is
defined as NOT NULL, returns an error.
expression Specifies a value to store in the corresponding column.

DEFAULT Stores the default value in the corresponding column.

SELECT... Specifies a query (SELECT (page 382) statement) that supplies the
rows to be inserted.

Notes
• An INSERT ... SELECT ... statement refers to tables in both its INSERT and SELECT
clauses. Isolation level applies only to the SELECT clauses and work just like an normal
query except that you cannot use AT EPOCH LATEST or AT TIME in an INSERT ... SELECT
statement. Instead, use the SET TRANSACTION CHARACTERISTICS (page 398)
statement to set the isolation level to READ COMMITTED. This is necessary in order to use
INSERT ... SELECT while the database is being loaded.

-373-
SQL Reference Manual

• You can list the target columns in any order. If no list of column names is given at all, the
default is all the columns of the table in their declared order; or the first N column names, if
there are only N columns supplied by the VALUES clause or query. The values supplied by
the VALUES clause or query are associated with the explicit or implicit column list
left-to-right.
• You must insert one complete tuple at a time.
• Be aware of WOS Overload.

Examples
INSERT INTO FACT VALUES (101, 102, 103, 104);
INSERT INTO CUSTOMER VALUES (10, 'male', 'DPR', 'MA', 35);
INSERT INTO Retail.T1 (C0, C1) VALUES (1, 1001);
INSERT INTO films
SELECT * FROM tmp_films
WHERE date_prod < '2004-05-07';

LCOPY
Is identical to the COPY (page 323) command, except that it loads data from a client system,
rather than a cluster host. LCOPY command is available only from the ODBC interface.

Example
The following code loads the table TEST from the file C:\load.dat located on a system where
the code is executed.
ODBCConnection<ODBCDriverConnect> test("VerticaSQL");
test.connect();
char *sql = "LCOPY test FROM 'C:\load.dat' DELIMITER '|';";
ODBCStatement stm(test.conn);
stm.execute(sql);

RELEASE SAVEPOINT
Destroys a savepoint without undoing the effects of commands executed after the savepoint was
established.

Syntax
RELEASE [ SAVEPOINT ] savepoint_name

Parameters
savepoint_name Specifies the name of the savepoint to destroy.

Notes
Once destroyed, the savepoint is unavailable as a rollback point.

-374-
SQL Statements

Example
The following example establishes and then destroys a savepoint called my_savepoint. The
values 101 and 102 are both inserted at commit.
INSERT INTO product_key VALUES (101);
SAVEPOINT my_savepoint;
INSERT INTO product_key VALUES (102);
RELEASE SAVEPOINT my_savepoint;
COMMIT;

See Also
SAVEPOINT (page 380) and ROLLBACK TO SAVEPOINT (page 379)

-375-
376

REVOKE (Schema)
Revokes privileges on a schema from a user.
Note: In a database with trust authentication, the GRANT and REVOKE statements appear
to work as expected but have no actual effect on the security of the database.

Syntax
REVOKE [ GRANT OPTION FOR ]
{ { CREATE | USAGE } [,...]
| ALL [ PRIVILEGES ]
}
ON SCHEMA schema-name [, ...]
FROM { username | PUBLIC } [, ...]

Parameters
GRANT OPTION FOR Revokes the grant option for the privilege, not the
privilege itself. If omitted, revokes both the privilege and
the grant option.
CREATE See GRANT (Schema) (page 370).
USAGE
ALL
PRIVILEGES
schema-name
username
PUBLIC

-376-
377

REVOKE (Table)
Revokes privileges on a table from a user.
Note: In a database with trust authentication, the GRANT and REVOKE statements appear
to work as expected but have no actual effect on the security of the database.

Syntax
REVOKE [ GRANT OPTION FOR ]
{ { SELECT | INSERT | UPDATE | DELETE | REFERENCES } [,...]
| ALL [ PRIVILEGES ]
}
ON [ TABLE ] [schema-name.]tablename [, ...]
FROM { username | PUBLIC } [, ...]

Parameters
GRANT OPTION FOR Revokes the grant option for the privilege, not the
privilege itself. If omitted, revokes both the privilege and
the grant option.
SELECT See GRANT (Table) (page 371).
INSERT
UPDATE
DELETE
REFERENCES
ALL
PRIVILEGES
[schema-name.]tabl
ename
username
PUBLIC

REVOKE (View)
Revokes privileges on a view from a user.
Note: In a database with trust authentication, the GRANT and REVOKE statements appear
to work as expected but have no actual effect on the security of the database.

Syntax
REVOKE [ GRANT OPTION FOR ]
{ { SELECT }
}
ON [ VIEW ] [schema-name.]viewname [, ...]
FROM { username | PUBLIC } [, ...]

Parameters
GRANT OPTION FOR Revokes the grant option for the privilege, not the privilege itself. If

-377-
SQL Reference Manual

omitted, revokes both the privilege and the grant option.


SELECT Allows the user to perform SELECT operations on a view and the
resources referenced within it.
PRIVILEGES Is for SQL standard compatibility and is ignored.
[schema-name.]vie Specifies the view on which to grant the privileges. When using more
wname than one schema, specify the schema that contains the view.
username Specifies the user to whom to grant the privileges.
PUBLIC Grants the privilege to all users.

-378-
379

ROLLBACK
Ends the current transaction and discards all changes that occurred during the transaction.

Syntax
ROLLBACK [ WORK | TRANSACTION ]

Parameters
WORK Have no effect; they are optional keywords for readability.
TRANSACTION

Notes
When an operation is rolled back, any locks that are acquired by the operation are also rolled
back.

ROLLBACK TO SAVEPOINT
Rolls back all commands that have been entered within the transaction since the given savepoint
was established.

Syntax
ROLLBACK TO [SAVEPOINT] savepoint_name

Parameters
savepoint_name Specifies the name of the savepoint to roll back to.

Notes
• The savepoint remains valid and can be rolled back to again later if needed.
• When an operation is rolled back, any locks that are acquired by the operation are also rolled
back.
• ROLLBACK TO SAVEPOINT implicitly destroys all savepoints that were established after the
named savepoint.

Example
The following example rolls back the values 102 and 103 that were entered after the savepoint,
my_savepoint, was established. Only the values 101 and 104 are inserted at commit.
INSERT INTO product_key VALUES (101);
SAVEPOINT my_savepoint;
INSERT INTO product_key VALUES (102);
INSERT INTO product_key VALUES (103);
ROLLBACK TO SAVEPOINT my_savepoint;
INSERT INTO product_key VALUES (104);
COMMIT;

-379-
SQL Reference Manual

See Also
RELEASE SAVEPOINT (page 374) and SAVEPOINT (page 380)

SAVEPOINT
SAVEPOINT is a transaction control command that creates a special mark, called a savepoint,
inside a transaction. A savepoint allows all commands that are executed after it was established
to be rolled back, restoring the transaction to the state it was in at the point in which the
savepoint was established.
Tip: Savepoints are useful when creating nested transactions. For example, a savepoint
could be created at the beginning of a subroutine. That way, the result of the subroutine
could be rolled back if necessary.

Syntax
SAVEPOINT savepoint_name

Parameters
savepoint_name Specifies the name of the savepoint to create.

Notes
• Savepoints are local to a transaction and can only be established when inside a transaction
block.
• Multiple savepoints can be defined within a transaction.
• If a savepoint with the same name already exists, it is replaced with the new savepoint.

Example
The following example illustrates how a savepoint determines which values within a transaction
can be rolled back. The values 102 and 103 that were entered after the savepoint,
my_savepoint, was established are rolled back. Only the values 101 and 104 are inserted at
commit.
INSERT INTO T1 (product_key) VALUES (101);
SAVEPOINT my_savepoint;
INSERT INTO T1 (product_key) VALUES (102);
INSERT INTO T1 (product_key) VALUES (103);
ROLLBACK TO SAVEPOINT my_savepoint;
INSERT INTO T1 (product_key) VALUES (104);
COMMIT;
SELECT product_key FROM T1;
--
101
104
(2 rows)

See Also
RELEASE SAVEPOINT (page 374) and ROLLBACK TO SAVEPOINT (page 379)

-380-
SQL Statements

-381-
382

SELECT
Retrieves a result set from one or more tables.
[ AT EPOCH LATEST ] | [ AT TIME 'timestamp' ]
SELECT [ ALL | DISTINCT ] ON ( expression [, ...] ) ] ]
* | expr [ AS ] output_name ] [, ...]
[ FROM (see "FROM Clause" on page 384) [, ...] ]
[ WHERE condition (see "WHERE Clause" on page 386) ]
[ GROUP BY expression (page 388) [, ...] ]
[ HAVING condition (see "HAVING Clause" on page 390) [, ...] ]
[ UNION (page 404) ]
[ ORDER BY expr (see "ORDER BY Clause" on page 391) [ ASC | DESC | USING
operator ] [, ...] ]
[ LIMIT (see "LIMIT Clause" on page 393) { count | ALL } ]
[ OFFSET (see "OFFSET Clause" on page 394) start ]

Parameters
AT EPOCH LATEST Queries all data in the database up to but not including the current epoch without
holding a lock or blocking write operations. See Snapshot Isolation for more
information. AT EPOCH LATEST is ignored when applied to temporary tables (all
rows are returned).
Note: By default, queries execute under the SERIALIZABLE isolation level,
which holds locks and blocks write operations. For optimal query performance,
use AT EPOCH LATEST.
AT TIME 'timestamp' Queries all data in the database up to and including the epoch representing the
specified date and time without holding a lock or blocking write operations. This is
called an Historical Query. AT TIME is ignored when applied to temporary tables
(all rows are returned).
* Is equivalent to listing all columns of the tables in the FROM Clause (page 384).
Note: Vertica recommends that you avoid using SELECT * for performance
reasons. An extremely large and wide result set can cause swapping.
DISTINCT Removes duplicate rows from the result set (or group).The DISTINCT set
quantifier must immediately follow the SELECT keyword. Only one DISTINCT
keyword can appear in the select list.
expression Forms the output rows of the SELECT statement. The expression can contain:
Column References (on page 74) to columns computed in the FROM clause
(page 384)
Constants (on page 55)
Mathematical Operators (on page 68)
String Concatenation Operators (on page 70)
Aggregate Expressions (page 72)
CASE Expressions (page 73)
SQL Functions (page 111)
expr Can be an analytic function (page 125).

-382-
SQL Statements

output_name Specifies a different name for an output column. This name is primarily used to
label the column for display. It can also be used to refer to the column's value in
ORDER BY (page 391) and GROUP BY (page 388) clauses, but not in the
WHERE (page 386) or HAVING (page 390) clauses.

Notes
The SELECT list (between the key words SELECT and FROM) specifies expressions that form
the output rows of the SELECT command.

See Also
Analytic Functions (page 125)
Subqueries and Joins in the Programmer's Guide

-383-
384

FROM Clause
Specifies one or more source tables from which to retrieve rows.

Syntax
FROM table-reference (on page 384) [ , table-reference (on page 384) ] ...
[ subquery ] [AS] name ...

Parameters
table-reference Is a table-primary (on page 384) or a joined-table (on
page 385).

Example
In the following example, the DISTINCT keyword makes sure each region is returned only once.
SELECT DISTINCT customer_region
FROM customer_dimension;
customer_region
-----------------
East
MidWest
NorthWest
South
SouthWest
West
(6 rows)

table-reference
Syntax
table-primary (on page 384) | joined-table (on page 385)

Parameters
table-primary Specifies an optionally qualified table name with optional
table aliases, column aliases, and outer joins.
joined-table Specifies an outer join.

table-primary
Syntax
{ table-name [ AS ] alias
[ ( column-alias [ , ...] ) ] [ , ...] ]
| ( joined-table (on page 385) ) }

-384-
SQL Statements

Parameters
table-name Specifies a table in the logical schema. Vertica selects a suitable
projection to use.
alias Specifies a temporary name to be used for references to the
table.
column-alias Specifies a temporary name to be used for references to the
column.
joined-table Specifies an outer join.

joined-table
Syntax
table-reference join-type table-reference
ON join-predicate (on page 83)

Parameters
table-reference Is a table-primary (page 384) or another joined-table.
join-type Is one of the following:
INNER JOIN
LEFT [ OUTER ] JOIN
RIGHT [ OUTER ] JOIN
join-predicate An equi-join based on one or more columns in the
joined tables.

Notes
• A query that uses INNER JOIN syntax in the FROM clause produces the same result set as
a query that uses the WHERE clause to state the join-predicate. See the topic, "ANSI Join
Syntax" in Joins in the Programmer's Guide for more information.
• The left-joined (outer) table in an outer join is the anchor table.

-385-
386

WHERE Clause
Eliminates rows from the result table that do not satisfy one or more predicates.

Syntax
WHERE boolean-expression
[ subquery ] ...

Parameters
boolean-expression Is an expression that returns true or false. Only rows for which the
expression is true become part of the result set.

The boolean-expression can include Boolean operators (on page 65) and the following
elements:
• BETWEEN-predicate (on page 79)
• Boolean-predicate (on page 80)
• Column-value-predicate (on page 81)
• IN-predicate (on page 82)
• Join-predicate (on page 83)
• LIKE-predicate (on page 84)
• NULL-predicate (on page 86)

Usage
You can use parentheses to group expressions, predicates, and boolean operators. For
example:
WHERE NOT (A=1 AND B=2) OR C=3;

Example
The following example returns the names of 20 customers in the Eastern region.
Note: Without the WHERE clause filter, the query returns all customer names in the
customer_dimension table.
SELECT DISTINCT customer_name
FROM customer_dimension
WHERE customer_region = 'East';

customer_name
------------------------
Alexander Brown
Alexander Greenwood
Alexander Martin
Alexander Miller
Alexander Rodriguez
Alexander Weaver
Alexander A. Jackson
Alexander A. Jones

-386-
SQL Statements

Alexander A. Lewis
Alexander A. Li
Alexander A. McCabe
Alexander A. Wilson
Alexander B. Gauthier
Alexander B. Jackson
Alexander B. Jefferson
Alexander B. Lewis
Alexander B. Nguyen
Alexander B. Perkins
Alexander B. Rodriguez
Alexander C. Fortin
(20 rows)

-387-
388

GROUP BY Clause
GROUP BY divides a query result set into groups of rows that match an expression.

Syntax
GROUP BY expression [ ,... ]

Parameters
expression Is any expression including constants and references to columns
(see "Column References" on page 74) in the tables specified in
the FROM clause.

Notes
• The expression cannot include aggregate functions (page 112).
• All non-aggregated columns in the SELECT list must be included in the GROUP BY clause.

Examples
The following example looks for customer name and city and groups the results by customer
name.
SELECT customer_name, customer_city
FROM customer_dimension
GROUP BY customer_name, customer_city
LIMIT 10;
customer_name | customer_city
---------------------+---------------
Alexander Bauer | Boston
Alexander Brown | Green Bay
Alexander Brown | Lafayette
Alexander Fortin | Palmdale
Alexander Garcia | Jacksonville
Alexander Goldberg | Downey
Alexander Greenwood | Independence
Alexander Jackson | Beaumont
Alexander Kramer | Elizabeth
Alexander Kramer | Phoenix
(10 rows)
The following example also looks for customer names but sums the average of their annual
income and sorts the results by customer name:
SELECT customer_name, AVG(annual_income) AS average_income
FROM customer_dimension
GROUP BY customer_name
LIMIT 10;
customer_name | average_income
---------------------+----------------
Alexander Bauer | 914301
Alexander Brown | 545041
Alexander Fortin | 81858
Alexander Garcia | 307990

-388-
SQL Statements

Alexander Goldberg | 909254


Alexander Greenwood | 725711
Alexander Jackson | 748682
Alexander Kramer | 419818
Alexander Li | 742141
Alexander Martin | 633009
(10 rows)
SELECT product_key + store_key AS key, sales_quantity + sales_dollar_amount AS sales, COUNT
FROM store.store_sales_fact
GROUP BY key, sales;

key | sales | count


-----+-------+-------
38 | 190 | 1
43 | -481 | 1
44 | -274 | 1
68 | 312 | 1
73 | 38 | 1
83 | 278 | 1
84 | 151 | 1
100 | 324 | 1
101 | 190 | 1
107 | 377 | 1
(10 rows)
SELECT RTRIM(customer_city) || LTRIM(customer_state) AS city_state, AVG(annual_income), COU
FROM customer_dimension
GROUP BY RTRIM(customer_city) || LTRIM(customer_state);

city_state | avg | count


--------------+------------------+-------
AbileneTX | 2505185.24637681 | 414
AlexandriaVA | 1353318.53249476 | 477
AllentownPA | 1966380.05393258 | 445
Ann ArborMI | 3052162.93208431 | 427
ArvadaCO | 3070815.40227273 | 440
AthensGA | 2131104.83801296 | 463
AustinTX | 1721137.53629977 | 427
BaltimoreMD | 1929520.38325991 | 454
BeaumontTX | 2751110.04338395 | 461
BellevueWA | 2307308.94799054 | 423
(10 rows)

Invalid Examples
The following example returns an error because the GROUP BY clause specifies a column not
included in the select list:
SELECT customer_name
FROM customer_dimension
GROUP BY customer_city;
ERROR: column "customer_dimension.customer_name" must appear in the GROUP BY
clause or be used in an aggregate function

-389-
390

HAVING Clause
Eliminates group rows that do not satisfy a predicate.

Syntax
HAVING predicate [, ...]

Parameters
predicate is the same as specified for the WHERE
clause (page 386).

Notes
• Each column referenced in predicate must unambiguously reference a grouping column,
unless the reference appears within an aggregate function.
• You can use expressions in the HAVING clause.

Example
The following example returns the employees with salaries greater than $50,000:
SELECT employee_last_name, MAX(annual_salary) as "highest_salary"
FROM employee_dimension
GROUP BY employee_last_name
HAVING MAX(annual_salary) > 50000;

employee_last_name | highest_salary
--------------------+----------------
Bauer | 920149
Brown | 569079
Campbell | 649998
Carcetti | 195175
Dobisz | 840902
Farmer | 804890
Fortin | 481490
Garcia | 811231
Garnett | 963104
Gauthier | 927335
(10 rows)

-390-
391

ORDER BY Clause
Sorts a query result set on one or more columns.

Syntax
ORDER BY expression [ ASC | DESC ] [, ...]

Parameters
expression Can be:
• The name or ordinal number of a SELECT list item
• An arbitrary expression formed from columns that do not
appear in the SELECT list
• A CASE (page 73) expression

Notes
• The ordinal number refers to the position of the result column, counting from the left
beginning at one. This makes it possible to order by a column that does not have a unique
name. (You can assign a name to a result column using the AS clause.)
• Vertica uses the ASCII collating sequence to store data and to compare character strings.
In general the order is:
§ Space
§ Numbers
§ Uppercase letters
§ Lowercase letters
Special characters collate in between and after the groups mentioned. See man ascii for
details.
• For INTEGER, INT, and DATE/TIME data types, NULL appears first (smallest) in ascending
order.
• For FLOAT, BOOLEAN, CHAR, and VARCHAR, NULL appears last (largest) in ascending
order.

Example
The follow example returns all records for customer Metamedia, sorted by customer_city in
ascending order.
SELECT customer_city
FROM customer_dimension
WHERE customer_name = 'Metamedia'
ORDER BY customer_city;

customer_city
---------------
Dallas
Erie
Fort Collins
Green Bay

-391-
SQL Reference Manual

Las Vegas
McAllen
San Diego
San Francisco
San Jose
Wichita Falls
(10 rows)

-392-
393

LIMIT Clause
Specifies the maximum number of result set rows to return.

Syntax
LIMIT { rows | ALL }

Parameters
rows Specifies the maximum number of rows to return
ALL Returns all rows (same as omitting LIMIT)

Notes
When both LIMIT and OFFSET (page 394) are used, Vertica skips the specified number of rows
before it starts to count the rows to be returned.
You can use LIMIT without an ORDER BY clause (page 391) that includes all columns in the
select list, but the query could produce non-deterministic results.

Non-deterministic: Omits the ORDER BY Deterministic: Specifies the ORDER BY


clause and returns any five records from the clause:
customer_dimension table:
SELECT customer_city SELECT customer_city
FROM customer_dimension FROM customer_dimension
LIMIT 5; ORDER BY customer_city
customer_city LIMIT 5;
--------------- customer_city
Baltimore ---------------
Nashville Abilene
Allentown Abilene
Clarksville Abilene
Baltimore Abilene
(5 rows) Abilene
(5 rows)

-393-
394

OFFSET Clause
Omits a specified number of rows from the beginning of the result set.

Syntax
OFFSET rows

Parameters
rows specifies the number of result set rows to
omit.

Notes
• When both LIMIT (page 393) and OFFSET are specified, specified number of rows are
skipped before starting to count the rows to be returned.
• When using OFFSET, use an ORDER BY clause (page 391). Otherwise the query returns
an undefined subset of the result set.

Example
The following example is similar to the the example used in the LIMIT clause (page 393). If you
want to see just records 6-10, however, use the OFFSET clause to skip over the first five cities:
SELECT customer_city
FROM customer_dimension
WHERE customer_name = 'Metamedia'
ORDER BY customer_city
LIMIT 10 OFFSET 5;

customer_city
---------------
McAllen
San Diego
San Francisco
San Jose
Wichita Falls
(10 rows)

SET
Sets one of several run-time parameters.

Syntax
SET run-time-parameter

Parameters
run-time-parameter Is one of the following:
DATESTYLE (page 396)
SEARCH_PATH

-394-
SQL Statements

SESSION CHARACTERISTICS
(page 398)
TIME ZONE (page 399)

-395-
396

DATESTYLE
The SET DATESTYLE command changes the DATESTYLE run-time parameter for the current
session.

Syntax
SET DATESTYLE TO { value | 'value' } [ ,... ]

Parameters
The DATESTYLE parameter can have multiple, non-conflicting values:

Value Interpretation Example


MDY month-day-year 12/17/1997
DMY day-month-year 17/12/1997
YMD year-month-day 1997-12-17
ISO ISO 8601/SQL 1997-12-17 07:37:16-08
standard (default)
SQL traditional style 12/17/1997 07:37:16.00 PST
GERMAN regional style 17.12.1997 07:37:16.00 PST

In the SQL style, day appears before month if DMY field ordering has been specified, otherwise
month appears before day. (See Date/Time Constants (page 58) for how this setting also
affects interpretation of input values.) The table below shows an example.

DATESTYLE Input Ordering Example Output


SQL, DMY day/month/year 17/12/1997 15:37:16.00 CET

SQL, MDY month/day/year 12/17/1997 07:37:16.00 PST

Notes
• The SQL standard requires the use of the ISO 8601 format. The name of the "SQL" output
format is a historical accident.
• INTERVAL output looks like the input format, except that units like CENTURY or WEEK are
converted to years and days and AGO is converted to an appropriate sign. In ISO mode the
output looks like
[ quantity unit [ ... ] ] [ days ] [ hours:minutes:seconds ]
• The SHOW (page 402) command displays the run-time parameters.

Example
SET DATESTYLE TO SQL, MDY;

-396-
SQL Statements

SEARCH_PATH
Specifies the order in which Vertica searches schemas when a SQL statement contains an
unqualified table name.
Vertica provides the SET search_path statement instead of the CURRENT_SCHEMA statement
found in some other databases.

Syntax
SET SEARCH_PATH TO schemaname [ , ... ]

Parameters
schemaname A comma-delimited list of schemas that indicates the order in which Vertica
searches schemas when a SQL statement contains an unqualified table
name.
The default value for this parameter is '"$user", public'
Where:
$User is the schema with the same name as the current user. If the schema
does not exist, $User is ignored.
public is the public database. Public is ignored if there is no schema named
'public'.

Notes
The first schema named in the search path is called the current schema. The current schema is
the first schema that Vertica searches. It is also the schema in which new tables are created if
the CREATE TABLE (page 346) command does not specify a schema name.

Restrictions
None

Examples
The following example sets the order in which Vertica searches schemas to T1, U1, and V1:
SET SEARCH_PATH TO T1, U1, V1;

-397-
398

SESSION CHARACTERISTICS
SET SESSION CHARACTERISTICS sets the transaction characteristics for subsequent
transactions of a user session. These are the isolation level and the access mode (read/write or
read-only).

Syntax
SET SESSION CHARACTERISTICS AS TRANSACTION
ISOLATION LEVEL { SERIALIZABLE | REPEATABLE READ |
READ COMMITTED | READ UNCOMMITTED }
{ READ WRITE | READ ONLY }

Parameters
ISOLATION LEVEL Determines what data the transaction can access when other
transactions are running concurrently. It does not apply to
temporary tables. The isolation level cannot be changed after the
first query (SELECT) or DML statement (INSERT, DELETE,
UPDATE) of a transaction has been executed.
SERIALIZABLE Provides the strictest level of SQL transaction isolation and is the
default in Vertica. This level emulates transactions executed one
after another, serially, rather than concurrently. It holds locks, and
blocks write operations and is thus not recommended for normal
query operations.
REPEATABLE READ Is automatically converted to SERIALIZABLE by Vertica.
READ COMMITTED Allows concurrent transactions. Use READ COMMITTED isolation
or Snapshot Isolation for normal query operations but be aware
that there is a subtle difference between them (see below).
READ UNCOMMITTED Is automatically converted to READ COMMITTED by Vertica.
READ WRITE Determines whether the transaction is read/write or read-only.
READ ONLY Read/write is the default. When a transaction is read-only, the
following SQL commands are disallowed: INSERT, UPDATE,
DELETE, and COPY if the table they would write to is not a
temporary table; all CREATE, ALTER, and DROP commands;
GRANT, REVOKE, and EXPLAIN if the command it would execute
is among those listed. This is a high-level notion of read-only that
does not prevent all writes to disk.

READ COMMITTED vs. Snapshot Isolation


By itself, AT EPOCH LATEST produces purely historical query behavior. However, with READ
COMMITTED, SELECT queries return the same result set as AT EPOCH LATEST plus any changes
made by the current transaction.
This is standard ANSI SQL semantics for ACID transactions. Any select query within a
transaction should see the transactions's own changes regardless of isolation level.

-398-
SQL Statements

Notes
• SERIALIZABLE isolation does not apply to temporary tables, which are isolated by their
transaction scope.
• Applications using SERIALIZABLE must be prepared to retry transactions due to serialization
failures.

TIME ZONE
Changes the TIME ZONE runtime parameter for the current session.

Syntax
SET TIME ZONE TO { value | 'value' }

Parameters
value Is one of the following:
One of the time zone names specified in the tz database, as
described in Sources for Time Zone and Daylight Saving
Time Data http://www.twinsun.com/tz/tz-link.htm. Time
Zone Names for Setting TIME ZONE (page 400) listed in
the next section are for convenience only and could be out
of date.
A signed integer representing an offset from UTC in hours
An interval value (page 60)

Notes
• TIME ZONE is a synonym for TIMEZONE. Both are allowed in Vertica syntax.
• The built-in constants LOCAL and DEFAULT, which set the time zone to the one specified in
the TZ environment variable or, if TZ is undefined, from the operating system time zone. See
Set the Default Time Zone and Using Time Zones with Vertica in the Installation and
Configuration Guide.
• When using a Country/City name, do not omit the country or the city. For example:
SET TIME ZONE TO 'Africa/Cairo'; -- valid
SET TIME ZONE TO 'Cairo'; -- invalid
• Include the required keyword TO.
• Positive integer values represent an offset east from UTC.
• The SHOW (page 402) command displays the run-time parameters.

Examples
SET TIME ZONE TO DEFAULT;
SET TIME ZONE TO 'PST8PDT'; -- Berkeley, California
SET TIME ZONE TO 'Europe/Rome'; -- Italy
SET TIME ZONE TO '-7'; -- UDT offset equivalent to PDT
SET TIME ZONE TO INTERVAL '-08:00 HOURS';

See Also
Using Time Zones with Vertica in the Installation and Configuration Guide

-399-
SQL Reference Manual

Time Zone Names for Setting TIME ZONE


The following time zone names are recognized by Vertica as valid settings for the SQL time zone
(the TIME ZONE run-time parameter).
Note: The names listed here are for convenience only and could be out of date. Refer to the
Sources for Time Zone and Daylight Saving Time Data
http://www.twinsun.com/tz/tz-link.htm page for precise information.
These names are not the same as the names shown in Time Zone Abbreviations For Input,
which are recognized by Vertica in date/time input values. The TIME ZONE names shown below
imply a local daylight-savings time rule, where date/time input names represent a fixed offset
from UTC.
In many cases there are several equivalent names for the same zone. These are listed on the
same line. The table is primarily sorted by the name of the principal city of the zone.

Time Zone

Africa

America

Antarctica

Asia

Atlantic

Australia

CET

EET

Etc/GMT

Europe

Factory

GMT GMT+0 GMT-0 GMT0 Greenwich Etc/GMT Etc/GMT+0 Etc/GMT-0


Etc/GMT0 Etc/Greenwich

Indian

MET

Pacific

-400-
SQL Statements

UCT Etc/UCT

UTC Universal Zulu Etc/UTC Etc/Universal Etc/Zulu

WET

In addition to the names listed in the table, Vertica accepts time zone names of the form
STDoffset or STDoffsetDST, where STD is a zone abbreviation, offset is a numeric offset in
hours west from UTC, and DST is an optional daylight-savings zone abbreviation, assumed to
stand for one hour ahead of the given offset. For example, if EST5EDT were not already a
recognized zone name, it would be accepted and would be functionally equivalent to USA East
Coast time. When a daylight-savings zone name is present, it is assumed to be used according
to USA time zone rules, so this feature is of limited use outside North America. One should also
be wary that this provision can lead to silently accepting bogus input, since there is no check on
the reasonableness of the zone abbreviations. For example, SET TIME ZONE TO FOOBANKO
works, leaving the system effectively using a rather peculiar abbreviation for GMT.

-401-
402

SHOW
Displays run-time parameters for the current session.

Syntax
SHOW { name | ALL }

Parameters
name Is one of:
DATESTYLE
TIME ZONE
ALL Shows all runtime parameters.

Notes
The SET < Runtime Parameter > command sets the run-time parameters.

Examples
SHOW ALL;
name | setting
-----------------------+------------------
datestyle | ISO, MDY
timezone | America/New_York
search_path | "$user", public
transaction_isolation | SERIALIZABLE
(4 rows)

SHOW SEARCH_PATH
Shows the order in which Vertica searches schemas when a SQL statement contains an
unqualified table name.

Syntax
SHOW SEARCH_PATH

Parameters
None

Restrictions
None

Example
SHOW SEARCH_PATH;

-402-
SQL Statements

name | setting
-------------+-----------------
search_path | "$user", public
(1 row)

TRUNCATE TABLE
TRUNCATE TABLE is a DDL statement that removes all storage associated with a table, while
preserving the table definitions. TRUNCATE TABLE auto-commits the current transaction after
statement execution and cannot be rolled back.

Syntax
TRUNCATE TABLE [schema_name.]table

Parameters
[schema_name.] Specifies the name of an optional schema.
table Specifies the name of a base table or temporary table.

Notes
• Only the superuser or database owner can truncate a table.
• The schema owner can drop a table but cannot truncate a table.
• TRUNCATE TABLE can be useful for testing purposes, letting you remove all table data and
not having to recreate projections.
• TRUNCATE TABLE commits the entire transaction, even if the TRUNCATE statement fails.
• If the truncated table is a fact table and contains prejoin projections, the projections show 0
rows after the transaction completes, and are ready for data reload.
• If the truncated table is a dimension table, the system returns the following error:
Cannot truncate a dimension table with pre-joined projections
Drop the prejoin projection first, and then issue the TRUNCATE command.
• If the truncated table has out-of-date projections, those projections are cleared and marked
up-to-date after the truncation operation completes.
• TRUNCATE TABLE takes X (exclusive) locks until the truncation process completes, when
savepoint is then released.
• To truncate a temporary table specified AS ON COMMIT DELETE ROWS without ending the
transaction, use DELETE FROM temp_table (page 358) syntax.
Note: The effect of DELETE FROM depends on the table type. If the table specified as ON
COMMIT DELETE ROWS then DELETE FROM works like TRUNCATE TABLE; otherwise it
behaves like a normal delete in that it does not truncate the table. So, it could say "To
truncate an "ON COMMIT DELETE ROWS" temporary table without..." instead. The deleting
data guide should be updated to reflect this as well

-403-
SQL Reference Manual

• After truncate operations complete, the data recovers from that current epoch onward.
Because TRUNCATE TABLE removes all history for the table, AT EPOCH queries return
nothing. TRUNCATE TABLE behaves the same when you have data in WOS, ROS, or both,
as well as for unsegmented / segmented projections.

See Also
DELETE (page 358) and DROP TABLE (page 362)
Transactions in the Concepts Guide
Deleting Data in the Administrator's Guide

UNION
Combines the results of two or more select statements.

Syntax
SELECT
... UNION [ ALL ] select
... [ UNION [ ALL ] select ]...
... [ ORDER BY { column-name | ordinal-number } [ ASC | DESC ] [, ...] ]
... [ LIMIT { integer | ALL } ]
... [ OFFSET integer ]
Note: SELECT statements can contain ORDER BY, LIMIT or OFFSET clauses if the
statement is enclosed within parentheses.

Usage
The results of several SELECT statements can be combined into a larger result using UNION.
Each SELECT statement produces results in which the UNION combines all those results into a
final single result. Specifically, a row in the results of a UNION operation must have existed in
the results from one of the SELECT statements. Each SELECT statement must have the same
number of items in the select list as well as compatible data types. If the data types are
incompatible, Vertica returns an error.
The results of a UNION contain only distinct rows. so use UNION ALL to keep duplicate rows.
UNION pays the performance price of eliminating duplicates; therefore, unless duplicate rows
are not wanted, use UNION ALL for its performance benefits.
A SELECT statement containing ORDER BY, LIMIT, or OFFSET clauses must be enclosed in
parentheses . If the statement is not enclosed in parentheses an error is returned. However, the
rightmost ORDER BY, LIMIT, or OFFSET clause in the UNION query does not need to be
enclosed in parentheses to the rightmost query. This indicates to perform these operations on
results of the UNION operation. GROUP BY and HAVING operations cannot be applied to the
results.

-404-
SQL Statements

The ordering of the results of a UNION operation does not necessarily depend on the ordering of
the results for each SELECT statement. The resulting rows can be ordered by adding an
ORDER BY to the UNION operation, as in the syntax above. If ORDER BY is used, only integers
and column names from the first (leftmost) SELECT statement are allowed in the order by list.
The integers specify the position of the columns on which to sort. The column names displayed
in the results are the same column names that display for the first (leftmost) select statement.

Notes
UNION correlated and noncorrelated subquery predicates are also supported.
SELECT * FROM T1
WHERE T1.x IN
(SELECT MAX(c1) FROM T2
UNION ALL
SELECT MAX(cc1) FROM T3
UNION ALL
SELECT MAX(d1) FROM T4);

Examples
Consider the following two tables:
Company_A
Id emp_lname dept sales
------+------------+-------------+-------
1234 | Vincent | auto parts | 1000
5678 | Butch | auto parts | 2500
9012 | Marcellus | floral | 500

Company B
Id emp_lname dept sales
------+------------+-------------+-------
4321 | Marvin | home goods | 250
9012 | Marcellus | home goods | 500
8765 | Zed | electronics | 20000
The following query lists all distinct IDs and surnames of employees:
SELECT id, emp_lname
FROM company_A
UNION
SELECT id, emp_lname
FROM company_B;

id | emp_lname
------+-----------
1234 | Vincent
4321 | Marvin
5678 | Butch
8765 | Zed
9012 | Marcellus
(5 rows)
The following query lists all IDs and surnames of employees:

-405-
SQL Reference Manual

SELECT id, emp_lname


FROM company_A
UNION ALL
SELECT id, emp_lname
FROM company_B;

id | emp_lname
------+-----------
1234 | Vincent
5678 | Butch
9012 | Marcellus
4321 | Marvin
8765 | Zed
9012 | Marcellus
(6 rows)
The next example returns the top two performing salespeople in each company combined:
(SELECT id, emp_lname, sales
FROM company_A
ORDER BY sales
LIMIT 2)
UNION ALL
(SELECT id, emp_lname, sales
FROM company_B
ORDER BY sales
LIMIT 2);

id | emp_lname | sales
------+-----------+-------
4321 | Marvin | 250
9012 | Marcellus | 500
9012 | Marcellus | 500
1234 | Vincent | 1000
(4 rows)
In this example, return all employee orders by sales. Note that the ORDER BY clause is applied
to the entire result:
SELECT id, emp_lname, sales
FROM company_A
UNION
SELECT id, emp_lname, sales
FROM company_B
ORDER BY sales;

id | emp_lname | sales
------+-----------+-------
4321 | Marvin | 250
9012 | Marcellus | 500
1234 | Vincent | 1000
5678 | Butch | 2500
8765 | Zed | 20000
(5 rows)

-406-
SQL Statements

And now sum the sales for each company, ordered by sales in descending order, and grouped
by department:
(SELECT 'company a' as company, dept, SUM(sales)
FROM company_a
GROUP BY dept
ORDER by 2 DESC)
UNION
(SELECT 'company b' as company, dept, SUM(sales)
FROM company_b
GROUP BY dept
ORDER by 2 DESC)
ORDER BY 1;
company | dept | sum
-----------+-------------+-------
company a | auto parts | 3500
company a | floral | 500
company b | electronics | 20000
company b | home goods | 750
(4 rows)
The final query shows the results of a mismatched data types:
SELECT id, emp_lname
FROM company_a
UNION
SELECT emp_lname, id
FROM company_b;
ERROR: UNION types int8 and character varying cannot be matched

See Also
SELECT (page 382)
Subqueries and UNION in Subqueries in the Programmer's Guide

-407-
408

UPDATE
Replaces the values of the specified columns in all rows for which a specific condition is true. All
other columns and rows in the table are unchanged.

Syntax
UPDATE [schema-name.]table SET column = { expression | DEFAULT } [, ...]
[ FROM from-list ]
[ WHERE clause (on page 386) ]

Parameters
[schema-name.]table Specifies the name of a table in the schema. When using more than
one schema, specify the schema that contains the table.
You cannot UPDATE a projection.
column Specifies the name of a non-key column in the table.
expression Specifies a value to assign to the column. The expression can use
the current values of this and other columns in the table. For
example:
UPDATE T1 SET C1 = C1+1;
from-list A list of table expressions, allowing columns from other tables to
appear in the WHERE condition and the update expressions. This is
similar to the list of tables that can be specified in the FROM Clause
(on page 384) of a SELECT command. Note that the target table
must not appear in the from-list.

Notes
• UPDATE inserts new records into the WOS and marks the old records for deletion. Be aware
of WOS Overload.
• You cannot UPDATE columns that have primary key or foreign key referential integrity
constraints.
• To use the DELETE (page 358) or UPDATE (page 408) commands with a WHERE clause
(page 386), you must have both SELECT and DELETE privileges on the table.

Examples
UPDATE FACT SET PRICE = PRICE - COST * 80 WHERE COST > 100;
UPDATE Retail.CUSTOMER SET STATE = 'NH' WHERE CID > 100;

-408-
SQL System Tables (Monitoring APIs)
Vertica provides system tables that let you monitor the health of your database. These tables
can be queried the same way you perform query operations on base or temporary tables, using
SELECT support, such as expressions, predicates, aggregates, analytics, subqueries, and joins.
For example:
SELECT t.table_name AS table_name,
SUM(ps.wos_row_count + ps.ros_row_count) AS row_count,
SUM(ps.wos_used_bytes + ps.ros_used_bytes) AS byte_count
FROM tables t
JOIN projections p ON t.table_id = p.anchor_table_id
JOIN projection_storage ps on p.projection_name = ps.projection_name
WHERE (ps.wos_used_bytes + ps.ros_used_bytes) > 500000
GROUP BY t.table_name
ORDER BY byte_count DESC;
table_name | row_count | byte_count
--------------------+-----------+------------
online_sales_fact | 200000 | 11920371
store_sales_fact | 200000 | 7621694
product_dimension | 240000 | 7367560
customer_dimension | 200000 | 6981564
store_orders_fact | 200000 | 5126330
(5 rows)
System tables are grouped into one of two schemas:
• A catalog schema called v_catalog (page 411)
• A monitoring schema called v_monitor (page 424)
The system table schemas reside in the default search path so there is no need to specify
schema.table in queries unless you change the search path to exclude v_monitor and
v_catalog. See Setting Schema Search Paths in the Administrator's Guide.

Restrictions and Cautions


• DDL and DML are not supported
• System tables do not hold historical data

You can use external monitoring tools or scripts to query the system tables and act upon the
information as necessary. For example, when a host failure causes the K-Safety level to fall
below a desired level, the tool or script can notify the database administrator and/or appropriate
IT personnel.
To view all of the system tables issue the following command:
SELECT * FROM system_tables;
The following two tables list the catalog and monitor tables. See also Using the SQL Monitoring
API in the Administrator's Guide.

-409-
SQL Reference Manual

Table 1: System tables in the v_catalog schema

Catalog Tables Description


COLUMNS (page 411) Provides information about columns.
FOREIGN_KEYS (page 412) Provides foreign key information.

GRANTS (page 413) Provides grant information.

PRIMARY_KEYS (page 415) Provides primary key information.

PROJECTIONS (page 416) Provides information about projections.

TABLE_CONSTRAINTS (page Provides information about table constraints.


418)
TABLES (page 419) Provides information about tables in the database.

TYPES (page 420) Provides information about supported data types.

USERS (page 421) Provides information about users.

VIEW_COLUMNS (page 421) Provides view attribute information.

VIEWS (page 422) Provides information about all views within the
system.
SYSTEM_TABLES (page 423) Displays a list of all system table names.

Table 2: System tables in the v_monitor schema

Monitor Tables Description


ACTIVE_EVENTS (page 424) Displays all the active events in the cluster.
COLUMN_STORAGE (page 426) Returns the amount of disk storage used by each
column of each projection on each node.
CURRENT_SESSION (page 428) Returns information about the current active session.
DISK_RESOURCE_REJECTION Returns requests for resources that are rejected due
S (page 431) to disk space shortages.
DISK_STORAGE (page 432) Returns the amount of disk storage used by the
database on each node.
EVENT_CONFIGURATIONS Returns configuration information about current
(page 435) events.
EXECUTION_ENGINE_PROFILI Returns information regarding query execution runs.
NG (page 436)
HOST_RESOURCES (page 437) Returns information about host profiling.

-410-
SQL System Tables (Monitoring APIs)

LOAD_STREAMS (page 440) Returns load metrics for each load stream on each
node.
LOCAL_NODES (page 441) Monitors the status of local nodes in the cluster.

LOCKS (page 441) Monitors lock grants and requests for all nodes.

NODE_RESOURCES (page 444) Provides a snapshot of the node. This is useful for
regularly polling the node with automated tools or
scripts.
PARTITIONS (page 445) Displays partition metadata, one row per partition key,
per ROS container.
PROJECTIONS (page 416) Returns information regarding projections.
PROJECTION_REFRESHES Returns information about refresh operations for
(page 446) projections.
PROJECTION_STORAGE (page Returns the amount of disk storage used by each
448) projection on each node.
QUERY_METRICS (page 449) Monitors the sessions and queries executing on each
node.
QUERY_PROFILES (page 450) Provides information regarding executed queries.

RESOURCE_REJECTIONS Returns requests for resources that are rejected by


(page 452) the resource manager.
RESOURCE_USAGE (page 454) Returns system resource management on each node.

SESSION_PROFILES (page 457) Provides basic session parameters and lock time out
data.
SESSIONS (page 458) Monitors external sessions.

STORAGE_CONTAINERS (page Monitors information about each storage container in


460) the database.
SYSTEM (page 461) Monitors the overall state of the database.

TUPLE_MOVER_OPERATIONS Monitors the status of the Tuple Mover on each node.


(page 462)
WOS_CONTAINER_STORAGE Monitors information about WOS storage, which is
(page 463) divided into regions.

V_CATALOG Schema
The system tables in this section reside in the v_catalog schema.

COLUMNS
Provides table column information.

-411-
SQL Reference Manual

Column Name Data Type Description


TABLE_ID INTEGER The unique table OID assigned by the Vertica catalog
TABLE_SCHEMA VARCHAR The name of the schema
TABLE_NAME VARCHAR The name of the table
IS_SYSTEM_TABLE VARCHAR Indicates whether the table is a system table, where t is true
and f is false.
COLUMN_NAME VARCHAR The name of the column.
DATA_TYPE VARCHAR The data type assigned to the column.
DATA_TYPE_ID INTEGER A unique ID assigned by the <Vertica catalog.
CHARACTER_MAXIMUM_LENG VARCHAR The maximum allowable length of the column.
TH
ORDINAL_POSITION VARCHAR The column's position in the table.
IS_NULLABLE VARCHAR Indicates whether the column can contain null values, where
t is true and f is false.
COLUMN_DEFAULT VARCHAR The default value of a column, such as empty or expression.

Example
=> SELECT table_schema, table_name, column_name, data_type, is_nullable
FROM columns WHERE table_schema = 'store' AND data_type = 'Date';
table_schema | table_name | column_name | data_type |
is_nullable
--------------+-------------------+------------------------+-----------+------
-------
store | store_dimension | first_open_date | Date | f
store | store_dimension | last_remodel_date | Date | f
store | store_orders_fact | date_ordered | Date | f
store | store_orders_fact | date_shipped | Date | f
store | store_orders_fact | expected_delivery_date | Date | f
store | store_orders_fact | date_delivered | Date | f
(6 rows)

FOREIGN_KEYS
Provides foreign key information.

Column Name Data Type Description


CONSTRAINT_ID INTEGER The object ID assigned by the Vertica catalog.
CONSTRAINT_NAME VARCHAR The name of the constraint.
COLUMN_NAME VARCHAR The name of the column that is constrained.

-412-
SQL System Tables (Monitoring APIs)

ORDINAL_POSITION VARCHAR The position of the constraint respective to other


constraints in the table.
TABLE_NAME VARCHAR The name of the table
REFERENCE_TABLE_NAME VARCHAR References the TABLE_NAME column in the
PRIMARY_KEY table.
CONSTRAINT_TYPE VARCHAR The constraint type, f, for foreign key.
REFERENCE_COLUMN_NAM VARCHAR References the COLUMN_NAME column in the
E PRIMARY_KEY table.
TABLE_SCHEMA VARCHAR The name of the schema
REFERENCE_TABLE_SCHE VARCHAR References the TABLE_SCHEMA column in the
MA PRIMARY_KEY table.

Example
SELECT constraint_name, table_name, ordinal_position, reference_table_name
FROM foreign_keys
ORDER BY 3;
constraint_name | table_name | ordinal_position | reference_table_name
---------------------------+-------------------+------------------+-----------------------
fk_store_sales_date | store_sales_fact | 1 | date_dimension
fk_online_sales_saledate | online_sales_fact | 1 | date_dimension
fk_store_orders_product | store_orders_fact | 1 | product_dimension
fk_inventory_date | inventory_fact | 1 | date_dimension
fk_inventory_product | inventory_fact | 2 | product_dimension
fk_store_sales_product | store_sales_fact | 2 | product_dimension
fk_online_sales_shipdate | online_sales_fact | 2 | date_dimension
fk_store_orders_product | store_orders_fact | 2 | product_dimension
fk_inventory_product | inventory_fact | 3 | product_dimension
fk_store_sales_product | store_sales_fact | 3 | product_dimension
fk_online_sales_product | online_sales_fact | 3 | product_dimension
fk_store_orders_store | store_orders_fact | 3 | store_dimension
fk_online_sales_product | online_sales_fact | 4 | product_dimension
fk_inventory_warehouse | inventory_fact | 4 | warehouse_dimension
fk_store_orders_vendor | store_orders_fact | 4 | vendor_dimension
fk_store_sales_store | store_sales_fact | 4 | store_dimension
fk_store_orders_employee | store_orders_fact | 5 | employee_dimension
fk_store_sales_promotion | store_sales_fact | 5 | promotion_dimension
fk_online_sales_customer | online_sales_fact | 5 | customer_dimension
fk_store_sales_customer | store_sales_fact | 6 | customer_dimension
fk_online_sales_cc | online_sales_fact | 6 | call_center_dimension
fk_store_sales_employee | store_sales_fact | 7 | employee_dimension
fk_online_sales_op | online_sales_fact | 7 | online_page_dimension
fk_online_sales_shipping | online_sales_fact | 8 | shipping_dimension
fk_online_sales_warehouse | online_sales_fact | 9 | warehouse_dimension
fk_online_sales_promotion | online_sales_fact | 10 | promotion_dimension
(26 rows)

GRANTS
Provides grant information.

Column Name Data Type Description


GRANTEE_ID INTEGER The grantee object ID (OID) from the catalog.

-413-
SQL Reference Manual

GRANTEE VARCHAR The user being granted permission.

GRANTOR_ID INTEGER The object ID from the catalog.

GRANTOR VARCHAR The user granting permission.

PRIVILEGES INTEGER The bitmask representation of the privileges being


granted.
PRIVILEGES_DESCRIPTI VARCHAR A readable description of the privileges being
ON granted; for example INSERT, SELECT.
OBJECT_ID INTEGER The unique OID assigned by the Vertica catalog

TABLE_SCHEMA VARCHAR The name of the schema

TABLE_NAME VARCHAR The name of the table

Notes
The vsql commands \dp and \z both include the schema name in the output:
mydb=> \dp
Access privileges for database "vmartdb"
Grantee | Grantor | Privileges | Schema | Name
---------+---------+------------+--------+-----------------
| release | USAGE | | public
| vertica | USAGE | | monitoring
| vertica | USAGE | | catalog
| vertica | USAGE | | system
| release | USAGE | | v_internal
| release | USAGE | | v_catalog
| release | USAGE | | v_monitor
| release | USAGE | | designer_system
(8 rows)
mydb=> \z
Access privileges for database "vmartdb"
Grantee | Grantor | Privileges | Schema | Name
---------+---------+------------+--------+-----------------
| release | USAGE | | public
| vertica | USAGE | | monitoring
| vertica | USAGE | | catalog
| vertica | USAGE | | system
| release | USAGE | | v_internal
| release | USAGE | | v_catalog
| release | USAGE | | v_monitor
| release | USAGE | | designer_system
(8 rows)
The vsql command \dp *.tablename displays table names in all schemas. This command lets
you distinguish the grants for same-named tables in different schemas:

-414-
SQL System Tables (Monitoring APIs)

$ \dp *.events;
Access privileges for database "dbadmin"
Grantee | Grantor | Privileges | Schema | Table
---------+---------+--------------------------------------------+---------+--------
user2 | dbadmin | INSERT, SELECT, UPDATE, DELETE, REFERENCES | schema1 | events
user1 | dbadmin | SELECT | schema1 | events
user2 | dbadmin | INSERT, SELECT, UPDATE, DELETE, REFERENCES | schema2 | events
user1 | dbadmin | INSERT, SELECT | schema2 | events
(4 rows)

The vsql command \dp schemaname.* displays all tables in the named schema:
$ \dp schema1.*
Access privileges for database "dbadmin"
grantee | grantor | privileges_description | table_schema | table_name
---------+---------+--------------------------------------------+--------------+------------
user2 | dbadmin | INSERT, SELECT, UPDATE, DELETE, REFERENCES | schema1 | events
user1 | dbadmin | SELECT | schema1 | events
(2 rows)

Call the GRANTS table:


SELECT * FROM GRANTS;

PRIMARY_KEYS
Provides primary key information.

Column Name Data Type Description


CONSTRAINT_ID INTEGER The object ID assigned by the Vertica catalog.
CONSTRAINT_NAME VARCHA The name of the constraint.
R
COLUMN_NAME VARCHA The name of the column.
R
ORDINAL_POSITION VARCHA The position of the constraint respective to other
R constraints in the table.
TABLE_NAME VARCHA The name of the table
R
CONSTRAINT_TYPE VARCHA The constraint type, p, for primary key.
R
TABLE_SCHEMA VARCHA The name of the schema
R

Example
Request specific columns from the PRIMARY_KEYS table:
SELECT constraint_name, table_name, ordinal_position, table_schema
FROM foreign_keys ORDER BY 3;

constraint_name | table_name | ordinal_position | table_schema


---------------------------+-------------------+------------------+--------------
fk_store_sales_date | store_sales_fact | 1 | store

-415-
SQL Reference Manual

fk_online_sales_saledate | online_sales_fact | 1 | online_sales


fk_store_orders_product | store_orders_fact | 1 | store
fk_inventory_date | inventory_fact | 1 | public
fk_inventory_product | inventory_fact | 2 | public
fk_store_sales_product | store_sales_fact | 2 | store
fk_online_sales_shipdate | online_sales_fact | 2 | online_sales
fk_store_orders_product | store_orders_fact | 2 | store
fk_inventory_product | inventory_fact | 3 | public
fk_store_sales_product | store_sales_fact | 3 | store
fk_online_sales_product | online_sales_fact | 3 | online_sales
fk_store_orders_store | store_orders_fact | 3 | store
fk_online_sales_product | online_sales_fact | 4 | online_sales
fk_inventory_warehouse | inventory_fact | 4 | public
fk_store_orders_vendor | store_orders_fact | 4 | store
fk_store_sales_store | store_sales_fact | 4 | store
fk_store_orders_employee | store_orders_fact | 5 | store
fk_store_sales_promotion | store_sales_fact | 5 | store
fk_online_sales_customer | online_sales_fact | 5 | online_sales
fk_store_sales_customer | store_sales_fact | 6 | store
fk_online_sales_cc | online_sales_fact | 6 | online_sales
fk_store_sales_employee | store_sales_fact | 7 | store
fk_online_sales_op | online_sales_fact | 7 | online_sales
fk_online_sales_shipping | online_sales_fact | 8 | online_sales
fk_online_sales_warehouse | online_sales_fact | 9 | online_sales
fk_online_sales_promotion | online_sales_fact | 10 | online_sales
(26 rows)

PROJECTIONS
Provides information about projections.

Column Name Data Type Description


PROJECTION_SCHEMA_ID INTEGER A unique numeric ID (OID) that identifies the specific
schema that contains the projection.
PROJECTION_SCHEMA VARCHA The name of the schema that contains the projection.
R
PROJECTION_ID INTEGER A unique numeric ID (OID) that identifies the projection.
PROJECTION_NAME VARCHA The name of the projection.
R
OWNER_ID INTEGER A unique numeric ID (OID) that identifies the owner of the
projection.
OWNER_NAME VARCHA The name of the projection's owner.
R
ANCHOR_TABLE_ID INTEGER The unique numeric identification (OID) of the anchor table,
for pre-join projections, or the OID of the table from which
the projection was created if it isn't a pre-join projection.
ANCHOR_TABLE_NAME VARCHA The name of the anchor table, for pre-join projections, or the
R name of the table from which the projection was created if it
isn't a pre-join projection.
NODE_ID INTEGER A unique numeric ID (OID) that identifies the node, or
nodes, that contain the projection.

-416-
SQL System Tables (Monitoring APIs)

NODE_NAME VARCHA The name of the node, or nodes, that contain the projection.
R
IS_PREJOIN BOOLEA Indicates whether or not the projection is a pre-join
N projection where t is true and f is false.
CREATED_EPOCH INTEGER The epoch in which the projection was created.
VERIFIED_FAULT_TOLERAN INTEGER K-safety value for the projection.
CE
IS_UP_TO_DATE BOOLEA Indicates whether the projection is current where t is true
N and f is false. Projections must be up-to-date to be used in
queries.

Example
SELECT projection_name, anchor_table_name, is_prejoin, is_up_to_date
FROM projections;

projection_name | anchor_table_name | is_prejoin | is_up_to_date


------------------------------+-----------------------+------------+---------------
customer_dimension_site01 | customer_dimension | f | t
customer_dimension_site02 | customer_dimension | f | t
customer_dimension_site03 | customer_dimension | f | t
customer_dimension_site04 | customer_dimension | f | t
product_dimension_site01 | product_dimension | f | t
product_dimension_site02 | product_dimension | f | t
product_dimension_site03 | product_dimension | f | t
product_dimension_site04 | product_dimension | f | t
store_sales_fact_p1 | store_sales_fact | t | t
store_sales_fact_p1_b1 | store_sales_fact | t | t
store_orders_fact_p1 | store_orders_fact | t | t
store_orders_fact_p1_b1 | store_orders_fact | t | t
online_sales_fact_p1 | online_sales_fact | t | t
online_sales_fact_p1_b1 | online_sales_fact | t | t
promotion_dimension_site01 | promotion_dimension | f | t
promotion_dimension_site02 | promotion_dimension | f | t
promotion_dimension_site03 | promotion_dimension | f | t
promotion_dimension_site04 | promotion_dimension | f | t
date_dimension_site01 | date_dimension | f | t
date_dimension_site02 | date_dimension | f | t
date_dimension_site03 | date_dimension | f | t
date_dimension_site04 | date_dimension | f | t
vendor_dimension_site01 | vendor_dimension | f | t
vendor_dimension_site02 | vendor_dimension | f | t
vendor_dimension_site03 | vendor_dimension | f | t
vendor_dimension_site04 | vendor_dimension | f | t
employee_dimension_site01 | employee_dimension | f | t
employee_dimension_site02 | employee_dimension | f | t
employee_dimension_site03 | employee_dimension | f | t
employee_dimension_site04 | employee_dimension | f | t
shipping_dimension_site01 | shipping_dimension | f | t
shipping_dimension_site02 | shipping_dimension | f | t
shipping_dimension_site03 | shipping_dimension | f | t
shipping_dimension_site04 | shipping_dimension | f | t
warehouse_dimension_site01 | warehouse_dimension | f | t
warehouse_dimension_site02 | warehouse_dimension | f | t
warehouse_dimension_site03 | warehouse_dimension | f | t
warehouse_dimension_site04 | warehouse_dimension | f | t
inventory_fact_p1 | inventory_fact | f | t
inventory_fact_p1_b1 | inventory_fact | f | t
store_dimension_site01 | store_dimension | f | t

-417-
SQL Reference Manual

store_dimension_site02 | store_dimension | f | t
store_dimension_site03 | store_dimension | f | t
store_dimension_site04 | store_dimension | f | t
online_page_dimension_site01 | online_page_dimension | f | t
online_page_dimension_site02 | online_page_dimension | f | t
online_page_dimension_site03 | online_page_dimension | f | t
online_page_dimension_site04 | online_page_dimension | f | t
call_center_dimension_site01 | call_center_dimension | f | t
call_center_dimension_site02 | call_center_dimension | f | t
call_center_dimension_site03 | call_center_dimension | f | t
call_center_dimension_site04 | call_center_dimension | f | t
(52 rows)

TABLE_CONSTRAINTS
Provides information about table constraints.

Column Name Data Type Description


CONSTRAINT_ID VARCHAR The constraint object ID from the table
assigned by Vertica.
CONSTRAINT_NAME VARCHAR The name of the constraint, if specified:
UNIQUE, FOREIGN KEY, NOT NULL, or
PRIMARY KEY.
CONSTRAINT_SCHEMA_ID INTEGER The schema object ID assigned by Vertica.
CONSTRAINT_KEY_COUNT INTEGER The number of constraint keys.
FOREIGN_KEY_COUNT INTEGER The number of foreign keys.
TABLE_ID INTEGER The table ID assigned by Vertica.
FOREIGN_TABLE_ID INTEGER OID of the foreign table referenced in a
foreign key constraint (zero if not a foreign key
constraint).
CONSTRAINT_TYPE INTEGER Is one of 'c', 'f', 'p', 'u' or 'd' which refer to
'check', 'foreign', 'primary', 'unique' and
'determines', respectively.

Example
The following command returns constraint column names and types against the VMart schema.
SELECT constraint_name, constraint_type FROM table_constraints
ORDER BY constraint_type;

constraint_name | constraint_type
---------------------------+-----------------
fk_online_sales_promotion | f
fk_online_sales_warehouse | f
fk_online_sales_shipping | f
fk_online_sales_op | f
fk_online_sales_cc | f
fk_online_sales_customer | f
fk_online_sales_product | f
fk_online_sales_shipdate | f

-418-
SQL System Tables (Monitoring APIs)

fk_online_sales_saledate | f
fk_store_orders_employee | f
fk_store_orders_vendor | f
fk_store_orders_store | f
fk_store_orders_product | f
fk_store_sales_employee | f
fk_store_sales_customer | f
fk_store_sales_promotion | f
fk_store_sales_store | f
fk_store_sales_product | f
fk_store_sales_date | f
fk_inventory_warehouse | f
fk_inventory_product | f
fk_inventory_date | f
- | p
- | p
- | p
- | p
- | p
- | p
- | p
- | p
- | p
- | p
- | p
(33 rows)

See Also
Adding Constraints in the Administrator's Guide
ANALYZE_CONSTRAINTS (page 241)

TABLES
Provides information about all tables in the database.

Column Name Data Type Description


TABLE_SCHEMA_ID INTEGER The schema ID from the catalog.
TABLE_SCHEMA VARCHAR The name of the schema
TABLE_ID INTEGER The unique table OID assigned by the Vertica catalog
TABLE_NAME VARCHAR The name of the table
OWNER_ID INTEGER The owner ID from the catalog.
OWNER_NAME VARCHAR The name of the user who created the table.
IS_SYSTEM_TABLE BOOLEAN Is 'f' for user-created tables, 't' for Vertica system tables"
SYSTEM_TABLE_CREATOR VARCHAR The name of the process that creates the table, such as
Designer.

Notes
The TABLE_SCHEMA and TABLE_NAME columns are case sensitive when executing queries
that use the equality (=) predicate. Use the ILIKE predicate instead:

-419-
SQL Reference Manual

SELECT table_schema, table_name FROM v_catalog.tables WHERE table_schema ILIKE


'schema1';

Example
The following command returns information on all tables in the Vmart schema:
SELECT table_schema, table_name, owner_name, is_system_table
FROM TABLES;
table_schema | table_name | owner_name | is_system_table
--------------+-----------------------+------------+-----------------
public | customer_dimension | release | f
public | product_dimension | release | f
public | promotion_dimension | release | f
public | date_dimension | release | f
public | vendor_dimension | release | f
public | employee_dimension | release | f
public | shipping_dimension | release | f
public | warehouse_dimension | release | f
public | inventory_fact | release | f
store | store_dimension | release | f
store | store_sales_fact | release | f
store | store_orders_fact | release | f
online_sales | online_page_dimension | release | f
online_sales | call_center_dimension | release | f
online_sales | online_sales_fact | release | f
(15 rows)

TYPES
Provides information about supported data types.

Column Name Data Type Description


TYPE_ID INTEGER The ID assigned to a specific data type.
TYPE_NAME VARCHAR The data type name.

Example
SELECT * FROM types;
type_id | type_name
---------+-------------
5 | Boolean
6 | Integer
7 | Float
8 | Char
9 | Varchar
10 | Date
11 | Time
12 | Timestamp
13 | TimestampTz

-420-
SQL System Tables (Monitoring APIs)

14 | Interval
15 | TimeTz
16 | Numeric
17 | Varbinary
117 | Binary
(14 rows)

USERS
Provides information about all users in the database.

Column Name Data Type Description


USER_ID INTEGER The user ID assigned by Vertica.
USER_NAME VARCHAR The name of the user.
IS_SUPER_USER VARCHAR Indicates whether the current user is superuser, where t is true
and f is false.

Example
SELECT * FROM users;
user_id | user_name | is_super_user
-------------------+-----------+---------------
45035996273704962 | dbadmin | t
45035996273767658 | ajax | f
(2 rows)

VIEW_COLUMNS
Provides view attribute information.

Column Name Data Type Description


TABLE_ID The unique table OID assigned by the Vertica catalog of the
view
TABLE_SCHEMA VARCHAR The name of the schema
TABLE_NAME VARCHAR The name of the table
COLUMN_NAME VARCHAR The name of the column being returned
DATA_TYPE VARCHAR The data type of the column being returned
DATA_TYPE_DESCRIPTION VARCHAR A description of the data type; for example, NUMERIC(10,2)
DATA_TYPE_ID INTEGER The data type unique ID assigned by the Vertica catalog
DATA_TYPE_LENGTH INTEGER The maximum allowable length for the data type
CHARACTER_MAXIMUM_LENG INTEGER The maximum allowable length for the column, valid for
TH character types
NUMERIC_PRECISION INTEGER The number of significant decimal digits

-421-
SQL Reference Manual

NUMERIC_SCALE INTEGER The number of fractional digits


DATETIME_PRECISION INTEGER The number of fractional digits retained in the seconds field
INTERVAL_PRECISION INTEGER The number of fractional digits retained in the seconds field
ORDINAL_POSITION VARCHAR The position of the column

Example
SELECT * FROM view_columns;
The following is a small portion of the entire result set for illustration purposes. NULL fields
indicate that those columns were not defined.
-[ RECORD 5 ]------------+-------------------------
table_id | 45035996273833232
table_schema | public
table_name | myview
column_name | category_description
data_type | Char
data_type_description | Char(32)
data_type_id | 8
data_type_length | 32
character_maximum_length | 32
numeric_precision |
numeric_scale |
datetime_precision |
interval_precision |
ordinal_position | 5

VIEWS
Provides information about all views within the system. See Using Views for more information.

Column Name Data Type Description


TABLE_SCHEMA_ID INTEGER The schema ID assigned by Vertica.
TABLE_SCHEMA VARCHA The name of the schema that contains the view.
R
TABLE_ID INTEGER The table ID assigned by Vertica.
TABLE_NAME VARCHA The table name.
R
OWNER_ID INTEGER The owner ID assigned by Vertica.
OWNER_NAME VARCHA The name of the view owner.
R
VIEW_DEFINITION VARCHA The query used to define the view.
R
IS_SYSTEM_VIEW VARCHA Indicates whether the table is a system view,
R where t is true and f is false.

-422-
SQL System Tables (Monitoring APIs)

SYSTEM_VIEW_CREATOR VARCHA The user name who created the view.


R

Example
Call the VIEWS table:
SELECT * FROM VIEWS;
-[ RECORD 1 ]-------+--------------------------------------------------------------
table_schema_id | 45035996273704963
table_schema | public
table_id | 45035996273823130
table_name | temp
owner_id | 45035996273704961
owner_name | release
view_definition | SELECT to_date('F'::character varying, 'dd mm yyyy'::character
varying) AS to_date FROM public.customer_dimension
is_system_view | f
system_view_creator |

SYSTEM_TABLES
Displays a list of all system table names.

Column Name Data Type Description


TABLE_NAME VARCHAR The name of the table
TABLE_DESCRIPTION VARCHAR A description of the system table's purpose.

Example
Call all the system tables:
SELECT * FROM system_tables;
table_schema | table_name | table_description
--------------+---------------------------+-------------------------------------------------------
----------------------
v_catalog | columns | Table column information
v_catalog | foreign_keys | Foreign key information
v_catalog | grants | Grant information
v_catalog | primary_keys | Primary key information
v_catalog | projections | Projection information
v_catalog | system_tables | Displays a list of all system tables except internal ones
v_catalog | table_constraints | Constraint information
v_catalog | tables | Table information
v_catalog | types | Information about supported data types
v_catalog | users | User information
v_catalog | view_columns | View attribute information
v_catalog | views | View information
v_monitor | active_events | Displays all of the active events in the cluster
v_monitor | column_storage | Information on amount of disk storage used by each
column/projection/node
v_monitor | current_session | Information on current Session
v_monitor | disk_resource_rejections | Disk resource rejection summarizations
v_monitor | disk_storage | Disk usage information
v_monitor | event_configurations | Current event configuration
v_monitor | execution_engine_profiles | Per EE operator profiling information
v_monitor | host_resources | Per host profiling information
v_monitor | load_streams | Load metrics for each load stream on each node
v_monitor | local_nodes | Local node information

-423-
SQL Reference Manual

v_monitor | locks | Lock grants and requests for all nodes


v_monitor | node_resources | Per node profiling information
v_monitor | partitions | Partition metadata
v_monitor | projection_refreshes | Refresh information on each projection
v_monitor | projection_storage | Storage information on each projection
v_monitor | query_metrics | Summarized query information
v_monitor | query_profiles | Query profiling
v_monitor | resource_rejections | Resource rejection summarizations
v_monitor | resource_usage | Resource usage information
v_monitor | session_profiles | Per session profiling information
v_monitor | sessions | Information on each session
v_monitor | storage_containers | Information on each storage container
v_monitor | system | System level information
v_monitor | tuple_mover_operations | Information about (automatic) Tuple Mover
v_monitor | wos_container_storage | Storage information on WOS allocator
(38 rows)

V_MONITOR Schema
The system tables in this section reside in the v_monitor schema.

ACTIVE_EVENTS
Displays all active events in the cluster. See Monitoring Events.

Column Name Data Type Description


CURRENT_TIMESTAMP VARCHA The Linux system time of query execution in a
R format that can be used as a Date/Time
Expression (page 76).
NODE_NAME VARCHA The name of the node that is reporting the
R requested information.
EVENT_CODE INTEGER A numeric ID that indicates the type of event. See
Event Types for a list of event type codes.
EVENT_ID INTEGER A unique numeric ID that identifies the specific
event.
EVENT_SEVERITY VARCHA The severity of the event from highest to lowest.
R These events are based on standard syslog severity
types.
0—Emergency
1—Alert
2—Critical
3—Error
4—Warning
5—Notice
6—Informational
7—Debug
EVENT_POSTED_TIMESTAMP VARCHA The year, month, day, and time the event was
R reported. The time is posted in military time.
EVENT_EXPIRATION VARCHA The year, month, day, and time the event expire.

-424-
SQL System Tables (Monitoring APIs)

R The time is posted in military time. If the cause of


the event is still active, the event is posted again.
EVENT_CODE_DESCRIPTION VARCHA A brief description of the event and details pertinent
R to the specific situation.
EVENT_PROBLEM_DESCRIPT VARCHA A generic description of the event.
ION R
REPORTING_NODE VARCHA The name of the node within the cluster that
R reported the event.
EVENT_SENT_TO_CHANNELS VARCHA The event logging mechanisms that are configured
R for Vertica. These can include vertica.log,
(configured by default) syslog, and SNMP.
EVENT_POSTED_COUNT INTEGER Tracks the number of times an event occurs. Rather
than posting the same event multiple times, Vertica
posts the event once and then counts the number of
additional instances in which the event occurs.

Example
Call the ACTIVE_EVENTS table:
SELECT * FROM active_events;
-[ RECORD 1 ]-------------+-----------------------------------------
current_timestamp | 2009-08-11 14:38:18.083285
node_name | site01
event_code | 6
event_id | 6
event_severity | Informational
is_event_posted | 2009-08-11 09:38:39.008458
event_expiration | 2077-08-29 11:52:46.008458
event_code_description | Node State Change
event_problem_description | Changing node site01 startup state to UP
reporting_node | site01
event_sent_to_channels | Vertica Log
event_posted_count | 1
-[ RECORD 2 ]-------------+-----------------------------------------
current_timestamp | 2009-08-11 14:38:34.226377
node_name | site02
event_code | 6
event_id | 6
event_severity | Informational
is_event_posted | 2009-08-11 09:38:39.018172
event_expiration | 2077-08-29 11:52:46.018172
event_code_description | Node State Change
event_problem_description | Changing node site02 startup state to UP
reporting_node | site02
event_sent_to_channels | Vertica Log
event_posted_count | 1
-[ RECORD 3 ]-------------+-----------------------------------------
current_timestamp | 2009-08-11 14:38:48.859987
node_name | site03
event_code | 6
event_id | 6

-425-
SQL Reference Manual

event_severity | Informational
is_event_posted | 2009-08-11 09:38:39.027258
event_expiration | 2077-08-29 11:52:46.027258
event_code_description | Node State Change
event_problem_description | Changing node site03 startup state to UP
reporting_node | site03
event_sent_to_channels | Vertica Log
event_posted_count | 1
-[ RECORD 4 ]-------------+-----------------------------------------
current_timestamp | 2009-08-11 14:39:04.226379
node_name | site04
event_code | 6
event_id | 6
event_severity | Informational
is_event_posted | 2009-08-11 09:38:39.008288
event_expiration | 2077-08-29 11:52:46.008288
event_code_description | Node State Change
event_problem_description | Changing node site04 startup state to UP
reporting_node | site04
event_sent_to_channels | Vertica Log
event_posted_count | 1

COLUMN_STORAGE
Returns the amount of disk storage used by each column of each projection on each node.

Column Name Data Type Description


CURRENT_TIMESTAMP VARCHAR The Linux system time of query execution in a
format that can be used as a Date/Time
Expression (page 76).
NODE_NAME VARCHAR The name of the node that is reporting the
requested information.
COLUMN_NAME VARCHAR A projection column name.
ROW_COUNT INTEGER The number of rows in the column (cardinality).
USED_BYTES INTEGER The disk storage allocation of the column in
bytes.
WOS_ROW_COUNT INTEGER The number of WOS rows in the column.
ROS_ROW_COUNT INTEGER The number of ROS rows in the column.
ROS_USED_BYTES INTEGER The number of ROS bytes in the column.
ROS_COUNT INTEGER The number of ROS containers.
PROJECTION_NAME VARCHAR The associated projection name.
PROJECTION_SCHEMA VARCHAR The name of the schema associated with the
projection.

-426-
SQL System Tables (Monitoring APIs)

ANCHOR_TABLE_NAME VARCHAR The associated table name.


ANCHOR_TABLE_SCHEMA VARCHAR The associated table's schema name.

Notes
WOS data is stored by row, so per-column byte counts are not available.

Example
Call the COLUMN_STORAGE table:
SELECT * FROM COLUMN_STORAGE;
-[ RECORD 1 ]-------+---------------------------------------
current_timestamp | 2009-08-11 14:40:30.549209
node_name | site01
column_name | call_center_key
row_count | 200
used_bytes | 277
wos_row_count | 0
ros_row_count | 200
ros_used_bytes | 277
ros_count | 1
projection_name | call_center_dimension_site01
projection_schema | online_sales
anchor_table_name | call_center_dimension
anchor_table_schema | online_sales
-[ RECORD 2 ]-------+---------------------------------------
current_timestamp | 2009-08-11 14:40:58.12043
node_name | site01
column_name | cc_address
row_count | 200
used_bytes | 1402
wos_row_count | 0
ros_row_count | 200
ros_used_bytes | 1402
ros_count | 1
projection_name | call_center_dimension_site01
projection_schema | online_sales
anchor_table_name | call_center_dimension
anchor_table_schema | online_sales
-[ RECORD 3 ]-------+---------------------------------------
current_timestamp | 2009-08-11 14:41:11.298898
node_name | site01
column_name | cc_city
row_count | 200
used_bytes | 1196
wos_row_count | 0
ros_row_count | 200
ros_used_bytes | 1196
ros_count | 1
projection_name | call_center_dimension_site01
projection_schema | online_sales
anchor_table_name | call_center_dimension
anchor_table_schema | online_sales
-[ RECORD 4 ]-------+---------------------------------------
...

Call specific columns from the COLUMN_STORAGE table:


SELECT column_name, row_count, projection_name, anchor_table_name
FROM COLUMN_STORAGE
WHERE node_name = 'site02' AND row_count = 1000;

column_name | row_count | projection_name | anchor_table_name

-427-
SQL Reference Manual

----------------------+-----------+------------------------------+-----------------------
end_date | 1000 | online_page_dimension_site02 | online_page_dimension
epoch | 1000 | online_page_dimension_site02 | online_page_dimension
online_page_key | 1000 | online_page_dimension_site02 | online_page_dimension
page_description | 1000 | online_page_dimension_site02 | online_page_dimension
page_number | 1000 | online_page_dimension_site02 | online_page_dimension
page_type | 1000 | online_page_dimension_site02 | online_page_dimension
start_date | 1000 | online_page_dimension_site02 | online_page_dimension
ad_media_name | 1000 | promotion_dimension_site02 | promotion_dimension
ad_type | 1000 | promotion_dimension_site02 | promotion_dimension
coupon_type | 1000 | promotion_dimension_site02 | promotion_dimension
display_provider | 1000 | promotion_dimension_site02 | promotion_dimension
display_type | 1000 | promotion_dimension_site02 | promotion_dimension
epoch | 1000 | promotion_dimension_site02 | promotion_dimension
price_reduction_type | 1000 | promotion_dimension_site02 | promotion_dimension
promotion_begin_date | 1000 | promotion_dimension_site02 | promotion_dimension
promotion_cost | 1000 | promotion_dimension_site02 | promotion_dimension
promotion_end_date | 1000 | promotion_dimension_site02 | promotion_dimension
promotion_key | 1000 | promotion_dimension_site02 | promotion_dimension
promotion_media_type | 1000 | promotion_dimension_site02 | promotion_dimension
promotion_name | 1000 | promotion_dimension_site02 | promotion_dimension
20 rows)

CURRENT_SESSION
Returns information about the current active session. You can use this table to find out the
current session's sessionID and get the duration of the previously-run query.

Column Name Data Type Description


CURRENT_TIMESTAMP VARCHAR The Linux system time of query execution in a format
that can be used as a Date/Time Expression (page
76).
NODE_NAME VARCHAR The name of the node that is reporting the requested
information.
USER_NAME VARCHAR The name used to log into the database or NULL if the
session is internal
CLIENT_HOSTNAME VARCHAR The host name and port of the TCP socket from which
the client connection was made; NULL if the session is
internal
LOGIN_TIMESTAMP TIMESTAMP The date and time the user logged into the database or
when the internal session was created. This column
can be useful for identifying sessions that have been
left open and could be idle.
SESSION_ID VARCHAR The identifier required to close or interrupt a session.
This identifier is unique within the cluster at any point in
time but can be reused when the session closes.
TRANSACTION_START TIMESTAMP The date/time the current transaction started or NULL if
no transaction is running.
TRANSACTION_ID VARCHAR A string containing the hexadecimal representation of
the transaction ID, if any; otherwise NULL.
TRANSACTION_DESCRIPTI VARCHAR A description of the current transaction.

-428-
SQL System Tables (Monitoring APIs)

ON
STATEMENT_START TIMESTAMP The date/time the current statement started execution,
or NULL if no statement is running.
STATEMENT_ID VARCHAR An ID for the currently executing statement. NULL
indicates that no statement is currently being
processed.
LAST_STATEMENT_DURATI INTEGER The duration of the last completed statement in
ON_US microseconds.
CURRENT_STATEMENT VARCHAR The currently executing statement, if any. NULL
otherwise
LAST_STATEMENT VARCHAR NULL if the user has just logged in; otherwise the
currently running statement or the most recently
completed statement.
EXECUTION_ENGINE_PROF VARCHAR Returns a value that indicates whether profiling is
ILING_CONFIGURATION turned on. Results are:
Empty when no profiling
'Local' when profiling on for this session
'Global' when on by default for all sessions
'Local, Global' when on by default for all sessions and
on for current session
QUERY_PROFILING_CONFI VARCHAR Returns a value that indicates whether profiling is
GURATION turned on; empty when no profiling, 'Local' when on for
this session, 'Global' when on by default for sessions
and 'Local, Global' when on by default for all sessions
and on for current session.
SESSION_PROFILING_CON VARCHAR Returns a value that indicates whether profiling is
FIGURATION turned on; empty when no profiling, 'Local' when on for
this session, 'Global' when on by default for sessions
and 'Local, Global' when on by default for all sessions
and on for current session.

Notes
• The default for profiling is ON ('1') for all sessions. Each session can turn profiling ON or
OFF.
• Profiling parameters (such as GlobalEEProfiling in the examples below) are set in the Vertica
configuration file (vertica.conf). To turn profiling off, set the parameter to '0'. To turn profiling
on, set the parameter to '1'.

Examples
Call the CURRENT_SESSION table:
-[ RECORD 1 ]----------------------------+---------------------------------------------
current_timestamp | 2009-08-11 14:44:25.719724
node_name | site01
user_name | release
client_hostname | 127.0.0.1:36674
login_timestamp | 2009-08-11 13:37:59.486908
session_id | fc10-1-16482:0x7586

-429-
SQL Reference Manual

transaction_start | 2009-08-11 14:23:15.014816


transaction_id | 0xa000000001a440
transaction_description | user release (select * from node_resources;)
statement_start | 2009-08-11 14:44:25.709802
statement_id | 42949673011
last_statement_duration_sec | 43865
current_statement | select * from current_session;
last_statement | select * from column_storage;
execution_engine_profiling_configuration | Global
query_profiling_configuration |
session_profiling_configuration |

Request specific columns from the table:


SELECT node_name, session_id, execution_engine_profiling_configuration
FROM CURRENT_SESSION;
node_name | session_id | execution_engine_profiling_configuration
-----------+---------------------+------------------------------------------
site01 | fc10-1-16482:0x2523 | Global
(1 row)
The sequence of commands in this example shows the use of disabling and enabling profiling for
local and global sessions.
This command disables EE profiling for query execution runs:
SELECT disable_profiling('EE');
disable_profiling
-----------------------
EE Profiling Disabled
(1 row)
The following command sets the GlobalEEProfiling configuration parameter to 0, which turns off
profiling:
SELECT set_config_parameter('GlobalEEProfiling', '0');
set_config_parameter
----------------------------
Parameter set successfully
(1 row)
The following command tells you whether profiling is set to 'Local' or 'Global' or none:
SELECT execution_engine_profiling_configuration
FROM CURRENT_SESSION;
ee_profiling_config
---------------------

(1 row)
Note: The result set is empty because profiling was turned off in the preceding example.
This command now enables EE profiling for query execution runs:
SELECT enable_profiling('EE');
enable_profiling
----------------------
EE Profiling Enabled
(1 row)

-430-
SQL System Tables (Monitoring APIs)

Now when you run a select on the CURRENT_SESSION table, you can see profiling is ON for
the local session:
SELECT execution_engine_profiling_configuration
FROM CURRENT_SESSION;
ee_profiling_config
---------------------
Local
(1 row)
Now turn profiling on for all sessions by setting the GlobalEEProfiling configuration parameter to
1:
SELECT set_config_parameter('GlobalEEProfiling', '1');
set_config_parameter
----------------------------
Parameter set successfully
(1 row)
Now when you run a select on the CURRENT_SESSION table, you can see profiling is ON for
the local sessions, as well as for all sessions:
SELECT execution_engine_profiling_configuration
FROM CURRENT_SESSION;
ee_profiling_config
---------------------
Local, Global
(1 row)

See Also
EXECUTION_ENGINE_PROFILES (page 436), QUERY_PROFILES (page 450), and
SESSION_PROFILES (page 457)

DISK_RESOURCE_REJECTIONS
Returns requests for resources that are rejected due to disk space shortages.

Column Name Data Type Description


CURRENT_TIMESTAMP VARCHAR The Linux system time of query execution
in a format that can be used as a
Date/Time Expression (page 76).
NODE_NAME VARCHAR The name of the node that is reporting
the requested information.
ACCUMULATION_START VARCHAR The time of first request rejection for this
requester.
REQUEST_TYPE VARCHAR The resource request requester
(example: plan type).
DISK_SPACE_REJECTED_COUNT INTEGER The total number of disk space resource
requests rejected.

-431-
SQL Reference Manual

FAILED_VOLUME_REJECTED_CO INTEGER The total number of disk space resource


UNT requests on a failed volume.

Example
The result of no rows in the following example means that there were no disk rejections:
SELECT node_name, request_type, disk_space_rejected_count, failed_volume_rejected_count
FROM DISK_RESOURCE_REJECTIONS;
node_name | request_type | disk_space_rejected_count | failed_volume_rejected_count
-----------+--------------+---------------------------+------------------------------

(0 rows)

DISK_STORAGE
Returns the amount of disk storage used by the database on each node.

Column Name Date Type Description


CURRENT_TIMESTAMP VARCHAR The Linux system time of query execution in a
format that can be used as a Date/Time
Expression (page 76).
NODE_NAME VARCHAR The name of the node that is reporting the
requested information.
STORAGE_PATH VARCHAR The path where the storage location is mounted.

STORAGE_USAGE VARCHAR The type of information stored in the location:


DATA: Only data is stored in the location.
TEMP: Only temporary files that are created
during loads or queries are stored in the
location.
DATA,TEMP: Both types of files are stored in
the location.
RANK INTEGER The rank assigned to the storage location based
on its performance. Ranks are used to create a
tiered disk architecture in which projections,
columns, and partitions are stored on different
disks based on predicted or measured access
patterns. See Creating and Configuring Storage
Locations in the Administrator's Guide.
THROUGHPUT INTEGER The measure of a storage location's
performance in MB/sec. 1/throughput is the time
taken to read 1MB of data.
LATENCY INTEGER The measure of a storage location's
performance in seeks/sec. 1/latency is the time
taken to seek to the data.
STORAGE_STATUS VARCHAR The status of the storage location: active or
retired.

-432-
SQL System Tables (Monitoring APIs)

DISK_BLOCK_SIZE_BYTES INTEGER The block size of the disk in bytes.

DISK_SPACE_USED_BLOCKS INTEGER The number of disk blocks in use.

DISK_SPACE_USED_MB INTEGER The number of megabytes of disk storage in


use.
DISK_SPACE_FREE_BLOCKS INTEGER The number of free disk blocks available.

DISK_SPACE_FREE_MB INTEGER The number of megabytes of free storage


available.
DISK_SPACE_FREE_PERCEN INTEGER The percentage of free disk space remaining.
T

Notes
• A storage location's performance is measured in throughput in MB/sec and latency in
seeks/sec. These two values are converted to single number(Speed) with the following
formula:
• ReadTime (time to read 1MB) = 1/throughput + 1 / latency
§ 1/throughput is the time taken to read 1MB of data
§ 1/latency is the time taken to seek to the data.
§ ReadTime is the time taken to read 1MB of data.
• A disk is faster than another disk if its ReadTime is less.
• There can be multiple storage locations per node, and these locations can be on different
disks with different free/used space, block size, etc. This information is useful in letting you
know where the data files reside.

Example
Call the DISK_STORAGE table:
SELECT * FROM DISK_STORAGE;
-[ RECORD 1 ]-----------+---------------------------------------------
current_timestamp | 2009-08-11 14:48:35.932541
node_name | site01
storage_path | /mydb/site01_catalog/Catalog
storage_usage | CATALOG
rank | 0
throughput | 0
latency | 0
storage_status | Active
disk_block_size_bytes | 4096
disk_space_used_blocks | 34708721
disk_space_used_mb | 135581
disk_space_free_blocks | 178816678
disk_space_free_mb | 698502
disk_space_free_percent | 83%
-[ RECORD 2 ]-----------+---------------------------------------------
current_timestamp | 2009-08-11 14:48:53.884255
node_name | site01
storage_path | /mydb/site01_data
storage_usage | DATA,TEMP
rank | 0
throughput | 0
latency | 0
storage_status | Active

-433-
SQL Reference Manual

disk_block_size_bytes | 4096
disk_space_used_blocks | 34708721
disk_space_used_mb | 135581
disk_space_free_blocks | 178816678
disk_space_free_mb | 698502
disk_space_free_percent | 83%
-[ RECORD 3 ]-----------+---------------------------------------------
current_timestamp | 2009-08-11 14:49:08.299012
node_name | site02
storage_path | /mydb/site02_catalog/Catalog
storage_usage | CATALOG
rank | 0
throughput | 0
latency | 0
storage_status | Active
disk_block_size_bytes | 4096
disk_space_used_blocks | 19968349
disk_space_used_mb | 78001
disk_space_free_blocks | 193557050
disk_space_free_mb | 756082
disk_space_free_percent | 90%
-[ RECORD 4 ]-----------+---------------------------------------------
current_timestamp | 2009-08-11 14:49:22.696772
node_name | site02
storage_path | /mydb/site02_data
storage_usage | DATA,TEMP
rank | 0
throughput | 0
latency | 0
storage_status | Active
disk_block_size_bytes | 4096
disk_space_used_blocks | 19968349
disk_space_used_mb | 78001
disk_space_free_blocks | 193557050
disk_space_free_mb | 756082
disk_space_free_percent | 90%
-[ RECORD 5 ]-----------+---------------------------------------------
current_timestamp | 2009-08-11 14:50:03.960157
node_name | site03
storage_path | /mydb/site03_catalog/Catalog
storage_usage | CATALOG
rank | 0
throughput | 0
latency | 0
storage_status | Active
disk_block_size_bytes | 4096
disk_space_used_blocks | 19902595
disk_space_used_mb | 77744
disk_space_free_blocks | 193622804
disk_space_free_mb | 756339
disk_space_free_percent | 90%
-[ RECORD 6 ]-----------+---------------------------------------------
current_timestamp | 2009-08-11 14:50:27.415735
node_name | site03
storage_path | /mydb/site03_data
storage_usage | DATA,TEMP
rank | 0
throughput | 0
latency | 0
storage_status | Active
disk_block_size_bytes | 4096
disk_space_used_blocks | 19902595
disk_space_used_mb | 77744
disk_space_free_blocks | 193622804
disk_space_free_mb | 756339
disk_space_free_percent | 90%
-[ RECORD 7 ]-----------+---------------------------------------------
current_timestamp | 2009-08-11 14:50:39.398879

-434-
SQL System Tables (Monitoring APIs)

node_name | site04
storage_path | /mydb/site04_catalog/Catalog
storage_usage | CATALOG
rank | 0
throughput | 0
latency | 0
storage_status | Active
disk_block_size_bytes | 4096
disk_space_used_blocks | 19972309
disk_space_used_mb | 78017
disk_space_free_blocks | 193553090
disk_space_free_mb | 756066
disk_space_free_percent | 90%
-[ RECORD 8 ]-----------+---------------------------------------------
current_timestamp | 2009-08-11 14:50:57.879302
node_name | site04
storage_path | /mydb/site04_data
storage_usage | DATA,TEMP
rank | 0
throughput | 0
latency | 0
storage_status | Active
disk_block_size_bytes | 4096
disk_space_used_blocks | 19972309
disk_space_used_mb | 78017
disk_space_free_blocks | 193553090
disk_space_free_mb | 756066
disk_space_free_percent | 90%

Request only specific columns from the table:


SELECT node_name, storage_path, storage_status, disk_space_free_percent
FROM disk_storage;

node_name | storage_path | storage_status | disk_space_free_percent


-----------+------------------------------+----------------+-------------------------
site01 | /mydb/site01_catalog/Catalog | Active | 83%
site01 | /mydb/site01_data | Active | 83%
site02 | /mydb/site02_catalog/Catalog | Active | 90%
site02 | /mydb/site02_data | Active | 90%
site03 | /mydb/site03_catalog/Catalog | Active | 90%
site03 | /mydb/site03_data | Active | 90%
site04 | /mydb/site04_catalog/Catalog | Active | 90%
site04 | /mydb/site04_data | Active | 90%
(8 rows)

EVENT_CONFIGURATIONS
Monitors the configuration of events.

Column Name Date Type Description


EVENT_ID VARCHAR The name of the event.

EVENT_DELIVERY_CHANNELS VARCHAR The delivery channel on which the event


occurred.

Example
SELECT * FROM event_configurations;
event_id | event_delivery_channels
-------------------------------------------+-------------------------

-435-
SQL Reference Manual

Low Disk Space | Vertica Log, SNMP Trap


Read Only File System | Vertica Log, SNMP Trap
Loss Of K Safety | Vertica Log, SNMP Trap
Current Fault Tolerance at Critical Level | Vertica Log, SNMP Trap
Too Many ROS Containers | Vertica Log, SNMP Trap
WOS Over Flow | Vertica Log, SNMP Trap
Node State Change | Vertica Log, SNMP Trap
Recovery Failure | Vertica Log, SNMP Trap
Recovery Error | Vertica Log
Recovery Lock Error | Vertica Log
Recovery Projection Retrieval Error | Vertica Log
Refresh Error | Vertica Log
Refresh Lock Error | Vertica Log
Tuple Mover Error | Vertica Log
Timer Service Task Error | Vertica Log
(15 rows)

EXECUTION_ENGINE_PROFILES
Provides information regarding query execution runs. To obtain information about query
execution runs for your database, see Profiling Database Performance.

Column Name Data Type Description


CURRENT_TIMESTAMP VARCHA The Linux system time of query execution in a
R format that can be used as a Date/Time
Expression (page 76).
NODE_NAME VARCHA The name of the node that is reporting the
R requested information.
SESSION_ID VARCHA The identification of the session for which profiling
R information is captured. This identifier is unique
within the cluster at any point in time but can be
reused when the session closes.
TRANSACTION_ID INTEGER An identifier for the transaction within the session if
any; otherwise NULL.
STATEMENT_ID INTEGER An ID for the currently executing statement. NULL
indicates that
no statement is currently being processed.
OPERATOR_NAME VARCHA The name of the user who started the session.
R
OPERATOR_ID INTEGER The ID of the user who started the session.
COUNTER_NAME VARCHA The name of the counter. See COUNTER_NAME
R Values below.
COUNTER_VALUE INTEGER The value of the counter.

COUNTER_NAME Values
The value of COUNTER_NAME can be any of the following:

-436-
SQL System Tables (Monitoring APIs)

COUNTER_NAME Description
bytes sent The number of bytes sent over the network for the query
execution.
bytes received The number of bytes received over the network for the query
execution.
rows produced The number of rows produced by the EE operator.
executable time (ms) The time required to execute the query (in milliseconds).

Example
SELECT operator_name, operator_id, counter_name, counter_value
FROM EXECUTION_ENGINE_PROFILES
WHERE operator_name = 'CopyNode'
ORDER BY counter_value DESC;

operator_name | operator_id | counter_name | counter_value


---------------+-------------+------------------------+---------------
CopyNode | 3 | rows produced | 219
CopyNode | 1 | rows produced | 95
CopyNode | 1 | rows produced | 53
CopyNode | 1 | rows produced | 34
CopyNode | 1 | rows produced | 20
CopyNode | 2 | rows produced | 15
CopyNode | 1 | execution time (us) | 1
CopyNode | 1 | execution time (us) | 1
CopyNode | 3 | execution time (us) | 1
CopyNode | 1 | execution time (us) | 0
CopyNode | 1 | output queue wait (us) | 0
CopyNode | 3 | output queue wait (us) | 0
CopyNode | 1 | output queue wait (us) | 0
CopyNode | 1 | execution time (us) | 0
CopyNode | 1 | output queue wait (us) | 0
CopyNode | 1 | output queue wait (us) | 0
CopyNode | 2 | execution time (us) | 0
CopyNode | 2 | output queue wait (us) | 0
(18 rows)

HOST_RESOURCES
Provides a snapshot of the node. This is useful for regularly polling the node with automated
tools or scripts.

-437-
SQL Reference Manual

Column Name Data Type Description


CURRENT_TIMESTAMP VARCHAR The Linux system time of query execution in a format
that can be used as a Date/Time Expression (page
76).
HOST_NAME VARCHAR The name of the node that is reporting the requested
information.
OPEN_FILES_LIMIT INTEGER The maximum number of files that can be open at
one time on the node.
THREADS_LIMIT INTEGER The maximum number of threads that can coexist on
the node.
CORE_FILE_LIMIT_MAX_SIZE INTEGER The maximum core file size allowed on the node.
_BYTES
PROCESSOR_COUNT INTEGER The number of system processors.
PROCESSOR_CORE_COUNT INTEGER The number of processor cores in the system.
PROCESSOR_DESCRIPTION VARCHAR A description of the processor. For example: Inter(R)
Core(TM)2 Duo CPU T8100 @2.10GHz (1 row)
OPENED_FILE_COUNT INTEGER The total number of open files on the node.
OPENED_SOCKET_COUNT INTEGER The total number of open sockets on the node.
OPENED_NONFILE_NONSOCKET INTEGER The total number of other file descriptions open in
_COUNT which other could be a directory or FIFO. It is not an
open file or socket.
TOTAL_MEMORY_BYTES INTEGER The total amount of physical RAM, in bytes, available
on the system.
TOTAL_MEMORY_FREE_BYTES INTEGER The amount of physical RAM, in bytes, left unused by
the system.
TOTAL_BUFFER_MEMORY_BYTE INTEGER The amount of physical RAM, in bytes, used for file
S buffers on the system
TOTAL_MEMORY_CACHE_BYTES INTEGER The amount of physical RAM, in bytes, used as
cache memory on the system.
TOTAL_SWAP_MEMORY_BYTES INTEGER The total amount of swap memory available, in bytes,
on the system.
TOTAL_SWAP_MEMORY_FREE_B INTEGER The total amount of swap memory free, in bytes, on
YTES the system.
DISK_SPACE_FREE_MB INTEGER The free disk space available, in megabytes, for all
storage location file systems (data directories).
DISK_SPACE_USED_MB INTEGER The disk space used, in megabytes, for all storage
location file systems.
DISK_SPACE_TOTAL_MB INTEGER The total free disk space available, in megabytes, for
all storage location file systems.

-438-
SQL System Tables (Monitoring APIs)

Examples
Call the HOST_RESOURCES table:
SELECT * FROM HOST_RESOURCES;
-[ RECORD 1 ]------------------+-----------------------------
current_timestamp | 2009-09-01 18:20:06.952211
host_name | fc10-1.vertica.com
open_files_limit | 65536
threads_limit | 139264
core_file_limit_max_size_bytes | 4096872448
processor_count | 1
processor_core_count | 4
processor_description | Intel(R) Core(TM)2 Quad CPU
Q6600 @ 2.40GHz
opened_file_count | 5
opened_socket_count | 5
opened_nonfile_nonsocket_count | 3
total_memory_bytes | 8393220096
total_memory_free_bytes | 4652384256
total_buffer_memory_bytes | 664379392
total_memory_cache_bytes | 1128968192
total_swap_memory_bytes | 4293586944
total_swap_memory_free_bytes | 4280012800
disk_space_free_mb | 664355
disk_space_used_mb | 169728
disk_space_total_mb | 834083
-[ RECORD 2 ]------------------+-----------------------------
current_timestamp | 2009-09-01 18:20:06.951431
host_name | fc10-2.vertica.com
open_files_limit | 65536
threads_limit | 139264
core_file_limit_max_size_bytes | 4096872448
processor_count | 1
processor_core_count | 4
processor_description | Intel(R) Core(TM)2 Quad CPU
Q6600 @ 2.40GHz
opened_file_count | 5
opened_socket_count | 4
opened_nonfile_nonsocket_count | 3
total_memory_bytes | 8393220096
total_memory_free_bytes | 4610568192
total_buffer_memory_bytes | 2133594112
total_memory_cache_bytes | 870674432
total_swap_memory_bytes | 4293586944
total_swap_memory_free_bytes | 4288507904
disk_space_free_mb | 756833
disk_space_used_mb | 77250
disk_space_total_mb | 834083
-[ RECORD 3 ]------------------+-----------------------------
current_timestamp | 2009-09-01 18:20:06.961542
host_name | fc10-3.vertica.com
open_files_limit | 65536
threads_limit | 139264
core_file_limit_max_size_bytes | 4096872448
processor_count | 1
processor_core_count | 4
processor_description | Intel(R) Core(TM)2 Quad CPU
Q6600 @ 2.40GHz
opened_file_count | 5
opened_socket_count | 4
opened_nonfile_nonsocket_count | 3
total_memory_bytes | 8393220096
total_memory_free_bytes | 4144705536
total_buffer_memory_bytes | 2606940160
total_memory_cache_bytes | 818905088

-439-
SQL Reference Manual

total_swap_memory_bytes | 4293586944
total_swap_memory_free_bytes | 4283015168
disk_space_free_mb | 756955
disk_space_used_mb | 77128
disk_space_total_mb | 834083
-[ RECORD 4 ]------------------+-----------------------------
current_timestamp | 2009-09-01 18:20:06.957279
host_name | fc10-4.vertica.com
open_files_limit | 65536
threads_limit | 139264
core_file_limit_max_size_bytes | 4096872448
processor_count | 1
processor_core_count | 4
processor_description | Intel(R) Core(TM)2 Quad CPU
Q6600 @ 2.40GHz
opened_file_count | 5
opened_socket_count | 4
opened_nonfile_nonsocket_count | 3
total_memory_bytes | 8393220096
total_memory_free_bytes | 4375576576
total_buffer_memory_bytes | 2455642112
total_memory_cache_bytes | 764538880
total_swap_memory_bytes | 4293586944
total_swap_memory_free_bytes | 4292378624
disk_space_free_mb | 756898
disk_space_used_mb | 77185
disk_space_total_mb | 834083

LOAD_STREAMS
Monitors load metrics for each load stream on each node.

Column Name Date Type Description


CURRENT_TIMESTAMP VARCHAR The Linux system time of query execution in
a format that can be used as a Date/Time
Expression (page 76).
STREAM_NAME VARCHAR The optional identifier that names a stream,
if specified.
TABLE_NAME VARCHAR The name of the table being loaded.
LOAD_START VARCHAR The Linux system time when the load
started.
ACCEPTED_ROW_COUNT INTEGER The number of rows loaded.
REJECTED_ROW_COUNT INTEGER The number of rows rejected.
READ_BYTES INTEGER The number of bytes read from the input
file.
INPUT_FILE_SIZE_BYTES INTEGER The size of the input file in bytes.
Note: When using STDIN as input size of
input file size is zero (0).
PARSE_COMPLETE_PERCENT INTEGER The percent of the rows in the input file that
have been loaded. If using STDIN, this
column remains at zero (0) until the COPY
statement is complete.

-440-
SQL System Tables (Monitoring APIs)

UNSORTED_ROW_COUNT INTEGER The number of rows that have not been


sorted.
SORTED_ROW_COUNT INTEGER The number of rows that have been sorted.
SORT_COMPLETE_PERCENT INTEGER The percent of the rows in the input file that
have been sorted.

Notes
If a COPY ... DIRECT is in progress, the ACCEPTED_ROW_COUNT field could increase up to
the maximum number of rows in the input file as the rows are being parsed. However,
PARSE_PERCENT_COMPLETE stays at 0 until the COPY operation has finished sorting,
compressing and writing the data to disk. This can take a significant amount of time and it is
easy to mistake this state as a hang. Check your system CPU and disk accesses to determine if
any activity is in progress before canceling the COPY or reporting a hang.

Example
Call the LOAD_STREAMS table:
SELECT * FROM load_streams

LOCAL_NODES
Monitors the status of local nodes in the cluster.

Column Name Data Type Description


NODE_ID INTEGER The node ID assigned by Vertica.
SHUTDOWN_EPOCH INTEGER The shutdown epoch number.
BACKUP_SHUTDOWN_EPOCH INTEGER The backup-shutdown epoch number.

Example
SELECT * FROM local_nodes;
node_id | shutdown_epoch | backup_shutdown_epoch
-------------------+----------------+-----------------------
45035996273704971 | | 960
(1 row)

LOCKS
Monitors lock grants and requests for all nodes.

-441-
SQL Reference Manual

Column Name Date Type Description


NODE_NAMES VARCHAR The nodes on which lock interaction occurs.
Note on node rollup: If a transaction has the same
lock in the same mode in the same scope on multiple
nodes, it gets one (1) line in the table. NODE_NAMES
are separated by commas
OBJECT_NAME VARCHAR Name of object being locked; can be a TABLE or an
internal structure (projection, global catalog, local
catalog, Tuple Mover, epoch map)
OBJECT_ID INTEGER The unique OID assigned by the Vertica catalog of the
object being locked
TRANSACTION_DESCRIPTI VARCHAR ID of transaction and associated description, typically the
ON query that caused the transaction's creation
LOCK_MODE VARCHAR Lock mode describes the intended operations of the
transaction:
S — Share lock needed for select operations
I — Insert lock needed for insert operations
X — Exclusive lock is always needed for delete
operations. X lock is also the result of lock promotion
(see Table 2)
T — Tuple Mover lock used by the Tuple Mover and also
used for COPY into pre-join projections
LOCK_SCOPE VARCHAR Scope is the expected duration of the lock once it is
granted. Before the lock is granted, the scope is listed as
REQUESTED.
Once a lock has been granted, the following scopes are
possible:
STATEMENT_LOCALPLAN
STATEMENT_COMPILE
STATEMENT_EXECUTE
TRANSACTION_POSTCOMMIT
TRANSACTION
All scopes, other than TRANSACTION, are transient and
are used only as part of normal query processing.
Notes
• Locks acquired on tables that were subsequently dropped by another transaction can result
in the message, Unknown or deleted object, appearing in the output's OBJECT column.
• Running a SELECT … from LOCKS can time out after five minutes. This situation occurs
occurs when the cluster has failed. Run the Diagnostics Utility and contact Technical
Support (on page 33).

-442-
SQL System Tables (Monitoring APIs)

The following two tables are from Transaction Processing: Concepts and Techniques
http://www.amazon.com/gp/product/1558601902/ref=s9sdps_c1_14_at1-rfc_p-frt_p-3237_g1_si1
?pf_rd_m=ATVPDKIKX0DER&pf_rd_s=center-1&pf_rd_r=1QHH6V589JEV0DR3DQ1D&pf_rd_t
=101&pf_rd_p=463383351&pf_rd_i=507846 by Jim Gray (Figure 7.11, p. 408 and Figure 8.6, p.
467).
Table 1: Compatibility matrix for granular locks
This table is for compatibility with other users. The table is symmetric.

Granted Mode

Requested Mode S I X T

S Yes No No Yes
I No Yes No Yes
X No No No No
T Yes Yes No Yes

The following two examples refer to Table 1:


• Example 1: If someone else has an S lock, you cannot get an I lock.
• Example 2: If someone has an I lock, you can get an I lock.

Table 2: Lock conversion matrix


This table is used for upgrading locks you already have. For example, If you have an S lock and
you want an I lock, you request an X lock. If you have an S lock and you want an S lock, no lock
requests is required.

Granted Mode

Requested Mode S I X T

S S X X S
I X I X I
X X X X X
T S I X T

The following table call shows that there are no current locks in use:
SELECT * FROM LOCKS;
node_names | object_name | object_id | transaction_description | lock_mode | lock_scope
------------+-------------+-----------+-------------------------+-----------+------------
(0 rows)

See Also
DUMP_LOCKTABLE (page 270)

-443-
SQL Reference Manual

PROJECTION_REFRESHES (page 446)


SESSION_PROFILES (page 457)

NODE_RESOURCES
Provides a snapshot of the node. This is useful for regularly polling the node with automated
tools or scripts.

Column Name Data Type Description


CURRENT_TIMESTAMP VARCHAR The Linux system time of query execution in a format
that can be used as a Date/Time Expression (page
76).
NODE_NAME VARCHAR The name of the node that is reporting the requested
information.
HOST_NAME VARCHAR The hostname associated with a particular node.
PROCESS_SIZE_BYTES INTEGER The total size of the program.
PROCESS_RESIDENT_SET_S INTEGER The total number of pages that the process has in
IZE_BYTES memory.
PROCESS_SHARED_MEMORY_ INTEGER The amount of shared memory used.
SIZE_BYTES
PROCESS_TEXT_MEMORY_SI INTEGER The total number of text pages that the process has
ZE_BYTES in physical memory. This does not include any
shared libraries.
PROCESS_DATA_MEMORY_SI INTEGER The amount of physical memory, in pages, used for
ZE_BYTES performing processes. This does not include the
executable code.
PROCESS_LIBRARY_MEMORY INTEGER The total number of library pages that the process
_SIZE_BYTES has in physical memory.
PROCESS_DIRTY_MEMORY_S INTEGER The number of pages that have been modified since
IZE_BYTES they were last written to disk.

Example
Call the NODE_RESOURCES table:
SELECT * FROM NODE_RESOURCES;
-[ RECORD 1 ]---------------------+---------------------------
current_timestamp | 2009-09-01 18:17:24.825473
node_name | site01
host_name | fc10-1.vertica.com
process_size_bytes | 2665398272
process_resident_set_size_bytes | 44126208
process_shared_memory_size_bytes | 11071488
process_text_memory_size_bytes | 32907264
process_data_memory_size_bytes | 0
process_library_memory_size_bytes | 2531241984
process_dirty_memory_size_bytes | 0
-[ RECORD 2 ]---------------------+---------------------------

-444-
SQL System Tables (Monitoring APIs)

current_timestamp | 2009-09-01 18:17:24.825051


node_name | site02
host_name | fc10-2.vertica.com
process_size_bytes | 2530394112
process_resident_set_size_bytes | 27426816
process_shared_memory_size_bytes | 8208384
process_text_memory_size_bytes | 32907264
process_data_memory_size_bytes | 0
process_library_memory_size_bytes | 2396237824
process_dirty_memory_size_bytes | 0
-[ RECORD 3 ]---------------------+---------------------------
current_timestamp | 2009-09-01 18:17:24.835606
node_name | site03
host_name | fc10-3.vertica.com
process_size_bytes | 2529394688
process_resident_set_size_bytes | 26603520
process_shared_memory_size_bytes | 8220672
process_text_memory_size_bytes | 32907264
process_data_memory_size_bytes | 0
process_library_memory_size_bytes | 2395238400
process_dirty_memory_size_bytes | 0
-[ RECORD 4 ]---------------------+---------------------------
current_timestamp | 2009-09-01 18:17:24.831238
node_name | site04
host_name | fc10-4.vertica.com
process_size_bytes | 2529611776
process_resident_set_size_bytes | 26202112
process_shared_memory_size_bytes | 8208384
process_text_memory_size_bytes | 32907264
process_data_memory_size_bytes | 0
process_library_memory_size_bytes | 2395455488
process_dirty_memory_size_bytes | 0

PARTITIONS
Displays partition metadata, one row per partition key, per ROS container.

Column Name Data Type Description


PARTITION_KEY VARCHAR The partition value
TABLE_SCHEMA VARCHAR The name of the schema
PROJECTION_NAME VARCHAR The projection name
ROS_ID VARCHAR The object ID that uniquely references the ROS container
ROS_SIZE_BYTES INTEGER The ROS container size in bytes
ROS_ROW_COUNT INTEGER Number of rows in the ROS container
NODE_NAME VARCHAR Node where the ROS container resides

-445-
SQL Reference Manual

Notes
• A many-to-many relationship exists between partitions and ROS containers. PARTITIONS
displays information in a denormalized fashion.
• To find the number of ROS containers having data of a specific partition, you aggregate
PARTITIONS over the partition_key column.
• To find the number of partitions stored in a ROS container, you aggregate PARTITIONS over
the ros_id column.

Example
Projection 'p1' has three ROS containers, RC1, RC2 and RC3, with the values defined in the
following table:
COLUMN NAME RC1 RC2 RC3
----------------+-------------------+-------------------+-----------------
PARTITION_KEY (20,30,40) (20) (30,60)
ROS_ID 45035986273705000 45035986273705001 45035986273705002
SIZE 1000 20000 30000
ROW_ROW_COUNT 100 200 300
NODE_NAME n1 n1 n1
In this example, PARTITIONS has six rows with the following values:
(20, 'p1', 45035986273705000, 10000, 100, 'n1')
(30, 'p1', 45035986273705000, 10000, 100, 'n1')
(40, 'p1', 45035986273705000, 10000, 100, 'n1')
(20, 'p1', 45035986273705001, 20000, 200, 'n1')
(30, 'p1', 45035986273705002, 30000, 300, 'n1')
(60, 'p1', 45035986273705002, 30000, 300, 'n1')

PROJECTION_REFRESHES
Provides information about refresh operations for projections. Information regarding refresh
operations is maintained as follows:
• Information about a successful refresh is maintained until the refresh session is closed.
• Information about an unsuccessful refresh is maintained, whether or not the refresh session
is closed, until the projection is the target of another refresh operation.
• All refresh information for a node is lost when the node is shut down.
• After a refresh completes, the refreshed projections go into a single ROS container. If the
table was created with a PARTITION BY clause, then you should call PARTITION_TABLE()
or PARTITION_PROJECTION() to reorganize the data into multiple ROS containers, since
queries on projections with multiple ROS containers perform better than queries on
projections with a single ROS container.

Column Name Data Type Description


NODE_NAME VARCHAR The name of the node that is reporting the requested
information.
PROJECTION_NAME VARCHAR The name of the projection that is targeted for refresh.

-446-
SQL System Tables (Monitoring APIs)

ANCHOR_TABLE_NAME VARCHAR The name of the projection's anchor table.

REFRESH_STATUS VARCHAR The status of the projection:


Queued — Indicates that a projection is queued for refresh.
Refreshing — Indicates that a refresh for a projection is in
process.
Refreshed — Indicates that a refresh for a projection has
successfully completed.
Failed — Indicates that a refresh for a projection did not
successfully complete.
REFRESH_PHASE VARCHAR Indicates how far the refresh has progressed:
Historical – Indicates that the refresh has reached the first
phase and is refreshing data from historical data. This refresh
phase requires the most amount of time.
Current – Indicates that the refresh has reached the final
phase and is attempting to refresh data from the current
epoch. To complete this phase, refresh must be able to obtain
a lock on the table. If the table is locked by some other
transaction, refresh is put on hold until that transaction
completes.
The LOCKS (page 441) system table is useful for
determining if a refresh has been blocked on a table lock.
To determine if a refresh has been blocked, locate the
term "refresh" in the transaction description. A refresh has
been blocked when the scope for the refresh is
REQUESTED and one or more other transactions have
acquired a lock on the table.
Note: This field is NULL until the projection starts to refresh.
REFRESH_METHOD VARCHAR The method used to refresh the projection:
Buddy – Uses the contents of a buddy to refresh the
projection. This method maintains historical data. This enables
the projection to be used for historical queries.
Scratch – Refreshes the projection without using a buddy.
This method does not generate historical data. This means
that the projection cannot participate in historical queries from
any point before the projection was refreshed.
REFRESH_FAILURE_COUN INTEGER The number of times a refresh failed for the projection.
T FAILURE_COUNT does not indicate whether or not the
projection was eventually refreshed successfully. See
STATUS to determine how the refresh operation is
progressing.
SESSION_ID VARCHAR A unique ID that identifies the refresh session.

REFRESH_START STRING The time the projection refresh started (provided as a time
stamp).
REFRESH_DURATION_SEC INTEGER The length of time that the projection refresh ran in seconds.

-447-
SQL Reference Manual

Example
Call the PROJECTION_REFRESHES table:
SELECT * FROM PROJECTION_REFRESHES;

PROJECTION_STORAGE
Monitors the amount of disk storage used by each projection on each node.

Column Name Data Type Description


CURRENT_TIMESTAMP VARCHAR The Linux system time of query execution in a format that can
be used as a Date/Time Expression (page 76).
NODE_NAME VARCHAR The name of the node that is reporting the requested
information.
PROJECTION_NAME VARCHAR The name of the projection.
PROJECTION_SCHEMA VARCHAR The name of the schema associated with the projection.
PROJECTION_COLUMN_CO INTEGER The number of columns in the projection.
UNT
ROW_COUNT INTEGER The number of rows in the table's projections, excluding any
rows marked for deletion.
USED_BYTES INTEGER The number of bytes of disk storage used by the projection.
WOS_ROW_COUNT INTEGER The number of WOS rows in the projection.
WOS_USED_BYTES INTEGER The number of WOS bytes in the projection.
ROS_ROW_COUNT INTEGER The number of ROS rows in the projection.
ROS_USED_BYTES INTEGER The number of ROS bytes in the projection.
ROS_COUNT INTEGER The number of ROS containers in the projection.
ANCHOR_TABLE_NAME VARCHAR The associated table name.
ANCHOR_TABLE_SCHEMA VARCHAR The associated table's schema name.

Example
SELECT projection_name, total_row_count, ros_used_bytes, total_used_bytes
FROM PROJECTION_STORAGE
WHERE projection_schema = 'store'
ORDER BY total_used_bytes;

projection_name | total_row_count | ros_used_bytes | total_used_bytes


-------------------------+-----------------+----------------+------------------
store_dimension_site04 | 250 | 10715 | 10715
store_dimension_site02 | 250 | 10715 | 10715
store_dimension_site03 | 250 | 10715 | 10715
store_dimension_site01 | 250 | 10715 | 10715
store_orders_fact_p1_b1 | 24827 | 636533 | 636533
store_orders_fact_p1 | 24827 | 636533 | 636533
store_orders_fact_p1 | 24888 | 637872 | 637872
store_orders_fact_p1_b1 | 24888 | 637872 | 637872
store_orders_fact_p1 | 25010 | 641137 | 641137

-448-
SQL System Tables (Monitoring APIs)

store_orders_fact_p1_b1 | 25010 | 641137 | 641137


store_orders_fact_p1 | 25275 | 647623 | 647623
store_orders_fact_p1_b1 | 25275 | 647623 | 647623
store_sales_fact_p1_b1 | 24827 | 946657 | 946657
store_sales_fact_p1 | 24827 | 946657 | 946657
store_sales_fact_p1_b1 | 24888 | 949436 | 949436
store_sales_fact_p1 | 24888 | 949436 | 949436
store_sales_fact_p1 | 25010 | 952744 | 952744
store_sales_fact_p1_b1 | 25010 | 952744 | 952744
store_sales_fact_p1_b1 | 25275 | 962010 | 962010
store_sales_fact_p1 | 25275 | 962010 | 962010
(20 rows)

QUERY_METRICS
Monitors the sessions and queries executing on each node.

Column Name Data Type Description


CURRENT_TIMESTAMP VARCHAR The Linux system time of query execution in a
format that can be used as a Date/Time
Expression (page 76).
NODE_NAME VARCHAR The name of the node that is reporting the
requested information.
ACTIVE_USER_SESSION_COUNT INTEGER The number of active user sessions
(connections).
ACTIVE_SYSTEM_SESSION_COUN INTEGER The number of active system sessions.
T
TOTAL_USER_SESSION_COUNT INTEGER The total number of user sessions.
TOTAL_SYSTEM_SESSION_COUNT INTEGER The total number of system sessions.
TOTAL_ACTIVE_SESSION_COUNT INTEGER The total number of active user and system
sessions.
TOTAL_SESSION_COUNT INTEGER The total number of user and system sessions.
RUNNING_QUERY_COUNT INTEGER The number of queries currently running.
EXECUTED_QUERY_COUNT INTEGER Containing the total number of queries executed.

Example
=> SELECT * FROM QUERY_METRICS;

-[ RECORD 1 ]---------------+---------------------------
current_timestamp | 2009-08-11 15:40:58.292286
node_name | site01
active_user_session_count | 1
active_system_session_count | 2
total_user_session_count | 50
total_system_session_count | 45490
total_active_session_count | 3

-449-
SQL Reference Manual

total_session_count | 3
running_query_count | 1
executed_query_count | 76
-[ RECORD 2 ]---------------+---------------------------
current_timestamp | 2009-08-11 15:40:58.241425
node_name | site02
active_user_session_count | 1
active_system_session_count | 2
total_user_session_count | 50
total_system_session_count | 45491
total_active_session_count | 3
total_session_count | 3
running_query_count | 0
executed_query_count | 0
-[ RECORD 3 ]---------------+---------------------------
current_timestamp | 2009-08-11 15:40:58.259699
node_name | site03
active_user_session_count | 1
active_system_session_count | 2
total_user_session_count | 50
total_system_session_count | 45491
total_active_session_count | 3
total_session_count | 3
running_query_count | 0
executed_query_count | 0
-[ RECORD 4 ]---------------+---------------------------
current_timestamp | 2009-08-11 15:40:58.292833
node_name | site04
active_user_session_count | 1
active_system_session_count | 2
total_user_session_count | 50
total_system_session_count | 45490
total_active_session_count | 3
total_session_count | 3
running_query_count | 0
executed_query_count | 0

QUERY_PROFILES
Provides information regarding executed queries. To obtain information about executed queries,
see Profiling Database Performance.

Column Name Data Type Description


NODE_NAME VARCHAR The name of the node that is reporting the requested
information.
SESSION_ID VARCHAR The identification of the session for which profiling
information is captured. This identifier is unique within
the cluster at any point in time but can be reused when
the session closes.
TRANSACTION_ID INTEGER An identifier for the transaction within the session if any;
otherwise NULL.

-450-
SQL System Tables (Monitoring APIs)

STATEMENT_ID INTEGER An ID for the currently executing statement. NULL


indicates that
no statement is currently being processed.
QUERY VARCHAR The query string used for the query.
QUERY_SEARCH_PATH VARCHAR A list of schemas in which to look for tables.
PROJECTIONS_USED VARCHAR The projections used in the query.
QUERY_DURATION_US INTEGER The duration of the query in microseconds.
QUERY_START_EPOCH VARCHAR The epoch number at the start of the given query.
QUERY_START VARCHAR The Linux system time of query execution in a format
that can be used as a Date/Time Expression.
QUERY_TYPE VARCHAR Is one of INSERT, SELECT, UPDATE, DELETE, or
UTILITY
ERROR_CODE INTEGER The return error code for the query.
USER_NAME VARCHAR The name of the user who ran the query.
PROCESSED_ROW_COUNT INTEGER The number of rows returned by the query.

Example
Call the QUERY_PROFILES table:
SELECT * FROM QUERY_PROFILES;
-[ RECORD 1 ]-------+-------------------------------------------------------
...
-[ RECORD 22 ]------+-------------------------------------------------------
node_name | site01
session_id | fc10-1-16482:0x505e
transaction_id | 45035996273812216
statement_id | 2
query | SELECT constraint_name, table_name, ordinal_position,
reference_table_name FROM foreign_keys ORDER BY 4;
query_search_path | "$user", public, v_catalog, v_monitor, v_system
projections_used | v_catalog.foreign_keys_p
query_duration | 30836
query_start_epoch | 1416
query_start | 2009-08-11 12:22:16.504098
query_type | 1
error_code | 0
user_name | dbadmin
processed_row_count | 26
-[ RECORD 23 ]------+-------------------------------------------------------
node_name | site01
session_id | fc10-1-16482:0x50f1
transaction_id | 45035996273812219
statement_id | 2
query | SELECT constraint_name, table_name, ordinal_position,
reference_table_name FROM foreign_keys ORDER BY 3;
query_search_path | "$user", public, v_catalog, v_monitor, v_system
projections_used | v_catalog.foreign_keys_p
query_duration | 19841
query_start_epoch | 1416
query_start | 2009-08-11 12:23:27.209952
query_type | 1
error_code | 0

-451-
SQL Reference Manual

user_name | dbadmin
processed_row_count | 26
-[ RECORD 24 ]------+-------------------------------------------------------
node_name | site01
session_id | fc10-1-16482:0x535e
transaction_id | 45035996273812233
statement_id | 2
query | SELECT * FROM foreign_keys;
query_search_path | "$user", public, v_catalog, v_monitor, v_system
projections_used | v_catalog.foreign_keys_p
query_duration | 19860
query_start_epoch | 1418
query_start | 2009-08-11 12:28:24.744115
query_type | 1
error_code | 0
user_name | dbadmin
processed_row_count | 26
-[ RECORD 25 ]------+-------------------------------------------------------
node_name | site01
session_id | fc10-1-16482:0x6342
transaction_id | 45035996273812322
statement_id | 0
query | SELECT * FROM grants;
query_search_path | "$user", public, v_catalog, v_monitor, v_system
projections_used |
query_duration | -1
query_start_epoch | 1429
query_start | 2009-08-11 13:00:44.217356
query_type | 0
error_code | 16932996
user_name | dbadmin
processed_row_count | 0
-[ RECORD 26 ]------+-------------------------------------------------------
node_name | site01
session_id | fc10-1-16482:0x63df
transaction_id | 45035996273812325
statement_id | 2
query | SELECT grantee,grantor,privileges_description,
table_schema,table_name from v_system.vs_grants;
query_search_path | "$user", public, v_catalog, v_monitor, v_system
projections_used | v_system.vs_grants_p
query_duration | 19857
query_start_epoch | 1429
query_start | 2009-08-11 13:02:06.870678
query_type | 1
error_code | 0
user_name | dbadmin
-[ RECORD ...]------+-------------------------------------------------------

RESOURCE_REJECTIONS
Monitors requests for resources that are rejected by the resource manager.

Column Name Data Type Description


CURRENT_TIMESTAMP VARCHAR The Linux system time of query execution in a format
that can be used as a Date/Time Expression (page
76).
NODE_NAME VARCHAR The name of the node that is reporting the requested
information.
ACCUMULATION_START TIMESTAMP The time of first rejection for this requester type.
REQUEST_TYPE VARCHAR The requester type.

-452-
SQL System Tables (Monitoring APIs)

REJECT_COUNT INTEGER The total number of rejections for this requester type.
TIMEOUT_COUNT INTEGER The total number of timeouts for this requester type.
CANCEL_COUNT INTEGER The total number of cancellations for this requester
type.
LAST_REQUEST_REJECTED_TYP VARCHAR The last resource type rejected for this plan type.
E
LAST_REQUEST_REJECTED_REA VARCHAR The reason for the last rejection of this plan type.
SON
THREAD_REQUEST_COUNT INTEGER The total number of thread type rejections.
FILE_HANDLE_REQUEST_COUNT INTEGER The total number of file handle type rejections.
MEMORY_REQUIREMENTS_BYTES INTEGER The total number of memory type rejections.
ADDRESS_SPACE_REQUIREMENT INTEGER The total number of address space type rejections.
S_BYTES

Requester Types
• Plans (see Plan Types)
• WOS

Plan Types
• Load Query
• Load Query Direct
• Insert Query
• Insert Query Direct
• Delete Query
• Select Query
• TM_MOVEOUT
• TM_MERGEOUT
• TM_ANALYZE,
• TM_DIRECTLOAD
• TM_REDELETE_MOVE
• TM_REDELETE_MERGE
• RECOVER
• ROS_SPLIT
• TM_DVWOS_MOVEOUT
• REDELETE_RECOVER
• REFRESH_HISTORICAL
• REFRESH_CURRENT
• ROS_SPLIT_REDELETE_1
• ROS_SPLIT_REDELETE_2

-453-
SQL Reference Manual

Resource Types
• Number of running plans
• Number of running plans on initiator node (local)
• Number of requested Threads
• Number of requested File Handles
• Number of requested KB of Memory
• Number of requested KB of Address Space

Reasons for Rejection


• Usage of single request exceeds high limit
• Timed out waiting for resource reservation
• Canceled waiting for resource reservation

Example
Call the RESOURCE_REJECTIONS table:
SELECT * FROM RESOURCE_REJECTIONS;

RESOURCE_USAGE
Monitors system resource management on each node.

Column Name Data Type Description


CURRENT_TIMESTAMP VARCHAR The Linux system time of query execution in a
format that can be used as a Date/Time
Expression (page 76).
NODE_NAME VARCHAR The name of the node that is reporting the
requested information.
REQUEST_COUNT INTEGER The cumulative number of requests for threads,
file handles, and memory (in kilobytes).
LOCAL_REQUEST_COUNT INTEGER The cumulative number of local requests.
REQUEST_QUEUE_DEPTH INTEGER The current request queue depth.
ACTIVE_THREAD_COUNT INTEGER The current number of active threads.
OPEN_FILE_HANDLE_COU INTEGER The current number of open file handles.
NT
MEMORY_REQUESTED_KB INTEGER The memory requested in kilobytes.
ADDRESS_SPACE_REQUES INTEGER The address space requested in kilobytes.
TED_KB
WOS_USED_BYTES INTEGER The size of the WOS in bytes.
WOS_ROW_COUNT INTEGER The number of rows in the WOS.
ROS_USED_BYTES INTEGER The size of the ROS in bytes.
ROS_ROW_COUNT INTEGER The number of rows in the ROS.

-454-
SQL System Tables (Monitoring APIs)

TOTAL_USED_BYTES INTEGER The total size of storage (WOS + ROS) in bytes.


TOTAL_ROW_COUNT INTEGER The total number of rows in storage (WOS +
ROS).
RESOURCE_REQUEST_REJ INTEGER The number of rejected plan requests.
ECT_COUNT
RESOURCE_REQUEST_TIM INTEGER The number of resource request timeouts.
EOUT_COUNT
RESOURCE_REQUEST_CAN INTEGER The number of resource request cancellations.
CEL_COUNT
DISK_SPACE_REQUEST_R INTEGER The number of rejected disk write requests.
EJECT_COUNT
FAILED_VOLUME_REJECT INTEGER The number of rejections due to a failed volume.
_COUNT
TOKENS_USED INTEGER For internal use only.
TOKENS_AVAILABLE INTEGER For internal use only.

Example
=> SELECT * FROM RESOURCE_USAGE;

-[ RECORD 1 ]-------------------+---------------------------
current_timestamp | 2009-08-11 16:22:50.005942
node_name | site01
request_count | 1
local_request_count | 1
request_queue_depth | 0
active_thread_count | 4
open_file_handle_count | 2
memory_requested_kb | 4352
address_space_requested_kb | 106752
wos_used_bytes | 0
wos_row_count | 0
ros_used_bytes | 10390319
ros_row_count | 324699
total_used_bytes | 10390319
total_row_count | 324699
resource_request_reject_count | 0
resource_request_timeout_count | 0
resource_request_cancel_count | 0
disk_space_request_reject_count | 0
failed_volume_reject_count | 0
tokens_used | 1
tokens_available | 7999999
-[ RECORD 2 ]-------------------+---------------------------
current_timestamp | 2009-08-11 16:22:50.005965
node_name | site02
request_count | 0
local_request_count | 0
request_queue_depth | 0
active_thread_count | 0
open_file_handle_count | 0

-455-
SQL Reference Manual

memory_requested_kb | 0
address_space_requested_kb | 0
wos_used_bytes | 0
wos_row_count | 0
ros_used_bytes | 10359489
ros_row_count | 324182
total_used_bytes | 10359489
total_row_count | 324182
resource_request_reject_count | 0
resource_request_timeout_count | 0
resource_request_cancel_count | 0
disk_space_request_reject_count | 0
failed_volume_reject_count | 0
tokens_used | 0
tokens_available | 8000000
-[ RECORD 3 ]-------------------+---------------------------
current_timestamp | 2009-08-11 16:22:50.005976
node_name | site03
request_count | 0
local_request_count | 0
request_queue_depth | 0
active_thread_count | 0
open_file_handle_count | 0
memory_requested_kb | 0
address_space_requested_kb | 0
wos_used_bytes | 0
wos_row_count | 0
ros_used_bytes | 10355231
ros_row_count | 324353
total_used_bytes | 10355231
total_row_count | 324353
resource_request_reject_count | 0
resource_request_timeout_count | 0
resource_request_cancel_count | 0
disk_space_request_reject_count | 0
failed_volume_reject_count | 0
tokens_used | 0
tokens_available | 8000000
-[ RECORD 4 ]-------------------+---------------------------
current_timestamp | 2009-08-11 16:22:50.005986
node_name | site04
request_count | 0
local_request_count | 0
request_queue_depth | 0
active_thread_count | 0
open_file_handle_count | 0
memory_requested_kb | 0
address_space_requested_kb | 0
wos_used_bytes | 0
wos_row_count | 0
ros_used_bytes | 10385744
ros_row_count | 324870
total_used_bytes | 10385744
total_row_count | 324870

-456-
SQL System Tables (Monitoring APIs)

resource_request_reject_count | 0
resource_request_timeout_count | 0
resource_request_cancel_count | 0
disk_space_request_reject_count | 0
failed_volume_reject_count | 0
tokens_used | 0
tokens_available | 8000000

SESSION_PROFILES
Provides basic session parameters and lock time out data. To obtain information about sessions,
see Profiling Database Performance.

Column Name Data Type Description


CURRENT_TIMESTAMP VARCHA The Linux system time of query execution in a format that can
R be used as a Date/Time Expression (page 76).
NODE_NAME VARCHA The name of the node that is reporting the requested
R information.
USER_NAME VARCHA The name used to log into the database or NULL if the
R session is
internal.
CLIENT_HOSTMANE VARCHA The host name and port of the TCP socket from which the
R client
connection was made; NULL if the session is internal.
LOGIN_TIMESTAMP VARCHA The date and time the user logged into the database or when
R the internal session was created. This can be useful for
identifying sessions that have been left open for a period of
time and could be idle.
LOGOUT_TIMESTAMP VARCHA The date and time the user logged out of the database or
R when the internal session was closed.
SESSION_ID VARCHA The identification of the session for which profiling information
R is captured. This identifier is unique within the cluster at any
point in time but can be reused when the session closes.
EXECUTED_STATEMENT_S INTEGER The number of successfully executed statements.
UCCESS_COUNT
EXECUTED_STATEMENT_F INTEGER The number of unsuccessfully executed statements.
AILURE_COUNT
LOCK_GRANT_COUNT INTEGER The number of locks granted during the session.

DEADLOCK_COUNT INTEGER The number of deadlocks encountered during the session.


LOCK_TIMEOUT_COUNT INTEGER The number of times a lock timed out during the session.
LOCK_CANCELLATION_CO INTEGER The number of times a lock was cancelled during the session.
UNT

-457-
SQL Reference Manual

LOCK_REJECTION_COUNT INTEGER The number of times a lock was rejected during a session.
LOCK_ERROR_COUNT INTEGER The number of lock errors encountered during the session.

Example
Call the SESSION_PROFILES table:
SELECT * FROM SESSION_PROFILES;

See Also
LOCKS (page 441)

SESSIONS
Monitors external sessions. You can use this table to:
• Identify users who are running long queries
• Identify users who are holding locks due to an idle but uncommitted transaction
• Disconnect users in order to shut down the database
• Determine the details behind the type of database security (Secure Socket Layer (SSL) or
client authentication) used for a particular session.

Column Name Data Type Description


CURRENT_TIMESTAM VARCHAR The Linux system time of query execution in a format that can be
P used as a Date/Time Expression (page 76).
NODE_NAME VARCHAR The name of the node that is reporting the requested information.
USER_NAME VARCHAR The name used to log into the database or NULL if the session is
internal.
CLIENT_HOSTNAME VARCHAR The host name and port of the TCP socket from which the client
connection was made; NULL if the session is internal.
LOGIN_TIMESTAMP DATE The date and time the user logged into the database or when the
internal session was created. This can be useful for identifying
sessions that have been left open for a period of time and could be
idle.
SESSION_ID VARCHAR The identifier required to close or interrupt a session. This identifier
is unique within the cluster at any point in time but can be reused
when the session closes.
TRANSACTION_STAR DATE The date/time the current transaction started or NULL if no
T transaction is running.
TRANSACTION_ID VARCHAR A string containing the hexadecimal representation of the
transaction ID, if any; otherwise NULL.
TRANSACTION_DESC VARCHAR A description of the current transaction.
RIPTION
STATEMENT_START DATE The date/time the current statement started execution, or NULL if
no statement is running.

-458-
SQL System Tables (Monitoring APIs)

STATEMENT_ID VARCHAR An ID for the currently executing statement. NULL indicates that
no statement is currently being processed.
LAST_STATEMENT_D INTEGER The duration of the last completed statement in microseconds.
URATION_US
CURRENT_STATEMEN VARCHAR The currently executing statement, if any. NULL otherwise.
T
SSL_STATE VARCHAR Indicates if Vertica used Secure Socket Layer (SSL) for a particular
session. Possible values are:
None – Vertica did not use SSL.
Server – Sever authentication was used, so the client could
authenticate the server.
Mutual – Both the server and the client authenticated one another
through mutual authentication.
See Vertica Security and Implementing SSL.
AUTHENTICATION_M VARCHAR The type of client authentication used for a particular session, if
ETHOD known. Possible values are:
Unknown
Trust
Reject
Kerberos
Password
MD5
LDAP
Kerberos-GSS
See Vertica Security and Implementing Client Authentication.

Notes
• The superuser has unrestricted access to all session information, but users can only view
information about their own, current sessions.
• During session initialization and termination, you might see sessions running only on nodes
other than the node on which you executed the virtual table query. This is a temporary
situation and corrects itself as soon as session initialization and termination completes.

Example
=> SELECT * FROM SESSIONS;

-[ RECORD 1 ]--------------+---------------------------------------------
current_timestamp | 2009-08-11 16:32:16.540641
node_name | site01
user_name | dbadmin
client_hostname | 127.0.0.1:36674
login_timestamp | 2009-08-11 13:37:59.486908
session_id | fc10-1-16482:0x7586
transaction_start | 2009-08-11 14:23:15.014816
transaction_id | 0xa000000001a440
transaction_description | user dbadmin (select * from node_resources;)

-459-
SQL Reference Manual

statement_start | 2009-08-11 16:32:16.530551


statement_id | 42949673042
last_statement_duration_ms | 26856
current_statement | SELECT * FROM SESSIONS;
ssl_state | None
authentication_method | Trust

STORAGE_CONTAINERS
Monitors information about each storage container in the database.

Column Name Data Type Description


NODE_NAME VARCHAR The name of the node that is reporting the requested information.
SCHEMA_NAME VARCHAR The name of the schema.
PROJECTION_NAME VARCHAR The name of the projection.
STORAGE_TYPE VARCHAR Type of storage container: ROS or WOS.
STORAGE_OID INTEGER The storage ID assigned by Vertica.
TOTAL_ROW_COUNT VARCHAR Total rows in the projection.
DELETED_ROW_COUN INTEGER Rows deleted from the projection.
T
BYTES_USED INTEGER Size of the projection.
START_EPOCH VARCHAR The number of the start epoch.
END_EPOCH VARCHAR The number of the end epoch.
GROUPING VARCHAR The group by which columns are stored:
ALL – All columns are grouped
PROJECTION – Columns grouped according to projection
definition
NONE – No columns grouped, despite grouping in the projection
definition
OTHER – Some grouping but neither all nor according to projection
(e.g., results from add column)

Example
SELECT schema_name, storage_type, projection_name, grouping
FROM storage_containers;

schema_name | projection_name | storage_type | grouping


-------------+-----------------------------------------+--------------+-------------
public product_dimension_tmp_initiator ROS ALL
public product_dimension_grouped_tmp_initiator ROS ALL
public product_dimension_tmp_initiator ROS PROJECTION
public product_dimension_grouped_tmp_initiator ROS ALL
public product_dimension_tmp_initiator ROS PROJECTION
public product_dimension_grouped_tmp_initiator ROS ALL
public product_dimension_grouped_tmp_initiator ROS ALL
public product_dimension_grouped_tmp_initiator ROS ALL
public product_dimension_grouped_tmp_initiator ROS ALL
public product_dimension_tmp_initiator ROS PROJECTION

-460-
SQL System Tables (Monitoring APIs)

public product_dimension_tmp_initiator ROS PROJECTION


public product_dimension_tmp_initiator ROS PROJECTION
public product_dimension_tmp_initiator ROS PROJECTION
public product_dimension_tmp_initiator ROS PROJECTION
public product_dimension_tmp_initiator ROS PROJECTION
public product_dimension_grouped_tmp_initiator ROS ALL
public product_dimension_grouped_tmp_initiator ROS ALL
public product_dimension_grouped_tmp_initiator ROS ALL

SYSTEM
Monitors the overall state of the database.

Column Name Data Type Description


CURRENT_TIMESTAMP VARCHAR The Linux system time of query execution in a format that can
be used as a Date/Time Expression (page 76).
CURRENT_EPOCH INTEGER The current epoch number.
AHM_EPOCH INTEGER The ahm epoch number.
LAST_GOOD_EPOCH INTEGER The smallest (min) of all the checkpoint epochs on the cluster.
REFRESH_EPOCH INTEGER The oldest of the refresh epochs of all the nodes in the cluster
DESIGNED_FAULT_TOLER INTEGER The designed or intended K-Safety level.
ANCE
NODE_COUNT INTEGER The number of nodes in the cluster.
NODES_DOWN_COUNT INTEGER The number of nodes in the cluster that are currently down.
CURRENT_FAULT_TOLERA INTEGER The number of node failures the cluster can tolerate before it
NCE shuts down automatically.
CATALOG_REVISION_NUM INTEGER The catalog version number.

WOS_USED_BYTES INTEGER The WOS size in bytes (cluster-wide).


WOS_ROW_COUNT INTEGER The number of rows in WOS (cluster-wide).
ROS_USED_BYTES INTEGER The ROS size in bytes (cluster-wide).
ROS_ROW_COUNT INTEGER The number of rows in ROS (cluster-wide).
TOTAL_USED_BYTES INTEGER The total storage in bytes (WOS + ROS) (cluster-wide).
TOTAL_ROW_COUNT INTEGER The total number of rows (WOS + ROS) (cluster-wide).

Example
Call the SYSTEM table:
mydb=> SELECT * FROM SYSTEM;
-[ RECORD 1 ]------------+---------------------------
current_timestamp | 2009-08-11 17:09:54.651413
current_epoch | 1512
ahm_epoch | 961
last_good_epoch | 1510

-461-
SQL Reference Manual

refresh_epoch | -1
designed_fault_tolerance | 1
node_count | 4
node_down_count | 0
current_fault_tolerance | 1
catalog_revision_number | 1590
wos_used_bytes | 0
wos_row_count | 0
ros_used_bytes | 41490783
ros_row_count | 1298104
total_used_bytes | 41490783
total_row_count | 1298104

TUPLE_MOVER_OPERATIONS
Monitors the status of the Tuple Mover on each node.

Column Name Data Type Description


CURRENT_TIMESTAMP VARCHAR The Linux system time of query execution in a format that
can be used as a Date/Time Expression (page 76).
NODE_NAME VARCHAR The name of the node that is reporting the requested
information.
OPERATION_NAME VARCHAR One of the following operations:
Moveout
Mergeout
Analyze Statistics
OPERATION_STATUS VARCHAR Running or an empty string to indicate 'not running.'
PROJECTION_NAME VARCHAR The name of the projection being processed.
OPERATION_START_EPOCH INTEGER The first epoch of the mergeout operation. (Not applicable
for other operations.)
OPERATION_END_EPOCH INTEGER The last epoch of the mergeout operation (not applicable for
other operations).
ROS_COUNT INTEGER The number of ROS containers.
TOTAL_ROS_USED_BYTES INTEGER The size in bytes of all ROS containers in the mergeout
operation. (Not applicable for other operations.)
PLAN_TYPE VARCHAR One of the following values:
Moveout
Mergeout
Analyze
Replay Delete

Example
Call the TUPLE_MOVER_OPERATIONS table:
SELECT * FROM TUPLE_MOVER_OPERATIONS;
No output means that the Tuple Mover is not performing an operation.

-462-
SQL System Tables (Monitoring APIs)

The following statement returns only a few columns from TUPLE_MOVER_OPERATIONS:


SELECT current_timestamp, node_name, operation_status, projection_name, plan_type
FROM TUPLE_MOVER_OPERATIONS;
current_timestamp | node_name | operation_status | projection_name | plan_type
----------------------------+-----------+------------------+------------------+-----------
2009-09-02 10:02:33.689642 | node0001 | Running | p1_b2 | Mergeout
2009-09-02 10:02:34.773513 | node0002 | Running | p1 | Mergeout
2009-09-02 10:04:09.755708 | node0001 | Running | p1_b2 | Replay Delete
2009-09-02 10:07:38.513913 | node0001 | Running | p1_b2 | Mergeout
2009-09-02 10:07:39.597861 | node0002 | Running | p1_b2 | Mergeout
2009-09-02 10:27:30.331516 | node0001 | Running | p1_b2 | Replay Delete
2009-09-02 10:27:31.437047 | node0002 | Running | p1 | Mergeout
2009-09-02 10:27:31.423701 | node0003 | Running | p1_b2 | Replay Delete
2009-09-02 10:44:21.775402 | node0001 | Running | p1 | Mergeout
2009-09-02 10:44:22.856147 | node0002 | Running | p1_b1 | Mergeout

WOS_CONTAINER_STORAGE
Monitors information about WOS storage, which is divided into regions. Each region allocates
blocks of a specific size to store rows.

Column Name Data Type Description


CURRENT_TIMESTAMP VARCHAR The Linux system time of query execution in a format that can
be used as a Date/Time Expression (page 76).
NODE_NAME VARCHAR The name of the node that is reporting the requested
information.
WOS_TYPE VARCHAR Either system or user data.
WOS_ALLOCATION_REGIO VARCHAR The block size allocated by region in KB. The summary line
N sums the amount of memory used by all regions.
REGION_VIRTUAL_SIZE INTEGER The amount of virtual memory in use by region in KB. Virtual
size is greater than or equal to allocated size, which is greater
than or equal to in-use size.
REGION_ALLOCATED_SIZ INTEGER The amount of physical memory in use by a particular region in
E KB.
REGION_IN_USE_SIZE INTEGER The actual number of bytes of data stored by the region in KB.
REGION_SMALL_RELEASE INTEGER Internal use only.
_COUNT
REGION_BIG_RELEASE_C INTEGER Internal use only.
OUNT

Notes
• The WOS allocator can use large amounts of virtual memory without assigning physical
memory.

-463-
SQL Reference Manual

• To see the difference between virtual size and allocated size, look at the
REGION_IN_USE_SIZE column to see if the WOS is full. The summary line tells you the
amount of memory used by the WOS, which is typically capped at one quarter of physical
memory per node.

Examples
=> SELECT * FROM WOS_CONTAINER_STORAGE;

-[ RECORD 1 ]--------------+---------------------------
current_timestamp | 2009-08-11 18:02:05.556007
node_name | site01
wos_type | user
wos_allocation_region | Summary
region_virtual_size | 0
region_allocated_size | 0
region_in_use_size | 0
region_small_release_count | 0
region_big_release_count | 0
-[ RECORD 2 ]--------------+---------------------------
current_timestamp | 2009-08-11 18:02:05.556012
node_name | site01
wos_type | v_system
wos_allocation_region | Summary
region_virtual_size | 0
region_allocated_size | 0
region_in_use_size | 0
region_small_release_count | 0
region_big_release_count | 0
-[ RECORD 3 ]--------------+---------------------------
current_timestamp | 2009-08-11 18:02:05.556015
node_name | site02
wos_type | user
wos_allocation_region | Summary
region_virtual_size | 0
region_allocated_size | 0
region_in_use_size | 0
region_small_release_count | 0
region_big_release_count | 0
-[ RECORD 4 ]--------------+---------------------------
current_timestamp | 2009-08-11 18:02:05.556018
node_name | site02
wos_type | v_system
wos_allocation_region | Summary
region_virtual_size | 0
region_allocated_size | 0
region_in_use_size | 0
region_small_release_count | 0
region_big_release_count | 0
-[ RECORD 5 ]--------------+---------------------------
current_timestamp | 2009-08-11 18:02:05.55602
node_name | site03
wos_type | user
wos_allocation_region | Summary

-464-
SQL System Tables (Monitoring APIs)

region_virtual_size | 0
region_allocated_size | 0
region_in_use_size | 0
region_small_release_count | 0
region_big_release_count | 0
-[ RECORD 6 ]--------------+---------------------------
current_timestamp | 2009-08-11 18:02:05.556023
node_name | site03
wos_type | v_system
wos_allocation_region | Summary
region_virtual_size | 0
region_allocated_size | 0
region_in_use_size | 0
region_small_release_count | 0
region_big_release_count | 0
-[ RECORD 7 ]--------------+---------------------------
current_timestamp | 2009-08-11 18:02:05.556025
node_name | site04
wos_type | user
wos_allocation_region | Summary
region_virtual_size | 0
region_allocated_size | 0
region_in_use_size | 0
region_small_release_count | 0
region_big_release_count | 0
-[ RECORD 8 ]--------------+---------------------------
current_timestamp | 2009-08-11 18:02:05.556026
node_name | site04
wos_type | v_system
wos_allocation_region | Summary
region_virtual_size | 0
region_allocated_size | 0
region_in_use_size | 0
region_small_release_count | 0
region_big_release_count | 0

-465-
Deprecated System Tables
The monitoring APIs listed in this section have been replaced by the tables listed in SQL System
Tables (Monitoring APIs (page 409).

Accessing Deprecated System Tables


If you run monitoring tools or scripts that rely on system tables in Vertica 3.0 and prior, you can
access the deprecated tables for a limited time by using a backward compatibility script. This
script creates views in the PUBLIC schema that are consistent with 3.0 functionality and grants
SELECT permissions to all users.

Enabling Deprecated Tables Manually


1 Upgrade Vertica or upgrade your drivers. See Upgrading Vertica in the Installation and
Configuration Guide for details.
Note: For major releases, the client must match the server.
2 Start your database and connect to it as the database owner. You'll see a prompt similar to
the following:
dbname=>
3 Type the following command to run the script:
=> \i /opt/vertica/scripts/virtual_tables_backward_compatibility.sql
Note: Run the backward compatibility script once only — either after you upgrade Vertica to
a newer version (on your existing databases) or after your create a new database. You do
not need to run it each time you log in.

Enabling Deprecated using AdminTools


Alternatively, the first time you run AdminTools after you update your database, you have the
option to let the system run the script for you:

To undo the effects of the backward compatibility script, simply drop the view or views created by
it. For example:
DROP public.vt_partitions;
For information about view names that the backward compatibility script creates, see
/opt/vertica/scripts/virtual_tables_backward_compatibility.sql.

See Also
DROP VIEW (page 364) in the SQL Reference Manual

-467-
SQL Reference Manual

See also Using the SQL Monitoring API for more information.

VT_ACTIVE_EVENTS
Note: This table is deprecated. Use ACTIVE_EVENTS (page 424) instead.
Provides information about database events across the cluster. See Monitoring Events.

Column Name Data Type Description


TIMESTAMP VARCHAR The Linux system time of query execution in a
format that can be used as a Date/Time
Expression.
NODE_NAME VARCHAR The node where the event occurred.
EVENT_CODE INTEGER A numeric ID that indicates the type of event. See
Event Types for a list of event type codes.
EVENT_ID INTEGER A unique numeric ID that identifies the specific
event.
EVENT_SEVERITY VARCHAR The severity of the event from highest to lowest.
These events are based on standard syslog
severity types.
0—Emergency
1—Alert
2—Critical
3—Error
4—Warning
5—Notice
6—Informational
7—Debug
EVENT_POSTED VARCHAR The year, month, day, and time the event was
reported. The time is posted in military time.
EVENT_EXPIRATION VARCHAR The year, month, day, and time the event expire.
The time is posted in military time. If the cause of
the event is still active, the event is posted again.
EVENT_CODE_DESCRIPTION VARCHAR A brief description of the event and details
pertinent to the specific situation.
PROBLEM_DESCRIPTION VARCHAR A generic description of the event.

REPORTING_NODE VARCHAR The name of the node within the cluster that
reported the event.
EVENT_SENT_TO VARCHAR The event logging mechanisms that are configured
for Vertica. These can include vertica.log,
(configured by default) syslog, and SNMP.
NUM_TIMES_POSTED INTEGER Tracks the number of times an event occurs.
Rather than posting the same event multiple times,

-468-
SQL System Tables (Monitoring APIs)

Vertica posts the event once and then counts the


number of additional instances in which the event
occurs.

VT_COLUMN_STORAGE
Note: This table is deprecated. Use COLUMN_STORAGE (page 426) instead.
Returns the amount of disk storage used by each column of each projection on each node.

Column Name Data Type Description


TIMESTAMP VARCHAR The Linux system time of query execution in a
format that can be used as a Date/Time
Expression.
NODE_NAME VARCHAR The name of the node that is reporting the
requested information.
COLUMN_NAME VARCHAR A projection column name.
NUM_ROWS INTEGER The number of rows in the column (cardinality).
NUM_BYTES INTEGER The disk storage allocation of the column in
bytes.
WOS_ROWS INTEGER The number of WOS rows in the column.
ROS_ROWS INTEGER The number of ROS rows in the column.
ROS_BYTES INTEGER The number of ROS bytes in the column.
NUM_ROS INTEGER The number of ROS containers.
PROJECTION_NAME VARCHAR The associated projection name.
TABLE_NAME VARCHAR The name of the table

VT_CURRENT_SESSION
Note: This table is deprecated. Use CURRENT_SESSION (page 428) instead.
Monitors the current active session. You can use this table to find out the current session's
sessionID and get the duration of the previously-run query.

Column Name Data Type Description


TIMESTAMP VARCHAR The Linux system time of query execution in a format
that can be used as a DATE/TIME expression.
NODE_NAME VARCHAR The name of the node that is reporting the requested
information.
USERNAME VARCHAR The name used to log into the database or NULL if the

-469-
SQL Reference Manual

session is internal
CLIENT VARCHAR The host name and port of the TCP socket from which
the client connection was made; NULL if the session is
internal
LOGIN_TIME DATE The date and time the user logged into the database or
when the internal session was created. This column
can be useful for identifying sessions that have been
left open and could be idle.
SESSIONID VARCHAR The identifier required to close or interrupt a session.
This identifier is unique within the cluster at any point in
time but can be reused when the session closes.
TXN_START DATE The date/time the current transaction started or NULL if
no transaction is running.
TXNID VARCHAR A string containing the hexadecimal representation of
the transaction ID, if any; otherwise NULL.
TXN_DESCRIPT VARCHAR A description of the current transaction.
STMT_START DATE The date/time the current statement started execution,
or NULL if no statement is running.
STMTID VARCHAR An ID for the currently executing statement. NULL
indicates that no statement is currently being
processed.
LAST_STMT_DURATI INTEGER The duration of the last completed statement in
ON milliseconds.
CURRENT_STMT VARCHAR The currently executing statement, if any. NULL
otherwise
LAST_STMT VARCHAR NULL if the user has just logged in; otherwise the
currently running statement or the most recently
completed statement.
EE_PROFILING_CON VARCHAR Returns a value that indicates whether profiling is
FIG turned on. Results are:
Empty when no profiling
'Local' when profiling on for this session
'Global' when on by default for all sessions
'Local, Global' when on by default for all sessions and
on for current session
QRY_PROFILING_CO VARCHAR Returns a value that indicates whether profiling is
NFIG turned on; empty when no profiling, 'Local' when on for
this session, 'Global' when on by default for sessions
and 'Local, Global' when on by default for all sessions
and on for current session.
SESSION_PROFILIN VARCHAR Returns a value that indicates whether profiling is
G_CONFIG turned on; empty when no profiling, 'Local' when on for
this session, 'Global' when on by default for sessions
and 'Local, Global' when on by default for all sessions
and on for current session.

-470-
SQL System Tables (Monitoring APIs)

Notes
• The default for profiling is ON ('1') for all sessions. Each session can turn profiling ON or
OFF.
• Profiling parameters (such as GlobalEEProfiling in the examples below) are set in the Vertica
configuration file (vertica.conf). To turn profiling off, set the parameter to '0'. To turn profiling
on, set the parameter to '1'.

Examples
The sequence of commands in this example shows the use of disabling and enabling profiling for
local and global sessions.
This command disables EE profiling for query execution runs:
SELECT disable_profiling('EE');
disable_profiling
-----------------------
EE Profiling Disabled
(1 row)
The following command sets the GlobalEEProfiling configuration parameter to 0, which turns off
profiling:
SELECT set_config_parameter('GlobalEEProfiling', '0');
set_config_parameter
----------------------------
Parameter set successfully
(1 row)
The following command tells you whether profiling is set to 'Local' or 'Global' or none:
SELECT ee_profiling_config FROM vt_current_session;
ee_profiling_config
---------------------

(1 row)
Note: The result set is empty because profiling was turned off in the preceding example.
This command now enables EE profiling for query execution runs:
SELECT enable_profiling('EE');
enable_profiling
----------------------
EE Profiling Enabled
(1 row)
Now when you run a select on the VT_CURRENT_SESSIONS table, you can see profiling is ON
for the local session:
SELECT ee_profiling_config FROM vt_current_session;
ee_profiling_config
---------------------
Local
(1 row)

-471-
SQL Reference Manual

Now turn profiling on for all sessions by setting the GlobalEEProfiling configuration parameter to
1:
SELECT set_config_parameter('GlobalEEProfiling', '1');
set_config_parameter
----------------------------
Parameter set successfully
(1 row)
Now when you run a select on the VT_CURRENT_SESSIONS table, you can see profiling is ON
for the local sessions, as well as for all sessions:
SELECT ee_profiling_config FROM vt_current_session;
ee_profiling_config
---------------------
Local, Global
(1 row)

See Also
VT_EE_PROFILING (page 473), VT_QUERY_PROFILING (page 485), and
VT_SESSION_PROFILING (page 490)

VT_DISK_RESOURCE_REJECTIONS
Note: This table is deprecated. Use DISK_RESOURCE_REJECTIONS (page 431) instead.
Monitors requests for resources that are rejected due to disk space shortages.

Column Name Data Type Description


TIMESTAMP VARCHAR The Linux system time of query execution
in a format that can be used as a
Date/Time Expression (page 76).
NODE_NAME VARCHAR The name of the node that is reporting
the requested information.
ACCUMULATION_START VARCHAR The time of first request rejection for this
requester.
REQUESTER VARCHAR The resource request requester
(example: plan type).
DISK_SPACE_RJT INTEGER The total number of disk space resource
requests rejected.
FAILED_VOLUME_RJT INTEGER The total number of disk space resource
requests on a failed volume.

VT_DISK_STORAGE
Note: This table is deprecated. Use DISK_STORAGE (page 432) instead.

-472-
SQL System Tables (Monitoring APIs)

Monitors the amount of disk storage used by the database on each node.

Column Name Date Type Description


TIMESTAMP VARCHA The Linux system time of query execution in a
R format that can be used as a Date/Time
Expression (page 76).
NODE_NAME VARCHA The name of the node that is reporting the
R requested information.
DISK_BLK_SIZE INTEGER The block size of the disk.

USED_BLKS INTEGER The number of disk blocks in use.

MB_USED INTEGER The number of megabytes of disk storage in use.

FREE_BLKS INTEGER The number of free disk blocks available.

MB_FREE INTEGER The number of megabytes of free storage


available.
PERCENTAGE_FREE INTEGER The percentage of free disk space remaining.

VT_EE_PROFILING
Note: This table is deprecated. Use EXECUTION_ENGINE_PROFILES (page 436)
instead.
Provides information regarding query execution runs. To obtain information about query
execution runs for your database, see Profiling Database Performance.

Column Name Data Type Description


TIMESTAMP VARCHA The Linux system time of query execution in a
R format that can be used as a Date/Time Expression.
NODE_NAME VARCHA The name of the node that is reporting the
R requested information.
SESSIONID VARCHA The identification of the session for which profiling
R information is captured. This identifier is unique
within the cluster at any point in time but can be
reused when the session closes.
TXNID INTEGER An identifier for the transaction within the session if
any; otherwise NULL.
STMTID INTEGER An ID for the currently executing statement. NULL
indicates that
no statement is currently being processed.

-473-
SQL Reference Manual

OPERATOR_NAME VARCHA The name of the user who started the session.
R
OPERATORID INTEGER The ID of the user who started the session.
COUNTER_NAME VARCHA The name of the counter. See COUNTER_NAME
R Values below.
COUNTER_VALUE INTEGER The value of the counter.

COUNTER_NAME Values
The value of COUNTER_NAME can be any of the following:

COUNTER_NAME Description
bytes sent The number of bytes sent over the network for the query
execution.
bytes received The number of bytes received over the network for the query
execution.
rows produced The number of rows produced by the EE operator.
executable time (ms) The time required to execute the query (in milliseconds).

VT_GRANT
Note: This table is deprecated. Use GRANTS (page 413) instead.
Provides grant information.

Column Name Data Type Description


GRANTEE_ID INTEGER The grantee object ID (OID) from the catalog

GRANTEE VARCHA The user being granted permission


R
GRANTOR_ID INTEGER The object ID from the catalog

GRANTOR VARCHA The user granting permission


R
PRIVILEGES INTEGER The bitmask representation of the privileges being
granted

-474-
SQL System Tables (Monitoring APIs)

PRIVILEGES_DESC VARCHA A readable description of the privileges being


R granted; for example INSERT, SELECT.
OBJECTID INTEGER The object ID from the catalog

SCHEMANAME VARCHA Name of the schema


R
TABLENAME VARCHA Name of the table
R

Notes
The vsql commands \dp and \z both include the schema name in the output:
$ \dp
Access privileges for database "dbadmin"
Grantee | Grantor | Privileges | Schema | Name
---------+---------+--------------------------------------------+---------+--------
user2 | dbadmin | INSERT, SELECT, UPDATE, DELETE, REFERENCES | schema1 | events
user1 | dbadmin | SELECT | schema1 | events
user2 | dbadmin | INSERT, SELECT, UPDATE, DELETE, REFERENCES | schema2 | events
user1 | dbadmin | INSERT, SELECT | schema2 | events
(4 rows)

Access privileges for database "dbadmin"


Grantee | Grantor | Privileges | Schema | Name
---------+---------+------------+--------+--------
| dbadmin | USAGE | | public
(1 row)
The vsql command \dp *.tablename displays table names in all schemas. This command lets
you distinguish the grants for same-named tables in different schemas:
$ \dp *.events;
Access privileges for database "dbadmin"
Grantee | Grantor | Privileges | Schema | Name
---------+---------+--------------------------------------------+---------+--------
user2 | dbadmin | INSERT, SELECT, UPDATE, DELETE, REFERENCES | schema1 | events
user1 | dbadmin | SELECT | schema1 | events
user2 | dbadmin | INSERT, SELECT, UPDATE, DELETE, REFERENCES | schema2 | events
user1 | dbadmin | INSERT, SELECT | schema2 | events
(4 rows)

The vsql command \dp schemaname.* displays all tables in the named schema:
$ \dp schema1.*
Access privileges for database "dbadmin"
Grantee | Grantor | Privileges | Schema | Name
---------+---------+--------------------------------------------+---------+--------
user2 | dbadmin | INSERT, SELECT, UPDATE, DELETE, REFERENCES | schema1 | events
user1 | dbadmin | SELECT | schema1 | events
(2 rows)

Call the VT_GRANT table:


SELECT * FROM vt_grant;

-475-
SQL Reference Manual

VT_LOAD_STREAMS
Note: This table is deprecated. Use LOAD_STREAMS (page 440) instead.
Monitors load metrics for each load stream on each node.

Column Name Date Type Description


TIMESTAMP VARCHAR The Linux system time of query execution in
a format that can be used as a Date/Time
Expression.
NODE_NAME VARCHAR The name of the node that is reporting the
requested information.
STREAM VARCHAR The optional identifier that names a stream,
if specified.
TABLE_NAME VARCHAR The name of the table being loaded.
LOAD_START_TIMESTAMP VARCHAR The Linux system time when the load
started.
ROWS_LOADED INTEGER The number of rows loaded.
ROWS_REJECTED INTEGER The number of rows rejected.
BYTES_READ INTEGER The number of bytes read from the input
file.
INPUT_FILE_SIZE INTEGER The size of the input file in bytes.
Note: When using STDIN as input size of
input file size is zero (0).
PERCENT_COMPLETE INTEGER The percent of the rows in the input file that
have been loaded. If using STDIN, this
column remains at zero (0) until the COPY
statement is complete.

Example
=> \pset expanded
Call the VT_LOAD_STREAMS table:
SELECT * FROM VT_LOAD_STREAMS;

VT_LOCK
Note: This table is deprecated. Use LOCKS (page 441) instead.
Monitors the locks in use.

-476-
SQL System Tables (Monitoring APIs)

Column Name Date Type Description


NODE_NAMES VARCHAR The nodes on which lock interaction occurs.
Note on node rollup: If a transaction has the
same lock in the same mode in the same scope on
multiple nodes, it gets one (1) line in the table.
NODE_NAMES are separated by commas.
OBJECT VARCHAR Name of object being locked; can be a TABLE or an
internal structure (projection, global catalog, local
catalog, Tuple Mover, epoch map).
OID INTEGER Object ID of of object being locked.
TRANSACTION VARCHAR ID of transaction and associated description, typically
the query that caused the transaction's creation.
MODE VARCHAR Lock mode describes the intended operations of the
transaction:
S — Share lock needed for select operations.
I — Insert lock needed for insert operations.
X — Exclusive lock is always needed for delete
operations. X lock is also the result of lock promotion
(see Table 2).
T — Tuple Mover lock used by the Tuple Mover and
also used for COPY into pre-join projections.
SCOPE VARCHAR Scope is the expected duration of the lock once it is
granted. Before the lock is granted, the scope is
listed as REQUESTED.
Once a lock has been granted, the following scopes
are possible:
STATEMENT_LOCALPLAN
STATEMENT_COMPILE
STATEMENT_EXECUTE
TRANSACTION_POSTCOMMIT
TRANSACTION
All scopes, other than TRANSACTION, are transient
and are used only as part of normal query
processing.
Notes
• Locks acquired on tables that were subsequently dropped by another transaction can result
in the message, Unknown or deleted object, appearing in the output's OBJECT column.
• Running a SELECT … from VT_LOCK can time out after five minutes. This situation occurs
occurs when the cluster has failed. Run the Diagnostics Utility and contact Technical
Support (on page 33).

-477-
SQL Reference Manual

The following two tables are from Transaction Processing: Concepts and Techniques
http://www.amazon.com/gp/product/1558601902/ref=s9sdps_c1_14_at1-rfc_p-frt_p-3237_g1_si1
?pf_rd_m=ATVPDKIKX0DER&pf_rd_s=center-1&pf_rd_r=1QHH6V589JEV0DR3DQ1D&pf_rd_t
=101&pf_rd_p=463383351&pf_rd_i=507846 by Jim Gray (Figure 7.11, p. 408 and Figure 8.6, p.
467).
Table 1: Compatibility matrix for granular locks
This table is for compatibility with other users. The table is symmetric.

Granted Mode

Requested Mode S I X T

S Yes No No Yes

I No Yes No Yes

X No No No No

T Yes Yes No Yes

The following two examples refer to Table 1:


• Example 1: If someone else has an S lock, you cannot get an I lock.
• Example 2: If someone has an I lock, you can get an I lock.

Table 2: Lock conversion matrix


This table is used for upgrading locks you already have. For example, If you have an S lock and
you want an I lock, you request an X lock. If you have an S lock and you want an S lock, no lock
requests is required.

Granted Mode

Requested Mode S I X T

S S X X S
I X I X I
X X X X X
T S I X T

See Also
DUMP_LOCKTABLE (page 270)

-478-
SQL System Tables (Monitoring APIs)

VT_NODE_INFO
Note: This table is deprecated. Use HOST_RESOURCES (page 437) OR
NODE_RESOURCES (page 444) instead.
Provides a snapshot of the node. This is useful for regularly polling the node with automated
tools or scripts.

Column Name Data Type Description


TIMESTAMP VARCHA The Linux system time of query
R execution in a format that can be used
as a Date/Time Expression.
NODE_NAME VARCHA The name of the node that is reporting
R the requested information.
NUM_OPEN_FILES_LIMIT INTEGER The maximum number of files that can
be open at one time on the node.
NUM_THREADS_LIMIT INTEGER The maximum number of threads that
can coexist on the node.
MAX_CORE_FILE_SIZE_LIMIT INTEGER The maximum core file size allowed on
the node.
NUM_PROCESSORS INTEGER The number of system processors.
NUM_PROCESSOR_CORES INTEGER The number of processor cores in the
system.
PROCESSOR_DESCRIPTION VARCHA A description of the processor. For
R example: Inter(R) Core(TM)2 Duo CPU
T8100 @2.10GHz (1 row)
NUM_OPENED_FILES INTEGER The total number of open files on the
node.
NUM_OPENED_SOCKETS INTEGER The total number of open sockets on the
node.
NUM_NONFILE_NONSOCKET_OPENED INTEGER The total number of other file
descriptions open in which other could
be a directory or FIFO. It is not an open
file or socket.
TOTAL_MEMORY INTEGER The total amount of physical RAM, in
kilobytes, available on the system.
TOTAL_MEMORY_FREE INTEGER The amount of physical RAM, in
kilobytes, left unused by the system.
TOTAL_BUFFER_MEMORY INTEGER The amount of physical RAM, in
kilobytes, used for file buffers on the
system
TOTAL_MEMORY_CACHE INTEGER The amount of physical RAM, in
kilobytes, used as cache memory on the

-479-
SQL Reference Manual

system.
TOTAL_SWAP_MEMORY INTEGER The total amount of swap memory
available, in kilobytes, on the system.
TOTAL_SWAP_MEMORY_FREE INTEGER The total amount of swap memory free,
in kilobytes, on the system.
PROCESS_SIZE INTEGER *The total size of the program (in pages).
PROCESS_RESIDENT_SET_SIZE INTEGER *The total number of pages that the
process has in memory.
PROCESS_SHARED_MEMORY_SIZE INTEGER *The amount of shared memory used (in
pages).
PROCESS_TEXT_MEMORY_SIZE INTEGER *The total number of text pages that the
process has in physical memory. This
does not include any shared libraries.
PROCESS_DATA_MEMORY_SIZE INTEGER *The amount of physical memory, in
pages, used for performing processes.
This does not include the executable
code.
PROCESS_LIBRARY_MEMORY_SIZE INTEGER *The total number of library pages that
the process has in physical memory.
PROCESS_DIRTY_MEMORY_SIZE INTEGER *The number of pages that have been
modified since they were last written to
disk.
MB_FREE_DISK_SPACE INTEGER The free disk space available (in
megabytes) for all storage location file
systems (data directories).
MB_USED_DISK_SPACE INTEGER The disk space used (in megabytes) for
all storage location file systems.
MB_TOTAL_DISK_SPACE INTEGER The total free disk space available (in
megabytes) for all storage location file
systems.

* Each page is 4096 bytes in size.

VT_PARTITIONS
Note: This table is deprecated. Use PARTITIONS (page 445) instead.
Displays partition metadata, one row per partition key, per ROS container.

-480-
SQL System Tables (Monitoring APIs)

Column Name Data Type Description


PARTITION_KEY VARCHAR The partition value
SCHEMANAME VARCHAR The name of the schema
PROJNAME VARCHAR The projection name
ROSID VARCHAR The object ID that uniquely references the ROS container
SIZE INTEGER The ROS container size in bytes
ROWCOUNT INTEGER Number of rows in the ROS container
LOCATION VARCHAR Site where the ROS container resides

Notes
• A many-to-many relationship exists between partitions and ROS containers. VT_PARTITION
displays information in a denormalized fashion.
• To find the number of ROS containers having data of a specific partition, you aggregate
VT_PARTITIONS over the partition_key column.
• To find the number of partitions stored in a ROS container, you aggregate VT_PARTITIONS
over the ros_id column.

Example
Projection 'p1' has three ROS containers, RC1, RC2 and RC3, with the values defined in the
following table:
RC1 RC2 RC3
----------------+-------------------+-------------------+-----------------
PARTITION_KEY (20,30,40) (20) (30,60)
ROS_ID 45035986273705000 45035986273705001 45035986273705002
SIZE 1000 20000 30000
ROWCOUNT 100 200 300
LOCATION e1 e1 e1
In this example, VT_PARTITIONS has six rows with the following values:
(20, 'p1', 45035986273705000, 10000, 100, 'e1')
(30, 'p1', 45035986273705000, 10000, 100, 'e1')
(40, 'p1', 45035986273705000, 10000, 100, 'e1')
(20, 'p1', 45035986273705001, 20000, 200, 'e1')
(30, 'p1', 45035986273705002, 30000, 300, 'e1')
(60, 'p1', 45035986273705002, 30000, 300, 'e1')

VT_PROJECTION
Note: This table is deprecated. Use PROJECTIONS (page 416) instead.
This meta data table provides information regarding projections.

-481-
SQL Reference Manual

Column Name Data Type Description


SCHEMAID OID A unique numeric ID (OID) that identifies the specific schema
that contains the projection.
SCHEMANAME VARCHAR The name of the schema that contains the projection.
PROJID OID A unique numeric ID (OID) that identifies the projection.
PROJNAME VARCHAR The name of the projection.
OWNERID OID A unique numeric ID (OID) that identifies the owner of the
projection.
OWNERNAME VARCHAR The name of the projection's owner.
ANCHORTABLEID OID The unique numeric identification (OID) of the anchor table, for
pre-join projections, or the OID of the table from which the
projection was created if it isn't a pre-join projection.
ANCHORTABLENAME VARCHAR The name of the anchor table, for pre-join projections, or the
name of the table from which the projection was created if it
isn't a pre-join projection.
SITEID OID A unique numeric ID (OID) that identifies the node, or nodes,
that contain the projection.
SITENAME VARCHAR The name of the node, or nodes, that contain the projection.
PREJOIN BOOLEAN Indicates whether or not the projection is a pre-join projection
where t is true and f is false.
CREATEEPOCH OID The epoch in which the projection was created.
VERIFIEDK INTEGER K-safety value for the projection.
UPTODATE BOOLEAN Indicates whether or not the projection is current where t is
true and f is false. Projections must be up-to-date to be used
in queries.

VT_PROJECTION_REFRESH
Note: This table is deprecated. Use PROJECTION_REFRESHES (page 446) instead.
Provides information about refresh operations for projections. Information regarding refresh
operations is maintained as follows:
• Information about a successful refresh is maintained until the refresh session is closed.
• Information about an unsuccessful refresh is maintained, whether or not the refresh session
is closed, until the projection is the target of another refresh operation.
• All refresh information for a node is lost when the node is shut down.

-482-
SQL System Tables (Monitoring APIs)

• After a refresh completes, the refreshed projections go into a single ROS container. If the
table was created with a PARTITION BY clause, then you should call PARTITION_TABLE()
or PARTITION_PROJECTION() to reorganize the data into multiple ROS containers, since
queries on projections with multiple ROS containers perform better than queries on
projections with a single ROS container.

Column Name Data Type Description


NODE_NAME VARCHAR The name of the node upon which the refresh operation is
running or ran.
PROJECTION_NAME VARCHAR The name of the projection that is targeted for refresh.
TABLE_NAME VARCHAR The name of the projection's anchor table.

STATUS VARCHAR The status of the projection:


queued--Indicates that a projection is queued for refresh.
refreshing--Indicates that a refresh for a projection is in
process.
refreshed--Indicates that a refresh for a projection has
successfully completed.
failed--Indicates that a refresh for a projection did not
successfully complete.
PHASE VARCHAR Indicates how far the refresh has progressed:
historical--Indicates that the refresh has reached the first
phase and is refreshing data from historical data. This refresh
phase requires the most amount of time.
current--Indicates that the refresh has reached the final phase
and is attempting to refresh data from the current epoch. To
complete this phase, refresh must be able to obtain a lock on
the table. If the table is locked by some other transaction,
refresh is put on hold until that transaction completes.
The LOCKs (page 441) system table is useful for
determining if a refresh has been blocked on a table lock.
To determine if a refresh has been blocked, locate the
term "refresh" in the transaction description. A refresh has
been blocked when the scope for the refresh is
REQUESTED and one or more other transactions have
acquired a lock on the table.
Note: This field is null until the projection starts to refresh.
METHOD VARCHAR The method used to refresh the projection:
buddy--Uses the contents of a buddy to refresh the projection.
This method maintains historical data. This enables the
projection to be used for historical queries.
scratch--Refreshes the projection without using a buddy. This
method does not generate historical data. This means that the
projection cannot participate in historical queries from any
point before the projection was refreshed.

-483-
SQL Reference Manual

FAILURE_COUNT INTEGER The number of times a refresh failed for the projection.
FAILURE_COUNT does not indicate whether or not the
projection was eventually refreshed successfully. See
STATUS to determine how the refresh operation is
progressing.
SESSION_ID VARCHAR A unique ID that identifies the refresh session.

START_TIME STRING The time the projection refresh started (provided as a time
stamp).
DURATION INTEGER The length of time that the projection refresh ran in seconds.

VT_PROJECTION_STORAGE
Note: This table is deprecated. Use PROJECTION_STORAGE (page 448) instead.
Monitors the amount of disk storage used by each projection on each node.

Column Name Data Type Description


TIMESTAMP VARCHAR The Linux system time of query execution in a format that can
be used as a Date/Time Expression (page 76).
NODE_NAME VARCHAR The name of the node that is reporting the requested
information.
PROJECTION_NAME VARCHAR The name of the projection.
NUM_COLUMNS INTEGER The number of columns in the projection.
NUM_ROWS INTEGER The number of rows in the projection.
NUM_BYTES INTEGER The number of bytes of disk storage used by the projection.
WOS_ROWS INTEGER The number of WOS rows in the projection.
WOS_BYTES INTEGER The number of WOS bytes in the projection.
ROS_ROWS INTEGER The number of ROS rows in the projection.
ROS_BYTES INTEGER The number of ROS bytes in the projection.
NUM_ROS INTEGER The number of ROS containers in the projection.
TABLE_NAME VARCHAR The associated table name.

VT_QUERY_METRICS
Note: This table is deprecated. Use QUERY_METRICS (page 449) instead.
Monitors the sessions and queries executing on each node.

-484-
SQL System Tables (Monitoring APIs)

Column Name Data Type Description


TIMESTAMP VARCHA The Linux system time of query execution in a
R format that can be used as a Date/Time
Expression (page 76).
NODE_NAME VARCHA The name of the node that is reporting the
R requested information.
ACTIVE_USER_SESSIONS INTEGER The number of active user sessions
(connections).
ACTIVE_SYS_SESSIONS INTEGER The number of active system sessions.
TOTAL_USER_SESSIONS INTEGER The total number of user sessions.
TOTAL_SYS_SESSIONS INTEGER The total number of system sessions.
TOTAL_ACTIVE_SESSIONS INTEGER The total number of active user and system
sessions.
TOTAL_SESSIONS INTEGER The total number of user and system sessions.
QUERIES_CURRENTLY_RUNNING INTEGER The number of queries currently running.
TOTAL_QUERIES_EXECUTED INTEGER Containing the total number of queries executed.

VT_QUERY_PROFILING
Note: This table is deprecated. Use QUERY_PROFILES (page 450) instead.
Provides information regarding executed queries. To obtain information about executed queries,
see Profiling Database Performance.

Column Name Data Type Description


NODE_NAME VARCHA The name of the node that is reporting the requested information.
R
SESSIONID VARCHA The identification of the session for which profiling information is
R captured. This identifier is unique within the cluster at any point in
time but can be reused when the session closes.
TXNID INTEGER An identifier for the transaction within the session if any;
otherwise NULL.
STMTID INTEGER An ID for the currently executing statement. NULL indicates that
no statement is currently being processed.
QUERY VARCHA The query string used for the query.
R
QRYSEARCHPATH VARCHA A list of schemas in which to look for tables.
R
PROJECTIONS VARCHA The projections used in the query.
R

-485-
SQL Reference Manual

DURATION INTEGER The duration of the query in microseconds.


TIMESTAMP VARCHA The Linux system time of query execution in a format that can be
R used as a Date/Time Expression.

VT_RESOURCE_REJECTIONS
Note: This table is deprecated. Use RESOURCE_REJECTIONS (page 452) instead.
Monitors requests for resources that are rejected by the resource manager.

Column Name Data Type Description


TIMESTAMP VARCHAR The Linux system time of query execution in a format
that can be used as a Date/Time Expression (page
76).
NODE_NAME VARCHAR The name of the node that is reporting the requested
information.
ACCUMULATION_START TIMESTAMP The time of first rejection for this requester type.
REQUESTER VARCHAR The requester type.
REJECTED INTEGER The total number of rejections for this requester type.
TIMEDOUT INTEGER The total number of timeouts for this requester type.
CANCELED INTEGER The total number of cancellations for this requester
type.
LAST_RQT_RJTD VARCHAR The last resource type rejected for this plan type.
LAST_REASON VARCHAR The reason for the last rejection of this plan type.
THREAD_RQTS INTEGER The total number of thread type rejections.
FILEHANDLE_RQTS INTEGER The total number of file handle type rejections.
MEM_RQTS INTEGER The total number of memory type rejections.
ADDR_SPACE_RQTS INTEGER The total number of address space type rejections.

Requester Types
• Plans (see Plan Types)
• WOS

Plan Types
• Load Query
• Load Query Direct

-486-
SQL System Tables (Monitoring APIs)

• Insert Query
• Insert Query Direct
• Delete Query
• Select Query
• TM_MOVEOUT
• TM_MERGEOUT
• TM_ANALYZE,
• TM_DIRECTLOAD
• TM_REDELETE_MOVE
• TM_REDELETE_MERGE
• RECOVER
• ROS_SPLIT
• TM_DVWOS_MOVEOUT
• REDELETE_RECOVER
• REFRESH_HISTORICAL
• REFRESH_CURRENT
• ROS_SPLIT_REDELETE_1
• ROS_SPLIT_REDELETE_2

Resource Types
• Number of running plans
• Number of running plans on initiator node (local)
• Number of requested Threads
• Number of requested File Handles
• Number of requested KB of Memory
• Number of requested KB of Address Space

Reasons for Rejection


• Usage of single request exceeds high limit
• Timed out waiting for resource reservation
• Canceled waiting for resource reservation

VT_RESOURCE_USAGE
Note: This table is deprecated. Use RESOURCE_USAGE (page 454) instead.
Monitors system resource management on each node.

Column Name Data Type Description


TIMESTAMP VARCHAR The Linux system time of query execution in a format that can
be used as a Date/Time Expression (page 76).
NODE_NAME VARCHAR The name of the node that is reporting the requested

-487-
SQL Reference Manual

information.
REQUESTS INTEGER The cumulative number of requests for threads, file handles,
and memory (in kilobytes).
LOCAL_REQUESTS INTEGER The cumulative number of local requests.
REQ_QUE_DEPTH INTEGER The current request queue depth.
ACTIVE_THREADS INTEGER The current number of active threads.
OPEN_FILE_HANDLES INTEGER The current number of open file handles.
KB_MEM_RQTD INTEGER The memory requested in kilobytes.
KB_ADDR_SPACE_RQTD INTEGER The address space requested in kilobytes.
WOS_BYTES INTEGER The size of the WOS in bytes.
WOS_ROWS INTEGER The number of rows in the WOS.
ROS_BYTES INTEGER The size of the ROS in bytes.
ROS_ROWS INTEGER The number of rows in the ROS.
BYTES INTEGER The total size of storage (WOS + ROS) in bytes.
ROWS INTEGER The total number of rows in storage (WOS + ROS).
RSC_RQT_REJECTED INTEGER The number of rejected plan requests.
RSC_RQT_TIMEOUTS INTEGER The number of resource request timeouts.
RSC_RQT_CANCELLED INTEGER The number of resource request cancellations.
DISK_SPACE_RQT_RJTD INTEGER The number of rejected disk write requests.
FAILED_VOL_RJTD INTEGER The number of rejections due to a failed volume.
TOKENS_USED INTEGER For internal use only.
TOKENS_AVAILABLE INTEGER For internal use only.

VT_SCHEMA
Provides information about the database schema.

Column Name Data Type Description


SCHEMAID INTEGER The schema ID from the catalog.
SCHEMANAME VARCHAR The name of the schema.
OWNERID INTEGER The owner ID from the catalog.
OWNERNAME VARCHAR The name of the user who created the schema.

Example
SELECT * FROM vt_schema;
schemaid | schemaname | ownerid | ownername

-488-
SQL System Tables (Monitoring APIs)

-------------------+--------------+-------------------+-----------
45035996273704963 | public | 45035996273704961 | release
45035996273704977 | store | 45035996273704961 | release
45035996273704978 | online_sales | 45035996273704961 | release
(3 rows)

VT_SESSION
Note: This table is deprecated. Use SESSIONS (page 458) instead.
Monitors external sessions. You can use this table to:
• Identify users who are running long queries
• Identify users who are holding locks due to an idle but uncommitted transaction
• Disconnect users in order to shut down the database
• Determine the details behind the type of database security (Secure Socket Layer (SSL) or
client authentication) used for a particular session.

Column Name Data Type Description


TIMESTAMP VARCHAR The Linux system time of query execution in a format that can be
used as a Date/Time Expression (page 76).
NODE_NAME VARCHAR The name of the node that is reporting the requested information.
USERNAME VARCHAR The name used to log into the database or NULL if the session is
internal
CLIENT VARCHAR The host name and port of the TCP socket from which the client
connection was made; NULL if the session is internal
LOGIN_TIME DATE The date and time the user logged into the database or when the
internal session was created. This can be useful for identifying
sessions that have been left open for a period of time and could be
idle.
SESSIONID VARCHAR The identifier required to close or interrupt a session. This identifier
is unique within the cluster at any point in time but can be reused
when the session closes.
TXN_START DATE The date/time the current transaction started or NULL if no
transaction is running.
TXNID VARCHAR A string containing the hexadecimal representation of the
transaction ID, if any; otherwise NULL.
TXN_DESCRIPTION VARCHAR A description of the current transaction
STMT_START DATE The date/time the current statement started execution, or NULL if
no statement is running.
STMTID VARCHAR An ID for the currently executing statement. NULL indicates that
no statement is currently being processed.
LAST_STMT_DURATI INTEGER The duration of the last completed statement in milliseconds.
ON

-489-
SQL Reference Manual

CURRENT_STMT VARCHAR The currently executing statement, if any. NULL otherwise.


LAST_STATEMENT VARCHAR NULL if the user has just logged in; otherwise the currently running
statement or the most recently completed statement.
SSL_STATE VARCHAR Indicates if Vertica used Secure Socket Layer (SSL) for a particular
session. Possible values are:
None – Vertica did not use SSL.
Server – Sever authentication was used, so the client could
authenticate the server.
Mutual – Both the server and the client authenticated one another
through mutual authentication.
See Vertica Security and Implementing SSL.
AUTHENTICATION_M VARCHAR The type of client authentication used for a particular session, if
ETHOD known. Possible values are:
Unknown
Trust
Reject
Kerberos
Password
MD5
LDAP
Kerberos-GSS
See Vertica Security and Implementing Client Authentication.

Notes
• The superuser has unrestricted access to all session information, but users can only view
information about their own, current sessions.
• During session initialization and termination, you might see sessions running only on nodes
other than the node on which you executed the virtual table query. This is a temporary
situation and corrects itself as soon as session initialization and termination completes.

VT_SESSION_PROFILING
Note: This table is deprecated. Use SESSION_PROFILES (page 457) instead.
Provides basic session parameters and lock time out data. To obtain information about sessions,
see Profiling Database Performance.

Column Name Data Type Description


TIMESTAMP VARCHA The Linux system time of query execution in a format that can be
R used as a Date/Time Expression.
NODE_NAME VARCHA The name of the node that is reporting the requested information.
R

-490-
SQL System Tables (Monitoring APIs)

USERNAME VARCHA The name used to log into the database or NULL if the session is
R internal.
CLIENT VARCHA The host name and port of the TCP socket from which the client
R connection was made; NULL if the session is internal.
LOGIN_TIME VARCHA The date and time the user logged into the database or when the
R internal session was created. This can be useful for identifying
sessions that have been left open for a period of time and could
be
idle.
LOGOUT_TIME VARCHA The date and time the user logged out of the database or when
R the
internal session was closed.
SESSIONID VARCHA The identification of the session for which profiling information is
R captured. This identifier is unique within the cluster at any point in
time but can be reused when the session closes.
NUM_STMTSUCESSFULLYEX INTEGE The number of successfully executed statements.
ECUTED R
NUM_STMTUNSUCCESSFULL INTEGE The number of unsuccessfully executed statements.
YEXECUTED R
NUM_LOCKGRANTS INTEGE The number of locks granted during the session.
R
NUM_DEADLOCKS INTEGE The number of deadlocks encountered during the session.
R
NUM_LOCKTIMEOUTS INTEGE The number of times a lock timed out during the session.
R
NUM_LOCKCANCELED INTEGE The number of times a lock was cancelled during the session.
R
NUM_LOCKREJECTIONS INTEGE The number of times a lock was rejected during a session.
R
NUM_LOCKERRORS INTEGE The number of lock errors encountered during the session.
R

VT_SYSTEM
Note: This table is deprecated. Use SYSTEM (page 461) instead.
Monitors the overall state of the database.

Column Name Data Type Description


TIMESTAMP VARCHAR The Linux system time of query execution in a format that can
be used as a Date/Time Expression (page 76).

-491-
SQL Reference Manual

CURRENT_EPOCH INTEGER The current epoch number.


AHM_EPOCH INTEGER The ahm epoch number.
LAST_GOOD_EPOCH INTEGER The smallest (min) of all the checkpoint epochs on the cluster.
REFRESH_EPOCH INTEGER The oldest of the refresh epochs of all the nodes in the cluster
DESIGNED_FAULT_TOLER INTEGER The designed or intended K-Safety level.
ANCE
NUM_NODES INTEGER The number of nodes in the cluster.
NUM_NODES_DOWN INTEGER The number of nodes in the cluster that are currently down.
CURRENT_FAULT_TOLERA INTEGER The number of node failures the cluster can tolerate before it
NCE shuts down automatically.
CATALOG_REV_NUM INTEGER The catalog version number.
WOS_BYTES INTEGER The WOS size in bytes (cluster-wide).
WOS_ROWS INTEGER The number of rows in WOS (cluster-wide).
ROS_BYTES INTEGER The ROS size in bytes (cluster-wide).
ROS_ROWS INTEGER The number of rows in ROS (cluster-wide).
BYTES INTEGER The total storage in bytes (WOS + ROS) (cluster-wide).
ROWS INTEGER The total number of rows (WOS + ROS) (cluster-wide).

VT_TABLE
Note: This table is deprecated. Use TABLES (page 419) instead.
Provides information about all tables in the database.

Column Name Data Type Description


SCHEMAID INTEGER The schema ID from the catalog.
SCHEMANAME VARCHAR The name of the schema.
TABLEID INTEGER The table ID from the catalog.
TABLENAME VARCHAR The name of the table.
OWNERID INTEGER The owner ID from the catalog.
OWNERNAME VARCHAR The name of the user who created the table.
SYSTABLE BOOLEAN Is 'f' for user-created tables, 't' for Vertica system tables"

Example
The following command returns information on all tables in the vmart schema:

-492-
SQL System Tables (Monitoring APIs)

SELECT * FROM VT_TABLE;

VT_TABLE_STORAGE
Note: This table is deprecated.
Monitors the amount of disk storage used by each table on each node.

Column Name Data Type Description


TIMESTAMP VARCHA The Linux system time of query execution in a format that can be
R used as a Date/Time Expression (page 76).
NODE_NAME VARCHA The name of the node that is reporting the requested information.
R
TABLE_NAME VARCHA The table name.
R
NUM_PROJECTIONS INTEGER The number of projections using columns of the table.
NUM_COLUMNS INTEGER The number of columns in the table.
NUM_ROWS INTEGER The number of rows in the table (cardinality).
NUM_BYTES INTEGER The number of bytes used to store the projections.

VT_TUPLE_MOVER
Note: This table is deprecated. Use TUPLE_MOVER_OPERATIONS (page 462) instead.
Monitors the status of the Tuple Mover on each node.

Column Name Data Type Description


TIMESTAMP VARCHAR The Linux system time of query execution in a format that can be used
as a Date/Time Expression (page 76).
NODE_NAME VARCHAR The name of the node that is reporting the requested information.
OPERATION VARCHAR One of the following operations:
Moveout
Mergeout

-493-
SQL Reference Manual

Analyze Statistics
STATUS VARCHAR Running or an empty string to indicate 'not running.'
PROJ VARCHAR The name of the projection being processed.
START_EPOCH INTEGER The first epoch of the mergeout operation. (Not applicable for other
operations.)
END_EPOCH INTEGER The last epoch of the mergeout operation (not applicable for other
operations).
NUM_MINI_ROS INTEGER The number of ROS containers.
SIZE INTEGER The size in bytes of all ROS containers in the mergeout operation. (Not
applicable for other operations.)
PLAN VARCHAR One of the following values:
Moveout
Mergeout
Analyze
Replay Delete

Notes
No output from VT_TUPLE_MOVER means that the Tuple Mover is not performing an operation.
See TUPLE_MOVER_OPERATIONS (page 462) for active example.

VT_VIEW
Note: This table is deprecated. Use VIEWS (page 422) instead.
Provides information about all views within the system. See Using Views for more information.

Column Name Data Type Description


SCHEMANAME VARCHAR The name of the schema that contains the view.
VIEWNAME VARCHAR The name of the view.
OWNERNAME VARCHAR The name of the user who created the view.
DEFINITION VARCHAR The query used to define the view.

VT_WOS_STORAGE
Note: This table is deprecated. Use WOS_CONTAINER_STORAGE (page 463) instead.
Monitors information about WOS storage, which is divided into regions. Each region allocates
blocks of a specific size to store rows.

Column Name Data Type Description


TIMESTAMP VARCHAR The Linux system time of query execution in a format that can

-494-
SQL System Tables (Monitoring APIs)

be used as a DATE/TIME expression.


NODE_NAME VARCHAR The node where the WOS data is stored.
TYPE VARCHAR Either system or user data.
ALLOCATOR_REGION VARCHAR The block size allocated by region in KB. The summary line
sums the amount of memory used by all regions.
VIRTUAL_SIZE INTEGER The amount of virtual memory in use by region in KB. Virtual
size is greater than or equal to allocated size, which is greater
than or equal to in-use size.
ALLOCATED_SIZE INTEGER The amount of physical memory in use by a particular region in
KB.
IN_USE_SIZE INTEGER The actual number of bytes of data stored by the region in KB.

Notes
• The WOS allocator can use large amounts of virtual memory without assigning physical
memory.
• To see the difference between virtual size and allocated size, look at the IN_USE_SIZE
column to see if the WOS is full. The summary line tells you the amount of memory used by
the WOS, which is typically capped at one quarter of physical memory per node.

Examples
SELECT node_name, type, allocator_region, virtual_size, allocated_size, in_use_size
FROM MONITORING.VT_WOS_STORAGE;
node_name | type | allocator_region | virtual_size | allocated_size | in_use_size
-----------+--------+------------------+--------------+----------------+-------------
site01 | user | Summary | 0 | 0 | 0
site01 | system | 16 KB Region | 102400 | 16 | 0
site01 | system | Summary | 102400 | 16 | 0
site02 | user | Summary | 0 | 0 | 0
site02 | system | Summary | 0 | 0 | 0
site03 | user | Summary | 0 | 0 | 0
site03 | system | Summary | 0 | 0 | 0
site04 | user | Summary | 0 | 0 | 0
site04 | system | Summary | 0 | 0 | 0
(9 rows)

-495-
C

Index CANCEL_DEPLOYMENT • 252, 259, 265, 302


CANCEL_REFRESH • 252
CASE Expressions • 65, 72, 73, 194, 195, 197,
A 198, 204, 388, 398
About the Documentation • 35 CAST • 51, 54, 56, 66, 71, 89, 114, 115, 147,
ABS • 177 153, 167, 170, 220, 234
ACOS • 177 CBRT • 179
ACTIVE_EVENTS • 418, 433, 478 CEILING (CEIL) • 179
ADD_DESIGN_TABLES • 242, 254 CHAR • 94
ADD_LOCATION • 243, 245, 291, 299, 300, Character Data Types • 66, 89, 94, 353, 356
313 CHARACTER_LENGTH • 200, 202, 214, 217
ADD_MONTHS • 141 CHR • 199, 203
ADVANCE_EPOCH • 244, 269, 271 CLEAR_DESIGN_SEGMENTATION_TABLE
AGE • 142 • 253, 311
Aggregate Expressions • 72, 112, 388 CLEAR_DESIGN_TABLES • 243, 253, 297
Aggregate Functions • 72, 112, 125, 129, 394 CLEAR_QUERY_REPOSITORY • 254
ALTER PROJECTION • 244, 271, 281, 282, CLIENT_ENCODING • 204
283, 318 CLOCK_TIMESTAMP • 143, 154, 161, 166
ALTER SCHEMA • 318, 349 CLOSE_ALL_SESSIONS • 254
ALTER TABLE • 250, 320, 324, 354, 357, 359 CLOSE_SESSION • 256
ALTER USER • 325 COALESCE • 193, 194, 195, 196, 197, 198
ALTER_LOCATION_USE • 244 Column length limits
Analytic Functions • 125, 389 fixed length • 49
ANALYZE_CONSTRAINTS • 245, 267, 296, variable length • 49
335, 340, 427 Column References • 74, 83, 112, 116, 120, 121,
ANALYZE_STATISTICS • 251, 278, 287, 296 122, 123, 124, 125, 182, 185, 346, 347, 348,
AND operator • 65 388, 394
ASCII • 199, 203 COLUMN_STORAGE • 269, 418, 435, 479
ASIN • 178 column-constraint • 320, 353, 354
ATAN • 178 column-definition • 320, 351, 353
ATAN2 • 178 COLUMNS • 418, 419
AVG • 112 column-value-predicate • 81, 392
Comments • 75
B COMMIT • 46, 326, 335
BETWEEN-predicate • 79, 392 Comparison Operators • 65, 81
Binary Data Types • 66, 89, 353, 356 Compound key • 323
Binary Operators • 62, 90, 91, 92 Concurrent connections
BIT_AND • 91, 92, 113 per cluster • 49
BIT_LENGTH • 200, 203, 214, 217 per node • 49
BIT_OR • 91, 92, 114 CONFIGURE_DEPLOYMENT • 258
BIT_XOR • 91, 92, 115 Constants • 55, 346, 348, 388
BITCOUNT • 92, 201, 228 COPY • 46, 92, 105, 106, 109, 239, 250, 327,
BITSTRING_TO_BINARY • 90, 92, 201, 228 377, 380
Boolean Data Type • 65, 66, 80, 93, 353, 356 COS • 180
Boolean Operators • 65, 80, 93, 392 COT • 180
Boolean-predicate • 65, 78, 80, 93, 392 COUNT • 116
BTRIM • 202, 216, 223, 230 COUNT(*) • 119
CREATE PROJECTION • 341, 365

-497-
SQL Reference Manual

CREATE SCHEMA • 319, 348 DISK_RESOURCE_REJECTIONS • 418, 440,


CREATE TABLE • 250, 271, 349, 351, 359, 482
379, 404 DISK_STORAGE • 418, 441, 483
CREATE TEMPORARY TABLE • 352, 356 DISPLAY_LICENSE • 267
CREATE USER • 360 DO_TM_TASK • 268, 269, 271, 274, 275, 293,
CREATE VIEW • 361 294, 318
CREATE_DESIGN • 259, 265, 301 DOUBLE PRECISION • 56
CREATE_DESIGN_CONFIGURATION • 260 DOUBLE PRECISION (FLOAT) • 69, 105, 108,
CREATE_DESIGN_CONTEXT • 260 153, 177, 186
CREATE_DESIGN_QUERIES_TABLE • 261, DROP PROJECTION • 288, 352, 365, 368
287, 298, 309 DROP SCHEMA • 319, 349, 365
CREATE_PROJECTION_DESIGN • 262 DROP TABLE • 352, 356, 359, 364, 365, 367,
CURRENT_DATABASE • 237 411
CURRENT_DATE • 78, 142, 144 DROP USER • 369
CURRENT_SCHEMA • 237 DROP VIEW • 361, 362, 369, 477
CURRENT_SESSION • 418, 437, 479 DROP_LOCATION • 269, 299
CURRENT_TIME • 78, 144 DROP_PARTITION • 269, 270, 274, 275, 293,
CURRENT_TIMESTAMP • 78, 144, 159, 165 294, 352
CURRENT_USER • 238, 239, 240 DUMP_CATALOG • 273
DUMP_LOCKTABLE • 274, 452, 488
D DUMP_PARTITION_KEYS • 269, 271, 274,
Data Type Coercion Operators (CAST) • 56, 66, 275, 293, 294, 352
89, 91, 92 DUMP_PROJECTION_PARTITION_KEYS •
Database size • 49 269, 272, 274, 275, 293, 294, 352
DATE • 59, 96, 148, 149 DUMP_TABLE_PARTITION_KEYS • 269,
Date/Time Constants • 58, 403 272, 274, 275, 293, 294, 353
Date/Time Data Types • 66, 96, 145, 353, 356
E
Date/Time Expressions • 58, 76, 153, 154, 155,
161, 433, 435, 437, 440, 441, 445, 447, 449, encoding-type • 341, 344
453, 457, 458, 462, 463, 466, 467, 470, 471, EVENT_CONFIGURATIONS • 418, 444
472, 482, 483, 494, 495, 496, 497, 499, 502, EXECUTION_ENGINE_PROFILES • 418, 440,
503, 504 445, 483
Date/Time Functions • 78, 140 EXP • 181
Date/Time Operators • 68 EXPLAIN • 371
DATE_PART • 145 EXPORT_CATALOG • 275
DATE_TRUNC • 147 EXPORT_DESIGN_CONFIGURATION • 276,
DATEDIFF • 148 277
DATESTYLE • 96, 279, 306, 401, 403 EXPORT_DESIGN_TABLES • 276, 277
Day of the Week Names • 59 EXPORT_STATISTICS • 278, 296
DECODE • 204 Expressions • 71, 87
DEGREES • 181 EXTRACT • 145, 153, 167
DELETE • 45, 358, 359, 363, 368, 377, 410,
411, 415
F
DEPLOY_DESIGN • 252, 259, 264, 300, 302, FALSE • 65
316 FIRST_VALUE / LAST_VALUE • 125, 126,
Deprecated System Tables • 477 128, 164, 165
Depth of nesting subqueries • 49 FLOOR • 181
DISABLE_DUPLICATE_KEY_ERROR • 247, FOREIGN_KEYS • 418, 420
250, 265, 296 Formatting Functions • 167

-498-
Index

FROM Clause • 81, 82, 83, 86, 119, 388, 390,


415
J
joined-table • 390, 391
G
join-predicate • 83, 341, 391, 392
GET_AHM_EPOCH • 278
GET_AHM_TIME • 278
K
GET_CURRENT_EPOCH • 279 Key size • 49
GET_DESIGN_SCRIPT • 263, 264, 279 Keywords • 51, 218
GET_LAST_GOOD_EPOCH • 280 Keywords and Reserved Words • 51
GET_NUM_ACCEPTED_ROWS • 280
GET_NUM_REJECTED_ROWS • 280 L
GET_PROJECTION_STATUS • 281, 283, 343, LAST_DAY • 155
365 LCOPY • 327, 380
GET_PROJECTIONS, LEAD / LAG • 126, 131
GET_TABLE_PROJECTIONS • 282, 343, LEAST • 207, 211
365 LEFT • 213
GETDATE • 154, 161 Length
GETUTCDATE • 154 Basic names • 49
GRANT (Schema) • 376, 382 Fixed-length column • 49
GRANT (Table) • 376, 377, 383 Length for a variable-length column • 49
GRANT (View) • 362, 376, 378 LENGTH • 92, 200, 203, 213, 217
GRANTS • 418, 421, 484 Length for a fixed-length column • 49
GREATEST • 205, 213 LIKE-predicate • 84, 392
GROUP BY Clause • 78, 94, 112, 388, 389, 394 LIMIT Clause • 388, 400, 401
Limits
H
Basic names • 49
HAS_TABLE_PRIVILEGE • 238 Columns per table • 49
HASH • 182, 185, 346, 347 Concurrent connections per cluster • 49
hash-segmentation-clause • 341, 342, 346 Connections per node, number • 49
HAVING Clause • 112, 388, 389, 397 Database size • 49
HEX_TO_BINARY • 90, 91, 92, 207 Depth of nesting subqueries • 49
HOST_RESOURCES • 418, 446, 489 Fixed-length column • 49
Key size • 49
I
Projections per database • 49
Identifiers • 55, 218 Row size • 49
IMPLEMENT_TEMP_DESIGN • 283, 359 Rows per load • 49
INET_ATON • 92, 207, 209, 235 Table size • 49
INET_NTOA • 92, 208 Tables per database • 49
INITCAP • 209 Variable-length column • 49
IN-predicate • 82, 392 LN • 183
INSERT • 379 LOAD_DATA_STATISTICS • 286, 296
INSTR • 210 LOAD_DESIGN_QUERIES • 261, 287, 309
INTEGER • 56, 107, 108 LOAD_STREAMS • 334, 419, 449, 486
INTERRUPT_STATEMENT • 253, 283 LOCAL_NODES • 419, 450
INTERVAL • 96, 101, 102, 149 LOCALTIME • 78, 156
Interval Values • 61, 103, 406 LOCALTIMESTAMP • 78, 157
ISFINITE • 155 LOCKS • 419, 450, 456, 467, 487, 493
ISNULL • 194, 197 LOG • 183
ISO 8601 • 58 LOWER • 214

-499-
SQL Reference Manual

LPAD • 215 OVERLAY • 217


LTRIM • 202, 215, 223, 230
P
M
Pacific Standard Time • 58
MAKE_AHM_NOW • 288, 304, 306 PARTITION_PROJECTION • 269, 272, 274,
MARK_DESIGN_KSAFE • 288, 289, 313, 314, 275, 292, 293, 294, 353
365 PARTITION_TABLE • 272, 274, 275, 292, 293,
Mathematical Functions • 110, 177 353
Mathematical Operators • 68, 388 PARTITIONS • 419, 454, 490
MAX • 91, 92, 120 PI • 185
MD5 • 216 POSITION • 218, 225
MEASURE_LOCATION_PERFORMANCE • POWER • 185
290, 312, 313 Predicates • 79
MERGE_PARTITIONS • 272, 291 Preface • 43
MIN • 91, 92, 120 PRIMARY_KEYS • 418, 423
MOD • 184 Printing Full Books • 36
MODULARHASH • 182, 183, 184, 346, 347 PROJECTION_REFRESHES • 313, 314, 419,
Month Names • 60 453, 455, 492
MONTHS_BETWEEN • 157 PROJECTION_STORAGE • 269, 271, 419, 457,
Multi-column key • 323 494
PROJECTIONS • 313, 314, 418, 419, 424, 491
N Projections per database • 49
NaN • 56 PST • 58
NODE_RESOURCES • 348, 419, 453, 489 PURGE • 294, 303
NOW • 77, 159 PURGE_PROJECTION • 294
NULL • 65, 78 PURGE_TABLE • 294, 295
NULL Operators • 70
Q
NULL Value • 78, 86, 87
NULL-handling Functions • 193 QUERY_METRICS • 419, 458, 494
NULLIF • 195 QUERY_PROFILES • 419, 440, 459, 495
NULL-predicate • 80, 81, 86, 392 QUOTE_IDENT • 218
Number of columns per table • 49 QUOTE_LITERAL • 219
Number of connections per node • 49 Quoted identifiers • 55
Number of rows per load • 49
NUMERIC • 107
R
Numeric Constants • 56, 61 RADIANS • 186
Numeric Data Types • 66, 103, 353, 356 RANDOM • 186
Numeric Expressions • 56, 78 RANDOMINT • 187
NVL • 194, 195, 196 range-segmentation-clause • 341, 342, 347
NVL2 • 197 RANK / DENSE_RANK • 126, 135
READ_DATA_STATISTICS • 278, 287, 295
O
Reading the Online Documentation • 35
OCTET_LENGTH • 200, 203, 214, 216 REENABLE_DUPLICATE_KEY_ERROR •
OFFSET Clause • 388, 400, 401 247, 265, 267, 296
Operators • 62 RELEASE SAVEPOINT • 47, 380, 386, 387
OR operator • 65 REMOVE_DEPLOYMENT_ENTRY • 297
ORDER BY Clause • 112, 388, 389, 398, 400, REMOVE_DESIGN • 254, 260, 296, 297
401 REMOVE_DESIGN_CONTEXT • 254, 261,
OVERLAPS • 159 297

-500-
Index

REPEAT • 92, 220 SET_DESIGN_QUERY_CLUSTER_LEVEL •


REPLACE • 221 309
Reserved Words • 54 SET_DESIGN_SEGMENTATION_COLUMN •
RESET_DESIGN_QUERIES_TABLE • 261, 253, 310
298, 309 SET_DESIGN_SEGMENTATION_TABLE •
RESOURCE_REJECTIONS • 419, 461, 496 242, 243, 311
RESOURCE_USAGE • 419, 463, 497 SET_DESIGN_TABLE_ROWS • 242, 243, 311
RESTORE_LOCATION • 298, 299, 300 SET_LOCATION_PERFORMANCE • 312
RETIRE_LOCATION • 243, 245, 270, 291, 299 SHOW • 403, 406, 409
REVERT_DEPLOYMENT • 252, 259, 265, 300, SHOW SEARCH_PATH • 409
302 SIGN • 189
REVOKE (Schema) • 382 SIN • 189
REVOKE (Table) • 383 SPLIT_PART • 223
REVOKE (View) • 362, 383 SQL Data Types • 89, 340
RIGHT • 222 SQL Functions • 111, 388
ROLLBACK • 46, 47, 335, 385 SQL Language Elements • 51
ROLLBACK TO SAVEPOINT • 47, 381, 385, SQL Overview • 45
387 SQL Statements • 317
ROUND • 187 SQL System Tables (Monitoring APIs) • 290,
Row size • 49 417, 477
ROW_NUMBER • 126, 138 SQRT • 189
RPAD • 222 START_REFRESH • 253, 313, 343
RTRIM • 202, 216, 223, 230 STATEMENT_TIMESTAMP • 143, 160, 166
RUN_DEPLOYMENT • 252, 259, 265, 300, Statistical analysis • 121, 122, 124, 125
301, 316 STDDEV • 121
STDDEV_POP • 121
S STDDEV_SAMP • 121, 122
SAVE_DESIGN_VERSION • 302 STORAGE_CONTAINERS • 419, 469
SAVE_QUERY_REPOSITORY • 302 String Concatenation Operators • 70, 388
SAVEPOINT • 47, 381, 386 String Constants (Dollar-Quoted) • 57
Search Conditions • 86 String Constants (Standard) • 57, 239
SEARCH_PATH • 349, 404 String Functions • 199
SELECT • 112, 361, 362, 363, 379, 388, 414 STRPOS • 218, 225
SELECT CURRENT_SCHEMA • 302 SUBSTR • 213, 222, 225, 227
SESSION CHARACTERISTICS • 46, 379, 401, SUBSTRING • 92, 226
405 Suggested Reading Paths • 35, 36
SESSION_PROFILES • 419, 440, 453, 466, 500 SUM • 107, 122
SESSION_USER • 238, 239 SUM_FLOAT • 107, 123
SESSIONS • 254, 256, 258, 419, 467, 499 SYNC_CURRENT_DESIGN • 314, 316
SET • 401 SYSDATE • 154, 161
SET_AHM_EPOCH • 288, 303, 306 SYSTEM • 269, 271, 290, 304, 419, 470, 501
SET_AHM_TIME • 288, 303, 304, 305 System Information Functions • 237
SET_DESIGN_KSAFETY • 306 System Limits • 49
SET_DESIGN_LOG_FILE • 307 SYSTEM_TABLES • 418, 431
SET_DESIGN_LOG_LEVEL • 307
T
SET_DESIGN_PARAMETER • 308
SET_DESIGN_QUERIES_TABLE • 261, 287, Table size • 49
298, 308 TABLE_CONSTRAINTS • 418, 426
table-constraint • 320, 321, 323

-501-
SQL Reference Manual

table-primary • 390, 391 USERS • 418, 429


table-reference • 390 UTC • 58
TABLES • 418, 427, 502
Tables per database • 49
V
TAN • 190 V_CATALOG Schema • 417, 419
Technical Support • 33, 35, 273, 274, 371, 451, V_MONITOR Schema • 417, 432
488 V6_ATON • 92, 231, 233
TEMP_DESIGN_SCRIPT • 314 V6_NTOA • 92, 232
Template Pattern Modifiers for Date/Time V6_SUBNETA • 92, 233, 234
Formatting • 170, 171, 172, 175, 177, 330 V6_SUBNETN • 92, 233, 234
Template Patterns for Date/Time Formatting • V6_TYPE • 92, 234
140, 167, 169, 170, 172, 330 VAR_POP • 124
Template Patterns for Numeric Formatting • 167, VAR_SAMP • 124, 125
169, 170, 172, 175 VARBINARY • 89
TIME • 59, 97, 149 VARIANCE • 125
TIME AT TIME ZONE • 97, 99, 100, 101 VERSION • 240
TIME ZONE • 401, 406 Vertica Functions • 45, 242
Time Zone Names for Setting TIME ZONE • VIEW_COLUMNS • 418, 430
406, 407 VIEWS • 418, 431, 504
Time Zone Values • 58 VT_ACTIVE_EVENTS • 478
TIME_SLICE • 131, 161 VT_COLUMN_STORAGE • 272, 479
TIMEOFDAY • 165 VT_CURRENT_SESSION • 479
TIMESTAMP • 99, 148, 149, 279, 305, 306 VT_DISK_RESOURCE_REJECTIONS • 482
TO_BITSTRING • 92, 227 VT_DISK_STORAGE • 483
TO_CHAR • 167 VT_EE_PROFILING • 482, 483
TO_DATE • 169 VT_GRANT • 484
TO_HEX • 91, 92, 207, 228 VT_LOAD_STREAMS • 280, 281, 486
TO_NUMBER • 171 VT_LOCK • 487
TO_TIMESTAMP • 170 VT_NODE_INFO • 489
TRANSACTION_TIMESTAMP • 143, 160, VT_PARTITIONS • 490
161, 165 VT_PROJECTION • 272, 491
TRANSLATE • 229 VT_PROJECTION_REFRESH • 253, 492
TRIM • 202, 216, 223, 229 VT_PROJECTION_STORAGE • 494
TRUE • 65 VT_QUERY_METRICS • 494
TRUNC • 147, 190 VT_QUERY_PROFILING • 482, 495
TRUNCATE TABLE • 358, 364, 368, 410 VT_RESOURCE_REJECTIONS • 496
TUPLE_MOVER_OPERATIONS • 419, 471, VT_RESOURCE_USAGE • 497
503, 504 VT_SCHEMA • 498
TYPES • 418, 428 VT_SESSION • 253, 499
Typographical Conventions • 40 VT_SESSION_PROFILING • 482, 500
VT_SYSTEM • 501
U
VT_TABLE • 502
UNION • 388, 411 VT_TABLE_STORAGE • 503
Unquoted identifiers • 55 VT_TUPLE_MOVER • 503
UPDATE • 45, 363, 377, 415 VT_VIEW • 504
UPDATE_DESIGN • 259, 260, 265, 301, 314, VT_WOS_STORAGE • 505
315
UPPER • 230
W
USER • 238, 239, 240 WAIT_DEPLOYMENT • 316

-502-
Index

WHERE Clause • 363, 377, 388, 389, 392, 397,


415
Where to Find Additional Information • 39
Where to Find the Vertica Documentation • 35
WIDTH_BUCKET • 191
WOS_CONTAINER_STORAGE • 419, 472,
505
Z
Zulu • 58

-503-

You might also like