CS 200: Introduction to Python¶

This notebook mirrors the google python course introduction.

How to read this document

Well, first you should read it. OK? But you should also launch a Python window and type in the expressions. Learning a programming language is like learning to play a musical instrument. You don't learn to play the piano by just watching Horowitz or Billy Joel. You have to put your own hands on the keyboard.

That's what you need to do now. Put your hands on the keyboard.

Python has Types but No Type Declarations¶

Unlike C, Python variables (and functions) do not have type declarations. That is, you do not need to specify that x is an integer or a string or an array. The type(object) tells you the type of an object.

In [1]:
a = 9
In [2]:
a
Out[2]:
9
In [3]:
type(a)
Out[3]:
int
In [4]:
a = 'a string'
In [5]:
type(a)
Out[5]:
str
In [6]:
type(3.3)
Out[6]:
float
In [7]:
type(float)
Out[7]:
type

Operator Overloading¶

Python supports operator overloading. That is, the same operator can mean different things depending on the types of its operands. This is a key concept in object oriented programming, which is well-supported by Python.

The + operator is overloaded.

int + int => addition

string + string => concatenation

int + string -> ERROR

In [8]:
a = 9 + 3
In [9]:
a
Out[9]:
12
In [10]:
a = "hello" + " world!"
In [11]:
a
Out[11]:
'hello world!'
In [1]:
3 + "hello"
---------------------------------------------------------------------------
TypeError                                 Traceback (most recent call last)
Input In [1], in <cell line: 1>()
----> 1 3 + "hello"

TypeError: unsupported operand type(s) for +: 'int' and 'str'

The * operator is overloaded as well.

int * int => multiplication

int * string => replication

string * int => replication

In [2]:
b = 3 * 4
In [3]:
b
Out[3]:
12
In [4]:
b = 'hello ' * 3
In [5]:
b
Out[5]:
'hello hello hello '
In [6]:
4 * "good-bye "
Out[6]:
'good-bye good-bye good-bye good-bye '

Python Filename extensions

Python source code typically has the filename extension: ".py"

Compiled byte code typically has the filename extension ".pyc"

When you give the command import filename, python looks for the compiled file filename.pyc, but if it is not found, python loads the source code file, filename.py, and saves the bytecode file in the directory \_\_pycache\_\_ in the current working directory.

Executing Python files from the command line¶

Following the standard UNIX convention, if the first two characters of a python file are "#!" ("shuh-bang") the remainder of the line is interpreted as the name of the program to be invoked in executing the remainder of the file. For example, if the first line is

#! /usr/bin/python

The python interpreter with that fully-qualified pathname will be executed on the current file. See collatz.py

Comment character¶

# is the comment character in python. Everything to the right of the hash is ignored, unless the hash appears inside a string.

In [7]:
# b = 'something else'
In [8]:
b
Out[8]:
'hello hello hello '
In [9]:
b = '#something else'
In [10]:
b
Out[10]:
'#something else'

dir(module)¶

dir(module) gives you the properties, methods, and other objects inside the given module.

dir() lists the objects in the top-level "main" environment.

In [11]:
dir()
Out[11]:
['In',
 'Out',
 '_',
 '_10',
 '_3',
 '_5',
 '_6',
 '_8',
 '__',
 '___',
 '__builtin__',
 '__builtins__',
 '__doc__',
 '__loader__',
 '__name__',
 '__package__',
 '__spec__',
 '_dh',
 '_i',
 '_i1',
 '_i10',
 '_i11',
 '_i2',
 '_i3',
 '_i4',
 '_i5',
 '_i6',
 '_i7',
 '_i8',
 '_i9',
 '_ih',
 '_ii',
 '_iii',
 '_oh',
 'b',
 'exit',
 'get_ipython',
 'quit',
 'register_readline_completion',
 'sys']
In [12]:
__name__
Out[12]:
'__main__'

__name__ is the variable which holds the name string for the given module. Here is an example with the collatz module.

In [13]:
import collatz
In [14]:
dir(collatz)
Out[14]:
['__builtins__',
 '__cached__',
 '__doc__',
 '__file__',
 '__loader__',
 '__name__',
 '__package__',
 '__spec__',
 'collatz',
 'collatzg',
 'cseries']
In [15]:
collatz.__name__
Out[15]:
'collatz'
In [16]:
collatz.__file__
Out[16]:
'/home/httpd/html/zoo/classes/cs200/lectures/collatz.py'

command line arguments¶

collatz.py included the following code:

if __name__ == '__main__':
    import sys
    print (sys.argv)
    if len(sys.argv) > 1:
        for n in sys.argv[1:]:
            cseries(int(n))

We import the sys module, which has the argv property which is a list of the command line arguments.

(Note that Jupyter notebooks do not support command line arguments.)

function definition¶

Use the def keyword to define functions in Python. Use indentation to specify blocks of code. You do not use parens or curly brackets.

In [17]:
def add1(n):
    return n + 1
In [18]:
add1(9)
Out[18]:
10
In [19]:
def bad1(n):
    return x+1

Note that there is an error in the definition. However, the error is not caught at compile time. When you run it though, the error throws a NameError exception.

In [20]:
bad1(3)
---------------------------------------------------------------------------
NameError                                 Traceback (most recent call last)
Input In [20], in <cell line: 1>()
----> 1 bad1(3)

Input In [19], in bad1(n)
      1 def bad1(n):
----> 2     return x+1

NameError: name 'x' is not defined
In [21]:
x = 3
In [22]:
bad1(4)
Out[22]:
4
In [23]:
x
Out[23]:
3

Syntax errors versus Runtime errors¶

Some errors are detected at compile time, namely, syntax errors. If you enter an illegal Python expression, Python will complain.

In [24]:
l l  = 8
  Input In [24]
    l l  = 8
      ^
SyntaxError: invalid syntax
In [25]:
l = 8 8 8
  Input In [25]
    l = 8 8 8
          ^
SyntaxError: invalid syntax
In [26]:
x =-=-- 9
  Input In [26]
    x =-=-- 9
       ^
SyntaxError: invalid syntax
In [27]:
def add1(n)::
        return n+1
  Input In [27]
    def add1(n)::
                ^
SyntaxError: invalid syntax

Python PEP documents and modules¶

Python is a crowdsourced language. There are lots of developers. When someone wants to modify the language, they create a "Python Enhancement Proposal" or PEP, which are given sequential positive integers. PEP8 (https://www.python.org/dev/peps/pep-0008/) is the Style Guide for Python Code. Check it out.

Developers not only modify the Python language, they also create boatloads of useful modules. The standard modules are in the library: (https://docs.python.org/3/library/)

Online help¶

There are lots of ways to get help for python.

google. e.g., search for python string lowercase

The python website: (https://docs.python.org/3/)

StackOverflow: (https://stackoverflow.com/questions/tagged/python)

as well as the help() and dir() functions inside python

In [28]:
help(len)
Help on built-in function len in module builtins:

len(obj, /)
    Return the number of items in a container.

In [29]:
len('hello')
Out[29]:
5
In [30]:
len(range(10))
Out[30]:
10
In [31]:
help(help)
Help on _Helper in module _sitebuiltins object:

class _Helper(builtins.object)
 |  Define the builtin 'help'.
 |  
 |  This is a wrapper around pydoc.help that provides a helpful message
 |  when 'help' is typed at the Python interactive prompt.
 |  
 |  Calling help() at the Python prompt starts an interactive help session.
 |  Calling help(thing) prints help for the python object 'thing'.
 |  
 |  Methods defined here:
 |  
 |  __call__(self, *args, **kwds)
 |      Call self as a function.
 |  
 |  __repr__(self)
 |      Return repr(self).
 |  
 |  ----------------------------------------------------------------------
 |  Data descriptors defined here:
 |  
 |  __dict__
 |      dictionary for instance variables (if defined)
 |  
 |  __weakref__
 |      list of weak references to the object (if defined)

In [32]:
help(sys)
Help on built-in module sys:

NAME
    sys

MODULE REFERENCE
    https://docs.python.org/3.10/library/sys.html
    
    The following documentation is automatically generated from the Python
    source files.  It may be incomplete, incorrect or include features that
    are considered implementation detail and may vary between Python
    implementations.  When in doubt, consult the module reference at the
    location listed above.

DESCRIPTION
    This module provides access to some objects used or maintained by the
    interpreter and to functions that interact strongly with the interpreter.
    
    Dynamic objects:
    
    argv -- command line arguments; argv[0] is the script pathname if known
    path -- module search path; path[0] is the script directory, else ''
    modules -- dictionary of loaded modules
    
    displayhook -- called to show results in an interactive session
    excepthook -- called to handle any uncaught exception other than SystemExit
      To customize printing in an interactive session or to install a custom
      top-level exception handler, assign other functions to replace these.
    
    stdin -- standard input file object; used by input()
    stdout -- standard output file object; used by print()
    stderr -- standard error object; used for error messages
      By assigning other file objects (or objects that behave like files)
      to these, it is possible to redirect all of the interpreter's I/O.
    
    last_type -- type of last uncaught exception
    last_value -- value of last uncaught exception
    last_traceback -- traceback of last uncaught exception
      These three are only available in an interactive session after a
      traceback has been printed.
    
    Static objects:
    
    builtin_module_names -- tuple of module names built into this interpreter
    copyright -- copyright notice pertaining to this interpreter
    exec_prefix -- prefix used to find the machine-specific Python library
    executable -- absolute path of the executable binary of the Python interpreter
    float_info -- a named tuple with information about the float implementation.
    float_repr_style -- string indicating the style of repr() output for floats
    hash_info -- a named tuple with information about the hash algorithm.
    hexversion -- version information encoded as a single integer
    implementation -- Python implementation information.
    int_info -- a named tuple with information about the int implementation.
    maxsize -- the largest supported length of containers.
    maxunicode -- the value of the largest Unicode code point
    platform -- platform identifier
    prefix -- prefix used to find the Python library
    thread_info -- a named tuple with information about the thread implementation.
    version -- the version of this interpreter as a string
    version_info -- version information as a named tuple
    __stdin__ -- the original stdin; don't touch!
    __stdout__ -- the original stdout; don't touch!
    __stderr__ -- the original stderr; don't touch!
    __displayhook__ -- the original displayhook; don't touch!
    __excepthook__ -- the original excepthook; don't touch!
    
    Functions:
    
    displayhook() -- print an object to the screen, and save it in builtins._
    excepthook() -- print an exception and its traceback to sys.stderr
    exc_info() -- return thread-safe information about the current exception
    exit() -- exit the interpreter by raising SystemExit
    getdlopenflags() -- returns flags to be used for dlopen() calls
    getprofile() -- get the global profiling function
    getrefcount() -- return the reference count for an object (plus one :-)
    getrecursionlimit() -- return the max recursion depth for the interpreter
    getsizeof() -- return the size of an object in bytes
    gettrace() -- get the global debug tracing function
    setdlopenflags() -- set the flags to be used for dlopen() calls
    setprofile() -- set the global profiling function
    setrecursionlimit() -- set the max recursion depth for the interpreter
    settrace() -- set the global debug tracing function

FUNCTIONS
    __breakpointhook__ = breakpointhook(...)
        breakpointhook(*args, **kws)
        
        This hook function is called by built-in breakpoint().
    
    __displayhook__ = displayhook(object, /)
        Print an object to sys.stdout and also save it in builtins._
    
    __excepthook__ = excepthook(exctype, value, traceback, /)
        Handle an exception by displaying it with a traceback on sys.stderr.
    
    __unraisablehook__ = unraisablehook(unraisable, /)
        Handle an unraisable exception.
        
        The unraisable argument has the following attributes:
        
        * exc_type: Exception type.
        * exc_value: Exception value, can be None.
        * exc_traceback: Exception traceback, can be None.
        * err_msg: Error message, can be None.
        * object: Object causing the exception, can be None.
    
    addaudithook(hook)
        Adds a new audit hook callback.
    
    audit(...)
        audit(event, *args)
        
        Passes the event to any audit hooks that are attached.
    
    breakpointhook(...)
        breakpointhook(*args, **kws)
        
        This hook function is called by built-in breakpoint().
    
    call_tracing(func, args, /)
        Call func(*args), while tracing is enabled.
        
        The tracing state is saved, and restored afterwards.  This is intended
        to be called from a debugger from a checkpoint, to recursively debug
        some other code.
    
    exc_info()
        Return current exception information: (type, value, traceback).
        
        Return information about the most recent exception caught by an except
        clause in the current stack frame or in an older stack frame.
    
    exit(status=None, /)
        Exit the interpreter by raising SystemExit(status).
        
        If the status is omitted or None, it defaults to zero (i.e., success).
        If the status is an integer, it will be used as the system exit status.
        If it is another kind of object, it will be printed and the system
        exit status will be one (i.e., failure).
    
    get_asyncgen_hooks()
        Return the installed asynchronous generators hooks.
        
        This returns a namedtuple of the form (firstiter, finalizer).
    
    get_coroutine_origin_tracking_depth()
        Check status of origin tracking for coroutine objects in this thread.
    
    getallocatedblocks()
        Return the number of memory blocks currently allocated.
    
    getdefaultencoding()
        Return the current default encoding used by the Unicode implementation.
    
    getdlopenflags()
        Return the current value of the flags that are used for dlopen calls.
        
        The flag constants are defined in the os module.
    
    getfilesystemencodeerrors()
        Return the error mode used Unicode to OS filename conversion.
    
    getfilesystemencoding()
        Return the encoding used to convert Unicode filenames to OS filenames.
    
    getprofile()
        Return the profiling function set with sys.setprofile.
        
        See the profiler chapter in the library manual.
    
    getrecursionlimit()
        Return the current value of the recursion limit.
        
        The recursion limit is the maximum depth of the Python interpreter
        stack.  This limit prevents infinite recursion from causing an overflow
        of the C stack and crashing Python.
    
    getrefcount(object, /)
        Return the reference count of object.
        
        The count returned is generally one higher than you might expect,
        because it includes the (temporary) reference as an argument to
        getrefcount().
    
    getsizeof(...)
        getsizeof(object [, default]) -> int
        
        Return the size of object in bytes.
    
    getswitchinterval()
        Return the current thread switch interval; see sys.setswitchinterval().
    
    gettrace()
        Return the global debug tracing function set with sys.settrace.
        
        See the debugger chapter in the library manual.
    
    intern(string, /)
        ``Intern'' the given string.
        
        This enters the string in the (global) table of interned strings whose
        purpose is to speed up dictionary lookups. Return the string itself or
        the previously interned string object with the same value.
    
    is_finalizing()
        Return True if Python is exiting.
    
    set_asyncgen_hooks(...)
        set_asyncgen_hooks(* [, firstiter] [, finalizer])
        
        Set a finalizer for async generators objects.
    
    set_coroutine_origin_tracking_depth(depth)
        Enable or disable origin tracking for coroutine objects in this thread.
        
        Coroutine objects will track 'depth' frames of traceback information
        about where they came from, available in their cr_origin attribute.
        
        Set a depth of 0 to disable.
    
    setdlopenflags(flags, /)
        Set the flags used by the interpreter for dlopen calls.
        
        This is used, for example, when the interpreter loads extension
        modules. Among other things, this will enable a lazy resolving of
        symbols when importing a module, if called as sys.setdlopenflags(0).
        To share symbols across extension modules, call as
        sys.setdlopenflags(os.RTLD_GLOBAL).  Symbolic names for the flag
        modules can be found in the os module (RTLD_xxx constants, e.g.
        os.RTLD_LAZY).
    
    setprofile(...)
        setprofile(function)
        
        Set the profiling function.  It will be called on each function call
        and return.  See the profiler chapter in the library manual.
    
    setrecursionlimit(limit, /)
        Set the maximum depth of the Python interpreter stack to n.
        
        This limit prevents infinite recursion from causing an overflow of the C
        stack and crashing Python.  The highest possible limit is platform-
        dependent.
    
    setswitchinterval(interval, /)
        Set the ideal thread switching delay inside the Python interpreter.
        
        The actual frequency of switching threads can be lower if the
        interpreter executes long sequences of uninterruptible code
        (this is implementation-specific and workload-dependent).
        
        The parameter must represent the desired switching delay in seconds
        A typical value is 0.005 (5 milliseconds).
    
    settrace(...)
        settrace(function)
        
        Set the global debug tracing function.  It will be called on each
        function call.  See the debugger chapter in the library manual.
    
    unraisablehook(unraisable, /)
        Handle an unraisable exception.
        
        The unraisable argument has the following attributes:
        
        * exc_type: Exception type.
        * exc_value: Exception value, can be None.
        * exc_traceback: Exception traceback, can be None.
        * err_msg: Error message, can be None.
        * object: Object causing the exception, can be None.

DATA
    __stderr__ = <_io.TextIOWrapper name='<stderr>' mode='w' encoding='utf...
    __stdin__ = <_io.TextIOWrapper name='<stdin>' mode='r' encoding='utf-8...
    __stdout__ = <_io.TextIOWrapper name='<stdout>' mode='w' encoding='utf...
    abiflags = ''
    api_version = 1013
    argv = ['/usr/lib/python3.10/site-packages/ipykernel_launcher.py', '-f...
    base_exec_prefix = '/usr'
    base_prefix = '/usr'
    builtin_module_names = ('_abc', '_ast', '_codecs', '_collections', '_f...
    byteorder = 'little'
    copyright = 'Copyright (c) 2001-2022 Python Software Foundati...ematis...
    displayhook = <ipykernel.displayhook.ZMQShellDisplayHook object>
    dont_write_bytecode = False
    exec_prefix = '/usr'
    executable = '/usr/bin/python3'
    flags = sys.flags(debug=0, inspect=0, interactive=0, opt...mode=False,...
    float_info = sys.float_info(max=1.7976931348623157e+308, max_...epsilo...
    float_repr_style = 'short'
    hash_info = sys.hash_info(width=64, modulus=2305843009213693...iphash2...
    hexversion = 50988528
    implementation = namespace(name='cpython', cache_tag='cpython-310...xv...
    int_info = sys.int_info(bits_per_digit=30, sizeof_digit=4)
    last_value = SyntaxError('invalid syntax', ('/tmp/ipykernel_4.../53649...
    maxsize = 9223372036854775807
    maxunicode = 1114111
    meta_path = [<class '_frozen_importlib.BuiltinImporter'>, <class '_fro...
    modules = {'IPython': <module 'IPython' from '/usr/lib/python3.10/site...
    orig_argv = ['/usr/bin/python3', '-m', 'ipykernel_launcher', '-f', '/h...
    path = ['/home/httpd/html/zoo/classes/cs200/lectures', '/usr/lib64/pyt...
    path_hooks = [<class 'zipimport.zipimporter'>, <function FileFinder.pa...
    path_importer_cache = {'/home/accts/sbs5/.ipython/extensions': FileFin...
    platform = 'linux'
    platlibdir = 'lib64'
    prefix = '/usr'
    ps1 = 'In : '
    ps2 = '...: '
    ps3 = 'Out: '
    pycache_prefix = None
    stderr = <ipykernel.iostream.OutStream object>
    stdin = <_io.TextIOWrapper name='<stdin>' mode='r' encoding='utf-8'>
    stdlib_module_names = frozenset({'__future__', '_abc', '_aix_support',...
    stdout = <ipykernel.iostream.OutStream object>
    thread_info = sys.thread_info(name='pthread', lock='semaphore', versio...
    version = '3.10.5 (main, Jun  9 2022, 00:00:00) [GCC 12.1.1 20220507 (...
    version_info = sys.version_info(major=3, minor=10, micro=5, releaselev...
    warnoptions = []

FILE
    (built-in)


In [33]:
help(dir)
Help on built-in function dir in module builtins:

dir(...)
    dir([object]) -> list of strings
    
    If called without an argument, return the names in the current scope.
    Else, return an alphabetized list of names comprising (some of) the attributes
    of the given object, and of attributes reachable from it.
    If the object supplies a method named __dir__, it will be used; otherwise
    the default dir() logic is used and returns:
      for a module object: the module's attributes.
      for a class object:  its attributes, and recursively the attributes
        of its bases.
      for any other object: its attributes, its class's attributes, and
        recursively the attributes of its class's base classes.

In [34]:
dir(sys)
Out[34]:
['__breakpointhook__',
 '__displayhook__',
 '__doc__',
 '__excepthook__',
 '__interactivehook__',
 '__loader__',
 '__name__',
 '__package__',
 '__spec__',
 '__stderr__',
 '__stdin__',
 '__stdout__',
 '__unraisablehook__',
 '_base_executable',
 '_clear_type_cache',
 '_current_exceptions',
 '_current_frames',
 '_deactivate_opcache',
 '_debugmallocstats',
 '_framework',
 '_getframe',
 '_git',
 '_home',
 '_xoptions',
 'abiflags',
 'addaudithook',
 'api_version',
 'argv',
 'audit',
 'base_exec_prefix',
 'base_prefix',
 'breakpointhook',
 'builtin_module_names',
 'byteorder',
 'call_tracing',
 'copyright',
 'displayhook',
 'dont_write_bytecode',
 'exc_info',
 'excepthook',
 'exec_prefix',
 'executable',
 'exit',
 'flags',
 'float_info',
 'float_repr_style',
 'get_asyncgen_hooks',
 'get_coroutine_origin_tracking_depth',
 'getallocatedblocks',
 'getdefaultencoding',
 'getdlopenflags',
 'getfilesystemencodeerrors',
 'getfilesystemencoding',
 'getprofile',
 'getrecursionlimit',
 'getrefcount',
 'getsizeof',
 'getswitchinterval',
 'gettrace',
 'hash_info',
 'hexversion',
 'implementation',
 'int_info',
 'intern',
 'is_finalizing',
 'last_traceback',
 'last_type',
 'last_value',
 'maxsize',
 'maxunicode',
 'meta_path',
 'modules',
 'orig_argv',
 'path',
 'path_hooks',
 'path_importer_cache',
 'platform',
 'platlibdir',
 'prefix',
 'ps1',
 'ps2',
 'ps3',
 'pycache_prefix',
 'set_asyncgen_hooks',
 'set_coroutine_origin_tracking_depth',
 'setdlopenflags',
 'setprofile',
 'setrecursionlimit',
 'setswitchinterval',
 'settrace',
 'stderr',
 'stdin',
 'stdlib_module_names',
 'stdout',
 'thread_info',
 'unraisablehook',
 'version',
 'version_info',
 'warnoptions']
In [35]:
import collatz
In [36]:
dir(collatz)
Out[36]:
['__builtins__',
 '__cached__',
 '__doc__',
 '__file__',
 '__loader__',
 '__name__',
 '__package__',
 '__spec__',
 'collatz',
 'collatzg',
 'cseries']
In [45]:
collatz.__name__
Out[45]:
'collatz'
In [46]:
collatz.cseries
Out[46]:
<function collatz.cseries(n)>
In [47]:
collatz.cseries(16)
16
8
4
2
1
done
In [48]:
collatz.__file__
Out[48]:
'/home/httpd/html/zoo/classes/cs200/lectures/collatz.py'

End of Introduction notebook.