{"id":1332,"date":"2013-07-15T11:28:12","date_gmt":"2013-07-15T09:28:12","guid":{"rendered":"http:\/\/blog.gocept.com\/?p=1332"},"modified":"2013-07-15T11:28:12","modified_gmt":"2013-07-15T09:28:12","slug":"reliable-file-updates-with-python","status":"publish","type":"post","link":"https:\/\/blog.gocept.com\/2013\/07\/15\/reliable-file-updates-with-python\/","title":{"rendered":"Reliable file updates with Python"},"content":{"rendered":"<p>Programs need to update files. Although most programmers know that unexpected things can happen while performing I\/O, I often see code that has been written in a surprisingly na\u00efve way. In this article, I would like to share some insights on how to improve I\/O reliability in Python code.<\/p>\n<div>\n<div>\n<div>\n<div id=\"reliable-file-updates-with-python\">\n<p>Consider the following Python snippet. Some operation is performed on data coming from and going back into a file:<\/p>\n<div>\n<div>\n<pre>with open(filename) as f:\r\n   input = f.read()\r\noutput = do_something(input)\r\nwith open(filename, 'w') as f:\r\n   f.write(output)<\/pre>\n<\/div>\n<\/div>\n<p>Pretty simple? Probably not as simple as it looks at the first glance. I often debug applications that show strange behaviour on production servers. Here are examples of failure modes I have seen:<\/p>\n<ul>\n<li>A run away server process spills out huge amounts of logs and the disk fills up. <cite>write()<\/cite> raises an exception right after truncating the file, leaving the file empty.<\/li>\n<li>Several instances of our application happen to run in parallel. After they have finished, the file contents is garbage because it intermingles output from multiple instances.<\/li>\n<li>The application triggers some follow-up action after completing the write. Seconds later, the power goes off. After we have restarted the server, we see the old file contents again. The data already passed to other applications does not correspond to what we see in the file anymore.<\/li>\n<\/ul>\n<p>Nothing of what follows is really new. My goal is to present common approaches and techniques to Python developers who are less experienced in system programming. I will provide code examples to make it easy for developers to incorporate these approaches into their own code.<\/p>\n<div id=\"what-does-reliability-mean-anyway\">\n<h2>What does \u201creliability\u201d mean anyway?<\/h2>\n<p>In the broadest sense, reliability means that an operation is performing its required function under all stated conditions. With regard to file updates, the function in question is to create, replace or extend the contents of a file. It might be rewarding to seek inspiration from database theory here. The <a href=\"https:\/\/en.wikipedia.org\/wiki\/ACID\">ACID<\/a> properties of the classic transaction model will serve as guidelines to improve reliability.<\/p>\n<p>To get started, let\u2019s see how the initial example can be rated against the four ACID properties:<\/p>\n<ul>\n<li><strong>Atomicity<\/strong> requires that a transaction either succeeds or fails completely. In the example shown above, a full disk will likely result in a partially written file. Additionally, if other programs read the file while it is being written, they get a half-finished version even in the absence of write errors.<\/li>\n<li><strong>Consistency<\/strong> denotes that updates must bring the system from one valid state to another. Consistency can be subdivided into internal and external consistency: Internal consistency means that the file\u2019s data structures are consistent. External consistency means that the file\u2019s contents is aligned with other data related to it. In this example, it is hard to reason about consistency since we don\u2019t know enough about the application. But since consistency requires atomicity, we can say at least that internal consistency is not guaranteed.<\/li>\n<li><strong>Isolation<\/strong> is violated if running transactions concurrently yields different results from running the same transactions sequentially. It is clear that the code above has no protection against <a href=\"http:\/\/drtom.ch\/posts\/2011\/11\/12\/The_Lost_Update_Problem_-_Part_1\/\">lost updates<\/a> or other isolation failures.<\/li>\n<li><strong>Durability<\/strong> means that changes need to be permanent. Before we signal success to the user, we must be sure that our data hits non-volatile storage and not just a write cache. Perhaps the code above has been written with the assumption in mind that disk I\/O takes place immediately when we call <cite>write()<\/cite>. This assumption is not warranted by POSIX semantics.<\/li>\n<\/ul>\n<\/div>\n<div id=\"use-a-database-system-if-you-can\">\n<h2>Use a database system if you can<\/h2>\n<p>If we would be able to gain all four ACID properties, we would have come a long way towards increased reliability. But this requires significant coding effort. Why reinvent the wheel? Most database systems already have ACID transactions.<\/p>\n<p>Reliable data storage is a solved problem. <strong>If you need reliable storage, use a database.<\/strong> Chances are high that you will not do it by yourself as good as those who have been working on it for years if not decades. If you do not want to set up a \u201cbig\u201d database server, you can use <a href=\"http:\/\/www.sqlite.org\">sqlite<\/a> for example. It has ACID transactions, it\u2019s small, it\u2019s free, and it\u2019s included in Python\u2019s <a href=\"http:\/\/docs.python.org\/3\/library\/sqlite3.html\">standard library<\/a>.<\/p>\n<p>The article could finish here. But there are valid reasons not to use a database. They are often tied to <em>file format<\/em> or <em>file location<\/em> constraints. Both are not easily controllable with database systems. Reasons include:<\/p>\n<ul>\n<li>we must process files generated by other applications, which are in a fixed format or at a fixed location<\/li>\n<li>we must write files for consumption by other applications (and the same restrictions apply)<\/li>\n<li>our files must be human-readable or human-editable<\/li>\n<\/ul>\n<p>&#8230;and so on. You get the point.<\/p>\n<p>If we are set out to implement reliable file updates on our own, there are some programming techniques to consider. In the following, I will present four common patterns of performing file updates. After that, I will discuss what steps can be taken to establish ACID properties with each file update pattern.<\/p>\n<\/div>\n<div id=\"file-update-patterns\">\n<h2>File update patterns<\/h2>\n<p>Files can be updated in a multitude of ways, but I see at least four common patterns. These will serve as a basis for the rest of this article.<\/p>\n<div id=\"truncate-write\">\n<h3>Truncate-Write<\/h3>\n<p>This is probably the most basic pattern. In the following example, hypothetical domain model code reads data, performs some computation, and re-opens the existing file in write mode:<\/p>\n<div>\n<div>\n<pre>with open(filename, 'r') as f:\r\n   model.read(f)\r\nmodel.process()\r\nwith open(filename, 'w') as f:\r\n   model.write(f)<\/pre>\n<\/div>\n<\/div>\n<p>A variant of this pattern opens the file in read-write mode (the \u201cplus\u201d modes in Python), seeks to the start, issues an explicit <cite>truncate()<\/cite> call and rewrites the contents:<\/p>\n<div>\n<div>\n<pre>with open(filename, 'a+') as f:\r\n   f.seek(0)\r\n   model.input(f.read())\r\n   model.compute()\r\n   f.seek(0)\r\n   f.truncate()\r\n   f.write(model.output())<\/pre>\n<\/div>\n<\/div>\n<p>An advantage of this variant is that we open file only once and keep it open all the time. This simplifies locking for example.<\/p>\n<\/div>\n<div id=\"write-replace\">\n<h3>Write-Replace<\/h3>\n<p>Another widely used pattern is to write new contents into a temporary file and replace the original file after that:<\/p>\n<div>\n<div>\n<pre>with tempfile.NamedTemporaryFile(\r\n      'w', dir=os.path.dirname(filename), delete=False) as tf:\r\n   tf.write(model.output())\r\n   tempname = tf.name\r\nos.rename(tempname, filename)<\/pre>\n<\/div>\n<\/div>\n<p>This method is more robust against errors than the <em>truncate-write<\/em> method. See below for a discussion of atomicity and consistency properties. It is used by many applications.<\/p>\n<p>These first two patterns are so common that the ext4 filesystem in the Linux kernel even <a href=\"http:\/\/www.mjmwired.net\/kernel\/Documentation\/filesystems\/ext4.txt#310\">detects them<\/a> and fixes some reliability shortcomings automatically. But don\u2019t depend on it: you are not always using ext4, and the administrator might have disabled this feature.<\/p>\n<\/div>\n<div id=\"append\">\n<h3>Append<\/h3>\n<p>The third pattern is to append new data to an existing file:<\/p>\n<div>\n<div>\n<pre>with open(filename, 'a') as f:\r\n   f.write(model.output())<\/pre>\n<\/div>\n<\/div>\n<p>This pattern is used for writing log files and other cumulative data processing tasks. Technically, its outstanding feature is its extreme simplicity. An interesting extension is to perform append-only updates during regular operation and to reorganize the file into a more compact form periodically.<\/p>\n<\/div>\n<div id=\"spooldir\">\n<h3>Spooldir<\/h3>\n<p>Here we treat a directory as logical data store and create a new uniquely named file for each record:<\/p>\n<div>\n<div>\n<pre>with open(unique_filename(), 'w') as f:\r\n   f.write(model.output())<\/pre>\n<\/div>\n<\/div>\n<p>This pattern shares its cumulative nature with the <em>append<\/em> pattern. A big advantage is that we can put a little amount of metadata into the file name. This can be used, for example, to convey information about the processing status. A particular clever implementation of the <em>spooldir<\/em> pattern is the <a href=\"https:\/\/en.wikipedia.org\/wiki\/Maildir\">maildir<\/a> format. Maildirs use a naming scheme with additional subdirectories to perform update operations in a reliable and lock-free way. The <a href=\"https:\/\/pypi.python.org\/pypi\/md\/\">md<\/a> and <a href=\"https:\/\/pypi.python.org\/pypi\/gocept.filestore\/\">gocept.filestore<\/a> libraries provide convenient wrappers for maildir operations.<\/p>\n<p>If your file name generation is not guaranteed to give unique results, there is even a possibility to demand that the file must be actually new. Use the low-level <cite>os.open()<\/cite> call with proper flags:<\/p>\n<div>\n<div>\n<pre>fd = os.open(filename, os.O_WRONLY | os.O_CREAT| os.O_EXCL, 0o666)\r\nwith os.fdopen(fd, 'w') as f:\r\n   f.write(...)<\/pre>\n<\/div>\n<\/div>\n<p>After opening the file with <cite>O_EXCL<\/cite>, we use <cite>os.fdopen<\/cite> to convert the raw file descriptor into a regular Python file object.<\/p>\n<\/div>\n<\/div>\n<div id=\"applying-acid-properties-to-file-updates\">\n<h2>Applying ACID properties to file updates<\/h2>\n<p>In the following, I will try to enhance the file update patterns. Let\u2019s see what we can do to meet each ACID property in turn. I will keep this as simple as possible, since we are not planning to write a complete database system. Please note that the material presented in this section is not exhaustive, but it may give you a good starting point for your own experimentation.<\/p>\n<div id=\"atomicity\">\n<h3>Atomicity<\/h3>\n<p>The <strong>write-replace<\/strong> pattern gives you atomicity for free since the underlying <cite>os.rename()<\/cite> function <a href=\"http:\/\/rcrowley.org\/2010\/01\/06\/things-unix-can-do-atomically.html\">is atomic<\/a>. This means that at any given point in time, any process sees either the old or the new file. This pattern has a natural robustness against write errors: if the write operation triggers an exception, the rename operation is never performed and thus, we are not in the danger of overwriting a good old file with a damaged new one.<\/p>\n<p>The <strong>append<\/strong> patterns is not atomic by itself, because we risk to append incomplete records. But there is a trick to make updates appear atomic: Annotate each written record with a checksum. When reading the log later on, discard all records that do not have a valid checksum. This way, only complete records will be processed. In the following example, an application makes periodic measurements and appends a one-line JSON record each time to a log. We compute a CRC32 checksum of the record\u2019s byte representation and append it to the same line:<\/p>\n<div>\n<div>\n<pre>with open(logfile, 'ab') as f:\r\n    for i in range(3):\r\n        measure = {'timestamp': time.time(), 'value': random.random()}\r\n        record = json.dumps(measure).encode()\r\n        checksum = '{:8x}'.format(zlib.crc32(record)).encode()\r\n        f.write(record + b' ' + checksum + b'\\n')<\/pre>\n<\/div>\n<\/div>\n<p>This example code simulates the measurements by creating a random value every second.<\/p>\n<div>\n<pre>$ cat log\r\n{\"timestamp\": 1373396987.258189, \"value\": 0.9360123151217828} 9495b87a\r\n{\"timestamp\": 1373396987.25825, \"value\": 0.40429005476999424} 149afc22\r\n{\"timestamp\": 1373396987.258291, \"value\": 0.232021160265939} d229d937<\/pre>\n<\/div>\n<p>To process the log file, we read one record per line, split off the checksum, and compare it to the read record:<\/p>\n<div>\n<div>\n<pre>with open(logfile, 'rb') as f:\r\n    for line in f:\r\n        record, checksum = line.strip().rsplit(b' ', 1)\r\n        if checksum.decode() == '{:8x}'.format(zlib.crc32(record)):\r\n            print('read measure: {}'.format(json.loads(record.decode())))\r\n        else:\r\n            print('checksum error for record {}'.format(record))<\/pre>\n<\/div>\n<\/div>\n<p>Now we simulate a truncated write by chopping the last line:<\/p>\n<div>\n<pre>$ cat log\r\n{\"timestamp\": 1373396987.258189, \"value\": 0.9360123151217828} 9495b87a\r\n{\"timestamp\": 1373396987.25825, \"value\": 0.40429005476999424} 149afc22\r\n{\"timestamp\": 1373396987.258291, \"value\": 0.23202<\/pre>\n<\/div>\n<p>When the log is read, the last incomplete line is rejected:<\/p>\n<div>\n<pre>$ read_checksummed_log.py log\r\nread measure: {'timestamp': 1373396987.258189, 'value': 0.9360123151217828}\r\nread measure: {'timestamp': 1373396987.25825, 'value': 0.40429005476999424}\r\nchecksum error for record b'{\"timestamp\": 1373396987.258291, \"value\":'<\/pre>\n<\/div>\n<p>The checksummed log record approach is used by a large number of applications including many database systems.<\/p>\n<p>Individual files in the <strong>spooldir<\/strong> can likewise feature a checksum in each file. Another, probably easier, approach is to borrow from the <em>write-replace<\/em> pattern: first write the file aside and move it to its final location afterwards. Devise a naming scheme that protects work-in-progress files from being processed by consumers. In the following example, all file names ending with <tt>.tmp<\/tt> are ignored by readers and are thus safe to use during write operations:<\/p>\n<div>\n<div>\n<pre>newfile = generate_id()\r\nwith open(newfile + '.tmp', 'w') as f:\r\n   f.write(model.output())\r\nos.rename(newfile + '.tmp', newfile)<\/pre>\n<\/div>\n<\/div>\n<p>At last, <strong>truncate-write<\/strong> is non-atomic. I am sorry that I am not able to offer you an atomic variant. Right after performing the truncate operation, the file is nulled and no new content has been written yet. If a concurrent program reads the file now or, worse yet, an exception occurs and our program gets aborted, we see neither the old nor the new version.<\/p>\n<\/div>\n<div id=\"consistency\">\n<h3>Consistency<\/h3>\n<p>Most things I have said about atomicity can be applied to consistency as well. In fact, atomic updates are a prerequisite for internal consistency. External consistency means to update several files in sync. As this cannot easily be done, lock files can be used to ensure that read and write access do not interfere. Consider a directory where files need to be consistent with each other. A common pattern is to designate a lock file, which controls access for the whole directory.<\/p>\n<p>Example writer code:<\/p>\n<div>\n<div>\n<pre>with open(os.path.join(dirname, '.lock'), 'a+') as lockfile:\r\n   fcntl.flock(lockfile, fcntl.LOCK_EX)\r\n   model.update(dirname)<\/pre>\n<\/div>\n<\/div>\n<p>Example reader code:<\/p>\n<div>\n<div>\n<pre>with open(os.path.join(dirname, '.lock'), 'a+') as lockfile:\r\n   fcntl.flock(lockfile, fcntl.LOCK_SH)\r\n   model.readall(dirname)<\/pre>\n<\/div>\n<\/div>\n<p>This method only works if we have control over all readers. Since there may be only one writer active at a time (the exclusive lock is blocking all shared locks), the scalability of this method is limited.<\/p>\n<p>To take it one step further, we can apply the <strong>write-replace<\/strong> pattern to whole directories. This involves creating a new directory for each update <em>generation<\/em> and changing a symlink once the update is complete. For example, a mirroring application maintains a directory of tarballs together with an index file, which lists file name, file size, and a checksum. When the upstream mirror gets updated, it is not enough to implement an atomic file update for every tarball and the index file in isolation. Instead, we need to flip both the tarballs and the index file at the same time to avoid checksum mismatches. To solve this problem, we maintain a subdirectory for each generation and symlink the active generation:<\/p>\n<div>\n<pre>mirror\r\n|-- 483\r\n|   |-- a.tgz\r\n|   |-- b.tgz\r\n|   `-- index.json\r\n|-- 484\r\n|   |-- a.tgz\r\n|   |-- b.tgz\r\n|   |-- c.tgz\r\n|   `-- index.json\r\n`-- current -&gt; 483<\/pre>\n<\/div>\n<p>Here, the new generation 484 is in the process of being updated. When all tarballs are present and the index file is up to date, we can switch the <tt>current<\/tt> symlink with a single, atomic <cite>os.symlink()<\/cite> call. Other applications see always either the complete old or the complete new generation. It is important that readers need to <cite>os.chdir()<\/cite> into the <tt>current<\/tt> directory and refer to files without their full path names. Otherwise, there is a race condition when a reader first opens <tt>current\/index.json<\/tt> and then opens <tt>current\/a.tgz<\/tt>, but in the meanwhile the symlink target has been changed.<\/p>\n<\/div>\n<div id=\"isolation\">\n<h3>Isolation<\/h3>\n<p>Isolation means that concurrent updates to the same file are <em>serializable<\/em> \u2014 there exists a serial schedule that gives the same results as the parallel schedule actually performed. \u201cReal\u201d database systems use advanced techniques like <a href=\"https:\/\/en.wikipedia.org\/wiki\/Multiversion_concurrency_control\">MVCC<\/a> to maintain serializability while allowing for a great degree of parallelism. Back on our own, we better use locks to serialize file updates.<\/p>\n<p>Locking <strong>truncate-write<\/strong> updates is easy. Just acquire an exclusive lock prior to all file operations. The following example code reads an integer from a file, increments it, and updates the file:<\/p>\n<div>\n<div>\n<pre>def update():\r\n   with open(filename, 'r+') as f:\r\n      fcntl.flock(f, fcntl.LOCK_EX)\r\n      n = int(f.read())\r\n      n += 1\r\n      f.seek(0)\r\n      f.truncate()\r\n      f.write('{}\\n'.format(n))<\/pre>\n<\/div>\n<\/div>\n<p>Locking updates using the <strong>write-replace<\/strong> pattern can be tricky. Using a lock the same way as in <em>truncate-write<\/em> can lead to updates conflicts. A na\u00efve implementation could look like this:<\/p>\n<div>\n<div>\n<pre>def update():\r\n   with open(filename) as f:\r\n      fcntl.flock(f, fcntl.LOCK_EX)\r\n      n = int(f.read())\r\n      n += 1\r\n      with tempfile.NamedTemporaryFile(\r\n            'w', dir=os.path.dirname(filename), delete=False) as tf:\r\n         tf.write('{}\\n'.format(n))\r\n         tempname = tf.name\r\n      os.rename(tempname, filename)<\/pre>\n<\/div>\n<\/div>\n<p>What is wrong with this code? Imagine two processes compete to update a file. The first process just goes ahead, but the second process is blocked in the <cite>fcntl.flock()<\/cite> call. When the first process replaces the file and releases the lock, the already open file descriptor in the second process now points to a \u201cghost\u201d file (not reachable by any path name) with old contents. To avoid this conflict, we must check that our open file is still the same after returning from <cite>fcntl.flock()<\/cite>. So I have written a new <cite>LockedOpen<\/cite> context manager to replace the built-in <cite>open<\/cite> context. It ensures that we actually open the right file:<\/p>\n<div>\n<div>\n<pre>class LockedOpen(object):\r\n\r\n    def __init__(self, filename, *args, **kwargs):\r\n        self.filename = filename\r\n        self.open_args = args\r\n        self.open_kwargs = kwargs\r\n        self.fileobj = None\r\n\r\n    def __enter__(self):\r\n        f = open(self.filename, *self.open_args, **self.open_kwargs)\r\n        while True:\r\n            fcntl.flock(f, fcntl.LOCK_EX)\r\n            fnew = open(self.filename, *self.open_args, **self.open_kwargs)\r\n            if os.path.sameopenfile(f.fileno(), fnew.fileno()):\r\n                fnew.close()\r\n                break\r\n            else:\r\n                f.close()\r\n                f = fnew\r\n        self.fileobj = f\r\n        return f\r\n\r\n    def __exit__(self, _exc_type, _exc_value, _traceback):\r\n        self.fileobj.close()<\/pre>\n<\/div>\n<\/div>\n<div>\n<div>\n<pre>    def update(self):\r\n        with LockedOpen(filename, 'r+') as f:\r\n            n = int(f.read())\r\n            n += 1\r\n            with tempfile.NamedTemporaryFile(\r\n                    'w', dir=os.path.dirname(filename), delete=False) as tf:\r\n                tf.write('{}\\n'.format(n))\r\n                tempname = tf.name\r\n            os.rename(tempname, filename)<\/pre>\n<\/div>\n<\/div>\n<p>Locking <strong>append<\/strong> updates is as easy as locking <em>truncate-write<\/em> updates: acquire an exclusive lock, append, done. Long-running processes, which leave a file permanently open, may need to release locks between updates to let others in.<\/p>\n<p>The <strong>spooldir<\/strong> pattern has the elegant property that it does not require any locking. Again, it depends on using a clever naming scheme and a robust unique file name generation. The <a href=\"http:\/\/cr.yp.to\/proto\/maildir.html\">maildir specification<\/a> is a good example for a spooldir design. It can be easily adapted to other cases, which have nothing to do with mail.<\/p>\n<\/div>\n<div id=\"durability\">\n<h3>Durability<\/h3>\n<p>Durability is a bit special because it depends not only on the application, but also on OS and hardware configuration. In theory, we can assume that <cite>os.fsync()<\/cite> or <cite>os.fdatasync()<\/cite> calls do not return until data has reached permanent storage. In practice, we may run into several problems: we may be facing incomplete fsync implementations or awkward disk controller configurations, which never give any persistence guarantee. A talk from a <a href=\"http:\/\/www.oscon.com\/oscon2008\/public\/schedule\/detail\/3172\">MySQL dev<\/a> goes into great detail of what can go wrong. Some database systems like PostgreSQL even offer a <a href=\"http:\/\/www.postgresql.org\/docs\/9.2\/static\/runtime-config-wal.html\">choice of persistence mechanisms<\/a> so that the administrator can select the best suited one at runtime. The poor man\u2019s option although is to just use <cite>os.fsync()<\/cite> and hope that it has been implemented correctly.<\/p>\n<p>With the <strong>truncate-write<\/strong> pattern, we have to issue an fsync after finishing write operations but before closing the file. Note that there is usually another level of write caching involved. The <em>glibc buffer<\/em> holds back writes inside the process even before they are passed to the kernel. To get the glibc buffer empty as well, we have to <cite>flush()<\/cite> it before fsync\u2019ing:<\/p>\n<div>\n<div>\n<pre>with open(filename, 'w') as f:\r\n   model.write(f)\r\n   f.flush()\r\n   os.fdatasync(f)<\/pre>\n<\/div>\n<\/div>\n<p>Alternatively, you can invoke Python with the <strong>-u<\/strong> flag to get unbuffered writes for all file I\/O.<\/p>\n<p>I prefer <cite>os.fdatasync()<\/cite> over <cite>os.fsync()<\/cite> most of the time to avoid synchronous metadata updates (ownership, size, mtime, &#8230;). Metadata updates can result in seeky disk I\/O, which slows things down quite a bit.<\/p>\n<p>Applying the same trick to <strong>write-replace<\/strong> style updates is only half of the story. We make sure that the newly written file has been pushed to non-volatile storage before replacing the old file, but what about the replace operation itself? We have no guarantee that the directory update is performed right on. There are lengthy discussions on <a href=\"http:\/\/stackoverflow.com\/questions\/3764822\/how-to-durably-rename-a-file-in-posix\/5809073#5809073\">how to sync a directory update<\/a> on the net, but in our case (old and new file are in the same directory) we can get away with this rather simple solution:<\/p>\n<div>\n<div>\n<pre>os.rename(tempname, filename)\r\ndirfd = os.open(os.path.dirname(filename), os.O_DIRECTORY)\r\nos.fsync(dirfd)\r\nos.close(dirfd)<\/pre>\n<\/div>\n<\/div>\n<p>We open the directory with the low-level <cite>os.open()<\/cite> call (Python\u2019s built-in <cite>open()<\/cite> does not support opening directories) and perform a <cite>os.fsync()<\/cite> on the directory\u2019s file descriptor.<\/p>\n<p>Persisting <strong>append<\/strong> updates is again quite similar to what I have said about <em>truncate-write<\/em>.<\/p>\n<p>The <strong>spooldir<\/strong> pattern has the same directory sync problems as the <em>write-replace<\/em> pattern. Fortunately, the same solution applies here as well: first sync the file, then sync the directory.<\/p>\n<\/div>\n<\/div>\n<div id=\"conclusion\">\n<h2>Conclusion<\/h2>\n<p>It is possible to update files reliably. I have shown that all four ACID properties can be met. The code examples presented above may serve as a toolbox. Pick the programming techniques that match your needs best. At times, you don\u2019t need all four ACID properties but only one or two. I hope that this article helps you to make an informed decision about what to implement and what to leave out.<\/p>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>Programs need to update files. Although most programmers know that unexpected things can happen while performing I\/O, I often see code that has been written in a surprisingly na\u00efve way. In this article, I would like to share some insights on how to improve I\/O reliability in Python code. Consider the following Python snippet. Some &hellip; <a href=\"https:\/\/blog.gocept.com\/2013\/07\/15\/reliable-file-updates-with-python\/\" class=\"more-link\">Continue reading<span class=\"screen-reader-text\"> &#8220;Reliable file updates with Python&#8221;<\/span><\/a><\/p>\n","protected":false},"author":11966441,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_coblocks_attr":"","_coblocks_dimensions":"","_coblocks_responsive_height":"","_coblocks_accordion_ie_support":"","advanced_seo_description":"","jetpack_seo_html_title":"","jetpack_seo_noindex":false,"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_newsletter_tier_id":0,"footnotes":"","jetpack_publicize_message":"","jetpack_is_tweetstorm":false,"jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":false,"jetpack_social_options":{"image_generator_settings":{"template":"highway","enabled":false}}},"categories":[10221],"tags":[196,832],"jetpack_publicize_connections":[],"jetpack_featured_media_url":"","jetpack_likes_enabled":true,"jetpack_sharing_enabled":true,"jetpack_shortlink":"https:\/\/wp.me\/pFP3y-lu","jetpack-related-posts":[{"id":268,"url":"https:\/\/blog.gocept.com\/2012\/11\/09\/python-2-and-3-compatible-builds-with-zc-buildout\/","url_meta":{"origin":1332,"position":0},"title":"Python 2 and 3 compatible builds with zc.buildout","author":"","date":"November 9, 2012","format":false,"excerpt":"Creating a single-source build environment with zc.buildout that works for both Python 2 and 3 is a bit of a hassle. This blog post shows how to do it for a minimal demo project. During the sprints at PyCon DE 2012, we tried to make the upcoming 1.0 release of\u2026","rel":"","context":"In &quot;en&quot;","block_context":{"text":"en","link":"https:\/\/blog.gocept.com\/category\/en\/"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":1873,"url":"https:\/\/blog.gocept.com\/2016\/10\/26\/towards-restrictedpython-3\/","url_meta":{"origin":1332,"position":1},"title":"Towards RestrictedPython 3","author":"Michael Howitz","date":"October 26, 2016","format":false,"excerpt":"The biggest blocker to port Zope to Python 3 is\u00a0RestrictedPython. What is RestrictedPython? It is a library used by Zope to restrict Python code at instruction level to a bare minimum of trusted functionality. It\u00a0parses and filters the code for not\u00a0allowed constructs (such as\u00a0open()) and adds wrappers around\u00a0each access on\u2026","rel":"","context":"In &quot;en&quot;","block_context":{"text":"en","link":"https:\/\/blog.gocept.com\/category\/en\/"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/blog.gocept.com\/wp-content\/uploads\/2016\/10\/zope-is-not-dead.jpg?fit=1200%2C658&ssl=1&resize=350%2C200","width":350,"height":200,"srcset":"https:\/\/i0.wp.com\/blog.gocept.com\/wp-content\/uploads\/2016\/10\/zope-is-not-dead.jpg?fit=1200%2C658&ssl=1&resize=350%2C200 1x, https:\/\/i0.wp.com\/blog.gocept.com\/wp-content\/uploads\/2016\/10\/zope-is-not-dead.jpg?fit=1200%2C658&ssl=1&resize=525%2C300 1.5x, https:\/\/i0.wp.com\/blog.gocept.com\/wp-content\/uploads\/2016\/10\/zope-is-not-dead.jpg?fit=1200%2C658&ssl=1&resize=700%2C400 2x, https:\/\/i0.wp.com\/blog.gocept.com\/wp-content\/uploads\/2016\/10\/zope-is-not-dead.jpg?fit=1200%2C658&ssl=1&resize=1050%2C600 3x"},"classes":[]},{"id":1262,"url":"https:\/\/blog.gocept.com\/2013\/04\/17\/running-tests-using-gocept-selenium-on-travis-ci\/","url_meta":{"origin":1332,"position":2},"title":"Running tests using gocept.selenium on Travis-CI","author":"Michael Howitz","date":"April 17, 2013","format":false,"excerpt":"Travis-CI\u00a0is a\u00a0free hosted continuous integration platform for the open source community. It has a good integration with Github, so each push to a project runs the tests \u00a0of the project. gocept.selenium\u00a0is a python package our company has developed as a test-friendly Python API for Selenium\u00a0which allows to run tests in\u2026","rel":"","context":"In &quot;en&quot;","block_context":{"text":"en","link":"https:\/\/blog.gocept.com\/category\/en\/"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":2407,"url":"https:\/\/blog.gocept.com\/2017\/05\/05\/zope-2-resurrection-sprint-goal-accomplished\/","url_meta":{"origin":1332,"position":3},"title":"Zope 2 Resurrection Sprint \u2013 Goal accomplished","author":"Michael Howitz","date":"May 5, 2017","format":false,"excerpt":"The sprint days were really busy for\u00a0Earl Zope II\u00a0and the people helping him with the Python 3 wonderland immigration authorities. Zope can be installed using Python 3 can be started and renders some views has more than 1.700 of more than 2.300 tests running has some\u00a0optional dependencies left to be\u2026","rel":"","context":"In &quot;en&quot;","block_context":{"text":"en","link":"https:\/\/blog.gocept.com\/category\/en\/"},"img":{"alt_text":"Many screens","src":"https:\/\/i0.wp.com\/blog.gocept.com\/wp-content\/uploads\/2017\/05\/img_20170503_183916-e1493991770316.jpg?fit=1200%2C943&ssl=1&resize=350%2C200","width":350,"height":200,"srcset":"https:\/\/i0.wp.com\/blog.gocept.com\/wp-content\/uploads\/2017\/05\/img_20170503_183916-e1493991770316.jpg?fit=1200%2C943&ssl=1&resize=350%2C200 1x, https:\/\/i0.wp.com\/blog.gocept.com\/wp-content\/uploads\/2017\/05\/img_20170503_183916-e1493991770316.jpg?fit=1200%2C943&ssl=1&resize=525%2C300 1.5x, https:\/\/i0.wp.com\/blog.gocept.com\/wp-content\/uploads\/2017\/05\/img_20170503_183916-e1493991770316.jpg?fit=1200%2C943&ssl=1&resize=700%2C400 2x, https:\/\/i0.wp.com\/blog.gocept.com\/wp-content\/uploads\/2017\/05\/img_20170503_183916-e1493991770316.jpg?fit=1200%2C943&ssl=1&resize=1050%2C600 3x"},"classes":[]},{"id":3229,"url":"https:\/\/blog.gocept.com\/2018\/06\/07\/migrate-a-zope-zodb-data-fs-to-python-3\/","url_meta":{"origin":1332,"position":4},"title":"Migrate a Zope ZODB Data.fs to Python 3","author":"Michael Howitz","date":"June 7, 2018","format":false,"excerpt":"TL;DR Use\u00a0zodbupdate. Problem A ZODB\u00a0Data.fs\u00a0which was created under Python 2 cannot be opened under Python 3. This is prevented by using a different magic code in the first bytes of the file. This is done on purpose because str\u00a0has a different meaning for the two Python versions: Under Python 2\u2026","rel":"","context":"In &quot;en&quot;","block_context":{"text":"en","link":"https:\/\/blog.gocept.com\/category\/en\/"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/blog.gocept.com\/wp-content\/uploads\/2018\/06\/spring-3383890_1280.jpg?fit=1200%2C797&ssl=1&resize=350%2C200","width":350,"height":200,"srcset":"https:\/\/i0.wp.com\/blog.gocept.com\/wp-content\/uploads\/2018\/06\/spring-3383890_1280.jpg?fit=1200%2C797&ssl=1&resize=350%2C200 1x, https:\/\/i0.wp.com\/blog.gocept.com\/wp-content\/uploads\/2018\/06\/spring-3383890_1280.jpg?fit=1200%2C797&ssl=1&resize=525%2C300 1.5x, https:\/\/i0.wp.com\/blog.gocept.com\/wp-content\/uploads\/2018\/06\/spring-3383890_1280.jpg?fit=1200%2C797&ssl=1&resize=700%2C400 2x, https:\/\/i0.wp.com\/blog.gocept.com\/wp-content\/uploads\/2018\/06\/spring-3383890_1280.jpg?fit=1200%2C797&ssl=1&resize=1050%2C600 3x"},"classes":[]},{"id":3350,"url":"https:\/\/blog.gocept.com\/2019\/11\/13\/union-cms-released-on-python-3\/","url_meta":{"origin":1332,"position":5},"title":"union.cms released on Python 3","author":"Michael Howitz","date":"November 13, 2019","format":false,"excerpt":"union.cms is a content management system which was once developed on Zope 2. It was one of the early adopters of the Five technology aka using Zope 3 components in Zope 2. Now it is one of the proud early adopters of Zope 4 on Python 3. It is used\u2026","rel":"","context":"In &quot;en&quot;","block_context":{"text":"en","link":"https:\/\/blog.gocept.com\/category\/en\/"},"img":{"alt_text":"Green tree python","src":"https:\/\/i0.wp.com\/blog.gocept.com\/wp-content\/uploads\/2019\/11\/green-tree-python-1312700.jpg?fit=1200%2C863&ssl=1&resize=350%2C200","width":350,"height":200,"srcset":"https:\/\/i0.wp.com\/blog.gocept.com\/wp-content\/uploads\/2019\/11\/green-tree-python-1312700.jpg?fit=1200%2C863&ssl=1&resize=350%2C200 1x, https:\/\/i0.wp.com\/blog.gocept.com\/wp-content\/uploads\/2019\/11\/green-tree-python-1312700.jpg?fit=1200%2C863&ssl=1&resize=525%2C300 1.5x, https:\/\/i0.wp.com\/blog.gocept.com\/wp-content\/uploads\/2019\/11\/green-tree-python-1312700.jpg?fit=1200%2C863&ssl=1&resize=700%2C400 2x, https:\/\/i0.wp.com\/blog.gocept.com\/wp-content\/uploads\/2019\/11\/green-tree-python-1312700.jpg?fit=1200%2C863&ssl=1&resize=1050%2C600 3x"},"classes":[]}],"_links":{"self":[{"href":"https:\/\/blog.gocept.com\/wp-json\/wp\/v2\/posts\/1332"}],"collection":[{"href":"https:\/\/blog.gocept.com\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blog.gocept.com\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blog.gocept.com\/wp-json\/wp\/v2\/users\/11966441"}],"replies":[{"embeddable":true,"href":"https:\/\/blog.gocept.com\/wp-json\/wp\/v2\/comments?post=1332"}],"version-history":[{"count":5,"href":"https:\/\/blog.gocept.com\/wp-json\/wp\/v2\/posts\/1332\/revisions"}],"predecessor-version":[{"id":1338,"href":"https:\/\/blog.gocept.com\/wp-json\/wp\/v2\/posts\/1332\/revisions\/1338"}],"wp:attachment":[{"href":"https:\/\/blog.gocept.com\/wp-json\/wp\/v2\/media?parent=1332"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blog.gocept.com\/wp-json\/wp\/v2\/categories?post=1332"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blog.gocept.com\/wp-json\/wp\/v2\/tags?post=1332"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}