Make various fixes throughout the code, use separate logging using Python logging #47

vkoukis · 2015-03-30T12:43:02Z

Make various fixes throughout the code.
Convert argument-parsing to python-argparse, improve help messages.
Use separate logging procedure, separate it from interacting with the user (Output classes).
Use it to introduce the support for debugging messages throughout the code.
Use a persistent logfile, ensure all errors are logged.
This is a work-in-progress.

The end goal is to unify all frontends into a single framework:

To create an Image is to run a series of steps. Each step has input, output, and depends on other steps. Each step modifies global state (image context). Create a single framework for executing steps, which will be able to detect dependencies between the steps, and run them either in an unattended fashion, or with manual interaction.

Finally, make the functionality of Output classes completely distinct, so a single codebase can run either with manual intervention and a dialog-based frontend (the current snf-image-creator), or with minimal intervention and a text-based frontend (the current snf-mkimage), or any other combination in between.

Factor out checking for root privileges into a separate utility function. Also ensure "snf-image-creator --help" runs successfully without root privileges.

Port dialog_main.py to use argparse instead of optparse. Add explicit positional argument, improve help text for every argument, add description and epilog blocks to output of --help.

Media is the plural of medium. This program only supports a single medium at a time, a raw hard disk, or a disk image file, so `medium' is appropriate.

Make the process of resetting the terminal and outputting the text of an unexpected exception to stderr more robust. It seems resetting the terminal after using dialog races with printing the text of the exception, and overwrites it with the dialog background. Sleep for a tiny amount of time to ensure the exception is visible. This is an ugly workaround and should be replaced.

Re-write the procedure of selecting a directory to hold temporary files, so it does not use an arbitrary list of directories including /mnt and the user's home directory (which is presumably /root), but starts with a list of all mountpoints and /tmp, /var/tmp as candidates. Also fix bug where the creator would choose a directory under a read-only mount for storing temporary files, and fail. Still, disable this code path temporarily, and only choose between /tmp and /var/tmp, if the user has made no explicit choice. It is very un-intuitive to create temporary files in unrelated directories without the user's explicit consent. Perhaps this code path can be enabled again when snf-image-creator has the ability to propose these alternate locations to the user, and have them choose explicitly.

Separate the process of logging from the process of interacting with the user ("Output" instances) to ensure emitting debugging information is possible from anywhere inside the code. Use the standard Python logging module for logging, instead of a custom approach. This brings lots of advantages, including proper log formatting and timestamping. Use logging.handlers.SysLogHandler for logging to syslog, instead of calling the syslog module directly. Also use a standard default logfile /var/log/snf-image-creator for persistence, instead of a temporary logfile, and remove redundant functions show_log(), copy_file(). This simplifies the error handling path significantly. The general idea of this commit is that the instances of "Output" should be independent frontends for interacting with the user, and can be extended to support Input as well as Output in the future. This is a Work in Progress. The previous code assumed that all messages to the user via Output instances would also make it to the log file. For now, only what is explicitly logged reaches the logfile. A workaround is to have the base Output class log all messages to the user, and will be implemented in a future commit.

louridas · 2015-03-31T04:18:13Z

Hello,

That's a very substantial pull request, thank you! To be able to incorporate parts or all of it to Synnefo, if the work is not already under GRNET's copyright, we will need to have the appropriate rights, preferably through a Copyright License Agreement (CLA), which is the norm in large projects accepting contributions. A CLA is essential to ensure the users of the software that they need not worry about the continuity of the code and they are guaranteed against any potential future problems. See for example the FAQ at the Django CLA.

vkoukis · 2015-03-31T07:08:00Z

Thank you Panos. Yes, I agree completely: Since my contribution is not under GRNET's copyright, it's important to ensure continuity of the code. The Django CLA is fine. Will you provide me with a version of it for GRNET, so we can proceed?

louridas · 2015-03-31T07:17:00Z

We'll draft a GRNET CLA ASAP and send it over to you.

louridas · 2015-03-31T10:00:10Z

Please find the CLA at:

https://www.synnefo.org/assets/grnet_cla.txt

Sign, scan, and send to [email protected] so that we can use the contribution.

vkoukis · 2015-03-31T10:41:53Z

Panos, I have read the CLA.

I see two differences from the Apache CLA:

The GRNET CLA reads
"This license is for your protection as a Contributor as well as the protection of GRNET and the users of its software;"
while the Apache CLA reads
"This license is for your protection as a Contributor as well as the protection of the Foundation and its users;"

Given that I retain Copyright for my contribution to snf-image-creator,
it does not make sense to refer to snf-image-creator as "its software" here.

If the above issue is sorted out, I can make another pull request fixing the minor issues pointed out by Nikos.

Also, the phrase "In return, the Foundation shall not use Your Contributions in a way that is contrary to the public benefit or inconsistent with its nonprofit status and bylaws in effect at the time of the contribution." is missing, but I don't have a problem with this missing part.

Regarding style, do I leave my added Copyright line in every new/modified file, the way OpenStack does, or does it suffice to change the line to "Copyright GRNET and contributors" in new/modified files and in the main README.md and just have my name in AUTHORS, like Django does?

louridas · 2015-03-31T10:50:37Z

The first comment, regarding "users of its software" is correct. We'll remove it.
The second comment relates to the modifications essential for the CLA by a company, not a foundation. So it's not missing by accident. Essentially it's what Google does in its own Apache-derived CLA.
If you do not remove your copyright byline from the source files, we'll have to do it and replace it with GRNET's. So you can save us the trouble :-)
Of course your name goes to AUTHORS, Contributors, or whatever else we have in each project (if it's not already there).

vkoukis · 2015-03-31T10:57:00Z

Panos,

About the first part, thanks, please update the CLA.
About the second part, OK.

About replacing my Copyright with GRNET's, what do you mean? You cannot remove the Copyright line, since I retain the Copyright. The point is whether this fact is displayed in individual files, or in the single Copyright line in the root README.md.

louridas · 2015-03-31T11:35:14Z

We updated the CLA.

Regarding the copyright notice, you grant GRNET a non-exclusive copyright license. Non-exclusive means that GRNET can exercise the right in question, i.e., copyright, but so can you, too. That's in contrast to an exclusive license, which would mean that only GRNET would have the right in question from now on.

As explained above, the modus operandi is exactly equivalent to say, Google and Ganeti. It's only the Google copyright that appears in Ganeti contributions, not individual contributors' (neither internal not external). We may have contributors listed in a related file, not copyright holders in our source files, apart from GRNET.

vkoukis · 2015-03-31T12:20:14Z

Panos, I am not a lawyer, but your comment does not hold.
This is a Copyright License Agreement, not a Copyright Assignment Agreement.
The CLA mentions it explicitly: I own my Contribution and am the Copyright owner.
I do not assign Copyright of my contribution to GRNET. With the CLA, I give GRNET the right
to do essentially whatever they want with the code, but the Copyright is still mine.

This is also explicitly mentioned
in the Django CLA FAQ (https://www.djangoproject.com/foundation/cla/faq/):
"
Am I giving away the copyright to my contributions?

No. This is a pure license agreement, not a copyright assignment. You still maintain the full copyright for your contributions. You are only providing a license to the DSF to distribute your code without further restrictions. This is not the case for all CLA's, but it is the case for the one we are using.
"

Regarding the modus operandi:
Django acknowledges the fact that there are multiple Copyright holders, by specifically listing
"Copyright (c) Django Software Foundation and individual contributors."
in its LICENSE.

Google also acknowledges this fact, and makes it explicit, for example in the AUTHORS file for Go:
https://go.googlesource.com/go/+/master/AUTHORS
"This is the official list of Go authors for copyright purposes.
This file is distinct from the CONTRIBUTORS files.
See the latter for an explanation."

Both of these projects, and many others, like OpenStack, use the same Apache / Google CLA.
I do not know why Ganeti does not have a similar Copyright line, although it uses the same Google CLA,
or why GRNET did not do it when contributing patches to Ganeti.

Also, please note how the AUTHORS file of Go mentions the Copyright implications
explicitly. It is not a list of people allowed to contribute, it is a list of Copyright holders,
as also explained in CONTRIBUTORS:
https://go.googlesource.com/go/+/master/CONTRIBUTORS

So, either we list all contributors individually, as I have done in this pull request,
or we add the relevant section in the top of the AUTHORS file,
and change Copyright lines of modified files to read "Copyright GRNET and contributors".
If GRNET does not want to list the individual copyright holders in source files,
this is understandable. But the Copyright line in the root README.md and the
single Copyright line in source files being modified should read "Copyright GRNET
and contributors", reflecting this reality.

louridas · 2015-03-31T16:58:27Z

IANAL either, but here is what I do understand.

If you write a patch for a piece of software, and you do not transfer the copyright, you may own the copyright to the patch. That does not include the original code.
If you contribute an entirely new file, and you do not transfer the copyright, you own the copyright to the file.
Of course copyright ownership is distinct from the moral right of the author to be identified as such. Even if you sign away all rights, you still retain the moral right to be identified as an author of a piece of work.
In the age of git and open source projects all this is simplified by the fact that a simple git blame can show who contributed what and when.

In our specific case now. Needless to say we have every reason to welcome contributors and to show appreciation for their work, that's what Open Source is all about. Adding contributors by name in each and every file is a no-go, we might end up with dozens, very ugly. Also what will happen if in a future commit we factor away some contributors' code? We cannot be making rounds in committed code to see who is still a contributor to each file. So we can do the following:

Add a file CONTRIBUTORS. We put "Copyright (C) 2011-2015 GRNET S.A. and individual contributors" to those files with external contributions after they have signed the CLA. We add the name of external contributors to the CONTRIBUTORS file.

Note that the choice of Google in naming files is very poor, it has authors and contributors the wrong way round.

vkoukis · 2015-04-01T10:21:21Z

OK, Panos, thank you. I will sign the CLA, send it to you, and remake the pull request incorporating the agreed changes.

vkoukis added 7 commits March 28, 2015 12:42

Change "entrance point" to "entry point"

7a63bcc

Factor out checking for root privileges

6300743

Factor out checking for root privileges into a separate utility function. Also ensure "snf-image-creator --help" runs successfully without root privileges.

Port dialog_main.py to argparse, improve help text

41a054c

Port dialog_main.py to use argparse instead of optparse. Add explicit positional argument, improve help text for every argument, add description and epilog blocks to output of --help.

Replace all instances of media' with medium'

d412b4d

Media is the plural of medium. This program only supports a single medium at a time, a raw hard disk, or a disk image file, so `medium' is appropriate.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make various fixes throughout the code, use separate logging using Python logging #47

Make various fixes throughout the code, use separate logging using Python logging #47

vkoukis commented Mar 30, 2015

louridas commented Mar 31, 2015

vkoukis commented Mar 31, 2015

louridas commented Mar 31, 2015

louridas commented Mar 31, 2015

vkoukis commented Mar 31, 2015

louridas commented Mar 31, 2015

vkoukis commented Mar 31, 2015

louridas commented Mar 31, 2015

vkoukis commented Mar 31, 2015

louridas commented Mar 31, 2015

vkoukis commented Apr 1, 2015

Make various fixes throughout the code, use separate logging using Python logging #47

Are you sure you want to change the base?

Make various fixes throughout the code, use separate logging using Python logging #47

Conversation

vkoukis commented Mar 30, 2015

louridas commented Mar 31, 2015

vkoukis commented Mar 31, 2015

louridas commented Mar 31, 2015

louridas commented Mar 31, 2015

vkoukis commented Mar 31, 2015

louridas commented Mar 31, 2015

vkoukis commented Mar 31, 2015

louridas commented Mar 31, 2015

vkoukis commented Mar 31, 2015

louridas commented Mar 31, 2015

vkoukis commented Apr 1, 2015